A corpus-based approach to diphthong analysis of standard Slovenian
نویسندگان
چکیده
This paper presents an inventory and relative frequency estimation of glides on the 527,190 word-form Standard Slovenian lexicon. Detailed acoustic-phonetic measurements for the first four most frequent glides /ai/, /au/, /ou/, and /ei/ in stressed syllables are given. Inspection of their formant trajectory plots enabeled measurements of the first four formants in the onset and offset steady-states. Normalized duration patterns for the onset steadystate, glide and offset steady-state are also given. Results represent a broader view to the recently published JIPA paper [4] and are an initial step towards the decision on the most appropriate allophonic symbols to be used in narrow transcription for the glides of Standard Slovenian.
منابع مشابه
Polder dutch: aspects of the /ei/-lowering in standard dutch
This paper is an initial report on the systematic analysis of changes within the vowel system of Standard Dutch. The work focuses on the recent lowering of the diphthong /Ei/, known as ‘Polder Dutch’ (Poldernederlands). The purpose was to find an automatizable method to reliably analyze and compare speakers of a large corpus of Dutch spontaneous speech. Diphthong variants of twelve native speak...
متن کاملروشی جدید جهت استخراج موجودیتهای اسمی در عربی کلاسیک
In Natural Language Processing (NLP) studies, developing resources and tools makes a contribution to extension and effectiveness of researches in each language. In recent years, Arabic Named Entity Recognition (ANER) has been considered by NLP researchers due to a significant impact on improving other NLP tasks such as Machine translation, Information retrieval, question answering, query result...
متن کاملContents and evaluation of the first Slovenian-German online dictionary
This paper presents the first SlovenianGerman and German-Slovenian online dictionary and contains evaluation figures for its Slovenian part. Evaluations are based on coverage of a Slovenian newspaper corpus as well as on user queries.
متن کاملLocus equations determination using the speechdat(II)
This paper presents a corpus-based approach to determination of locus equations for Slovenian language. The SpeechDat(II) spoken language database is analyzed first for all available target VCV contexts in order to yield candidate subsets for the acoustic-phonetic measurements. Only the VCVs embedded within judiciously chosen carrier utterances are then selected for the (F2 vowel, F2 onset) mea...
متن کاملAutomatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation
Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...
متن کامل