A Combinatorial Approach to the Phonetic Similarity of Languages
نویسندگان
چکیده
résumé – Une approche combinatoire de la ressemblance phonétique entre langues En exploitant une représentation géométrique des phonèmes vocaliques, nous réalisons un modèle bidimensionnel dans lequel des voyelles sont des points et les distances entre ces points expriment des différences auditives. Ceci nous permettra de décrire le système vocalique d’une langue du point de vue d’une autre langue au moyen d’une partition d’un ensemble fini dont les propriétés combinatoires peuvent être explorées. Le concept de base que nous utilisons est celui du diagramme de Voronoï, qui a été largement utilisé dans d’autres domaines. Dans le cas présent, nous mettons en évidence quelques particularités combinatoires de partitions d’entiers qui décrivent des dissemblances entre des inventaires vocaliques de différentes langues et classons les relations possibles entre inventaires par le biais des graphes orientés appropriés et, en particulier, parmi les diagrammes de Voronoï adéquats. Nous appliquons cette théorie à quelques langues réelles, en recherchant des améliorations pouvant faciliter la compréhension d’un inventaire vocalique par un auditeur dont la catégorisation auditive est différente. Enfin, nous décrivons des inventaires privilégiés, facilement compréhensibles dans beaucoup de langues en même temps.
منابع مشابه
مقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملA perceptual phonetic similarity space for languages: Evidence from five native language listener groups
The goal of the present study was to devise a means of representing languages in a perceptual similarity space based on their overall phonetic similarity. In Experiment 1, native English listeners performed a free classification task in which they grouped 17 diverse languages based on their perceived phonetic similarity. A similarity matrix of the grouping patterns was then submitted to cluster...
متن کاملCross-language Phonetic Similarity Measure on Terms Appeared in Asian Languages
This study aims to develop a phonetic similarity measurement method across Asian languages. The method, cross-language similarity algorithm aggregates the transcription of language-specific Romanization, the International Phonetic Alphabet, the Soundex algorithm, and Levenshtein distance. To evaluate the proposed algorithm, this study involves an experiment using ninety-two chemical element nam...
متن کاملA Knowledge-Rich Approach to Measuring the Similarity between Bulgarian and Russian Words
We propose a novel knowledge-rich approach to measuring the similarity between a pair of words. The algorithm is tailored to Bulgarian and Russian and takes into account the orthographic and the phonetic correspondences between the two Slavic languages: it combines lemmatization, hand-crafted transformation rules, and weighted Levenshtein distance. The experimental results show an 11-pt interpo...
متن کاملEvaluation Of Several Phonetic Similarity Algorithms On The Task Of Cognate Identification
We investigate the problem of measuring phonetic similarity, focusing on the identification of cognates, words of the same origin in different languages. We compare representatives of two principal approaches to computing phonetic similarity: manually-designed metrics, and learning algorithms. In particular, we consider a stochastic transducer, a Pair HMM, several DBN models, and two constructe...
متن کامل