Phonemic Transcription of Low-Resource Tonal Languages
نویسندگان
چکیده
Transcription of speech is an important part of language documentation, and yet speech recognition technology has not been widely harnessed to aid linguists. We explore the use of a neural network architecture with the connectionist temporal classification loss function for phonemic and tonal transcription in a language documentation setting. In this framework, we explore jointly modelling phonemes and tones versus modelling them separately, and assess the importance of pitch information versus phonemic context for tonal prediction. Experiments on two tonal languages, Yongning Na and Eastern Chatino, show the changes in recognition performance as training data is scaled from 10 minutes to 150 minutes. We discuss the findings from incorporating this technology into the linguistic workflow for documenting Yongning Na, which show the method’s promise in improving efficiency, minimizing typographical errors, and maintaining the transcription’s faithfulness to the acoustic signal, while highlighting phonetic and phonemic facts for linguistic consideration.
منابع مشابه
An Acoustic Study of 'tonal Accent' in Creek
sionaries, native speakers, and ethnographers had previously been aware of the importance of suprasegmentals in the language, Haas was the first to accurately record Creek, the first to develop a phonemic transcription (Haas 1940; 1977b), and the first to provide rules for the placement of what she called "tonal accent." It is one measure of the complexity of the system that some 36 years separ...
متن کاملNative language shapes automatic neural processing of speech.
The development of the phoneme inventory is driven by the acoustic-phonetic properties of one's native language. Neural representation of speech is known to be shaped by language experience, as indexed by cortical responses, and recent studies suggest that subcortical processing also exhibits this attunement to native language. However, most work to date has focused on the differences between t...
متن کاملCross-language perception of non-native tonal contrasts: effects of native phonological and phonetic influences.
This study examined the perception of the four Mandarin lexical tones by Mandarin-naïve Hong Kong Cantonese, Japanese, and Canadian English listener groups. Their performance on an identification task, following a brief familiarization task, was analyzed in terms of tonal sensitivities (A-prime scores on correct identifications) and tonal errors (confusions). The A-prime results revealed that t...
متن کاملPhonetics of intonation in South African Bantu languages
Much is already known about the prosodic systems of the indigenous South African languages from descriptions and analyses in the existing literature. All of the existing work has been carried out in the field of African studies or formal linguistics. In order to be able to implement the generalisations obtained into computational models in speech processing, the existing sources and results mus...
متن کاملTone: Neurophonetics
The notion of speech prosody dates back to MonradKrohn's (1947) case study of a woman who was unable to produce the phonemic tone contrasts in her native Norwegian dialect, even though she retained considerable musical ability. Languages that exploit phonologically relevant variations in pitch a t the syllable level are called tone languages (for review of the phonetics of tone languages, see G...
متن کامل