Modeling Frequent Allophones in Jap
Authors
Abstract
In this paper, we describe a technique for modeling frequent allophones in Japanese speech recognition. The consonant-vowel (CV) syllable structure is dominant in Japanese, and the frequency distribution of CV pairs is rather skewed. Isolating the most frequent allophones by adding dedicated phonemes to the acoustic model can improve recognition accuracy. By introducing ten new phonemes for the five most common CV pairs, we achieved a 30% relative reduction in word error rate for spontaneous speech and a 6% relative reduction overall across all speech categories in a Japanese broadcast news transcription task.
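The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the vowel set, the toy transcriptions, the `c+v` / `c-v` naming of the new phonemes, and all function names are assumptions made for illustration. The sketch counts CV pairs in phoneme-level transcriptions, picks the most frequent ones, and rewrites each occurrence with two dedicated phoneme symbols (one for the consonant, one for the vowel), which is how five pairs would yield ten new phonemes. A helper shows how a relative error reduction is computed.

```python
from collections import Counter

# Assumed five-vowel inventory; a real phone set would also mark long vowels etc.
VOWELS = {"a", "i", "u", "e", "o"}


def count_cv_pairs(transcriptions):
    """Count adjacent (consonant, vowel) pairs over phoneme sequences."""
    counts = Counter()
    for phones in transcriptions:
        for c, v in zip(phones, phones[1:]):
            if c not in VOWELS and v in VOWELS:
                counts[(c, v)] += 1
    return counts


def split_frequent_pairs(phones, frequent):
    """Replace each frequent CV pair with two dedicated phoneme symbols.

    The names "c+v" (consonant before v) and "c-v" (vowel after c) are
    hypothetical labels, not the symbols used in the paper.
    """
    out = []
    i = 0
    while i < len(phones):
        if i + 1 < len(phones) and (phones[i], phones[i + 1]) in frequent:
            c, v = phones[i], phones[i + 1]
            out.append(f"{c}+{v}")
            out.append(f"{c}-{v}")
            i += 2
        else:
            out.append(phones[i])
            i += 1
    return out


def relative_reduction(baseline_wer, new_wer):
    """Relative error reduction, e.g. 0.30 for a 30% relative reduction."""
    return (baseline_wer - new_wer) / baseline_wer


# Toy transcriptions, invented for this example only.
transcriptions = [
    ["k", "a", "t", "a"],
    ["t", "a", "k", "o"],
    ["n", "a", "k", "a"],
]
counts = count_cv_pairs(transcriptions)
frequent = {pair for pair, _ in counts.most_common(2)}
rewritten = [split_frequent_pairs(p, frequent) for p in transcriptions]
```

In a real system the rewritten transcriptions would feed acoustic model training, so the frequent pairs get their own models instead of sharing parameters with rarer contexts; how many pairs to split is a tuning choice (the abstract reports five).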
Similar resources
Allophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
Study of the most frequent natural tooth colors in the Spanish population using spectrophotometry
PURPOSE To identify the most frequent natural tooth colors using the Easyshade Compact (Vita Zahnfabrik) spectrophotometer on a sample of the Spanish population according to the 3D Master System. MATERIALS AND METHODS The middle third of the facial surface of natural maxillary central incisors was measured with an Easyshade Compact spectrophotometer (Vita Zahnfabrik) in 1361 Caucasian Spanis...
Synthesized Fricative ch Specific Features and Influence on Speech Quality Analysis
One of the main problems in speech synthesis is the synthesis of unvoiced fricatives. One of our previously stated conclusions is that the consonant x is influenced by the preceding and following phonetic elements. The aim of the experiments described in this paper is to evaluate the influence of different x allophones on speech intelligibility and automatic speech recognition. In this paper the formal system, which ...
Modeling and recognition of phonetic and prosodic factors for improvements to acoustic speech recognition models
This paper examines the usefulness of including prosodic and phonetic context information in the phoneme model of a speech recognizer. This is done by creating a series of prosodic and phonetic models and then comparing the mutual information between the observations and each possible context variable. Prosodic variables show improvement less often than phone context variables, however, prosodi...
A MetaPhoneme inventory
This paper focuses on the sharing of phonological information in a multilingual inheritance-based lexicon. It explores the possibility of establishing a phoneme inventory for a group of languages in which language-specific phonemes function as "allophones" of newly defined meta-phonemes. Danish, Dutch, English, and German were taken as a test bed and their vowel phoneme inventories were studied...