Multi-path Syllable Models Based on Phonetic Knowledge
Authors
Abstract
Recent research suggests that syllable-length acoustic models might be more appropriate for pronunciation variation modelling than the context-dependent phones that conventional automatic speech recognisers use. In this paper, we compare the recognition performance of two types of recognisers: a conventional recogniser that only uses triphones, and an experimental recogniser that employs a mix of context-independent syllable models for a set of frequent syllables and sequences of triphones for the less frequent ones. The syllable models of the mixed-model recogniser are designed to consist of multiple HMM paths that are expected to capture major pronunciation variants. These paths are initialised using phonetic knowledge and re-estimated using a data-driven solution. When applied to 94 frequent syllables in a 37-hour corpus of Dutch read speech, the multi-path mixed-model recogniser outperforms a much more complex triphone recogniser.
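The multi-path topology described in the abstract can be pictured as several left-to-right HMM paths running in parallel between a shared entry state and a shared exit state, one path per pronunciation variant. The following is a minimal sketch under stated assumptions, not the authors' implementation: the three states per phone, the initial self-loop probability, the equal path priors, the function name `build_multipath_topology`, and the example variants are all hypothetical choices made for illustration. In practice the initial values would then be re-estimated from data.

```python
import numpy as np

STATES_PER_PHONE = 3   # assumption: three emitting states per phone
SELF_LOOP = 0.6        # assumption: initial self-transition probability


def build_multipath_topology(variants):
    """Return (trans, paths) for a syllable model with one path per variant.

    State 0 is a non-emitting entry state and the last state is a
    non-emitting exit state; all paths run in parallel between them.
    """
    path_lengths = [len(v) * STATES_PER_PHONE for v in variants]
    n_states = 2 + sum(path_lengths)
    exit_state = n_states - 1
    trans = np.zeros((n_states, n_states))
    paths = []
    first = 1
    for length in path_lengths:
        states = list(range(first, first + length))
        paths.append(states)
        # The entry state branches to each path with equal probability.
        trans[0, states[0]] = 1.0 / len(variants)
        for i, s in enumerate(states):
            # Strict left-to-right: stay in the state or move one step on.
            nxt = states[i + 1] if i + 1 < length else exit_state
            trans[s, s] = SELF_LOOP
            trans[s, nxt] = 1.0 - SELF_LOOP
        first += length
    return trans, paths


# Hypothetical pronunciation variants of one syllable (not from the paper).
variants = [["h", "E", "t"], ["@", "t"]]
trans, paths = build_multipath_topology(variants)
```

Each path here has a length tied to the number of phones in its variant, so a reduced pronunciation gets a shorter path than the canonical one. The exit state has no outgoing transitions in this sketch; a real recogniser would connect it to the entry state of the next model.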
Similar resources
Construction and analysis of multiple paths in syllable models
In this paper, we construct multi-path syllable models using phonetic knowledge for initialising the parallel paths, and a data-driven solution for their re-estimation. We hypothesise that the richer topology of multi-path syllable models would be better at accounting for pronunciation variation than context-dependent phone models that can only account for the effects of left and right neighbou...
Pronunciation variant-based multi-path HMMs for syllables
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with context-dependent phones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we propose a novel approach for constructing multi-path models...
Whither Linguistic Interpretation of Acoustic Pronunciation Variation
Recent research suggests that modelling pronunciation variation is more appropriate at the syllable level than at the level of context-dependent phones. Due to the large number of factors affecting syllable pronunciation, the creation of multi-path topologies is necessary. Previous research on multi-path models in connected digit recognition has proved trajectory clustering to be an attractive...
Modelling pronunciation variation with single-path and multi-path syllable models: Issues to consider
In this paper, we construct context-independent single-path and multi-path syllable models aimed at improved pronunciation variation modelling. We use phonetic transcriptions to define the topologies of the syllable models and to initialise the model parameters, and the Baum-Welch algorithm for the re-estimation of the model parameters. We hypothesise that the richer topology of multi-path syll...
Speech Recognition using Phonetically Featured Syllables
Speech can be naturally described by phonetic features, such as a set of acoustic phonetic features or a set of articulatory features. This thesis establishes the effectiveness of using phonetic features in phoneme recognition by comparing a recogniser based on them to a recogniser using an established parametrisation as a baseline. The usefulness of phonetic features serves as the foundation f...
Journal title:
Volume / issue:
Pages: -
Publication date: 2005