Acoustic phonetic modeling using local codebook features
نویسندگان
چکیده
In this article we present an alternative method for defining the question set used for the induction of acoustic phonetic decision trees. The method is data driven and employs local similarities between the probability density functions of hidden Markov models. The method is shown to work at least as well as the standard method using question sets devised by human experts.
منابع مشابه
Local Codebook Features for Mono- and Multilingual Acoustic Phonetic Modelling
In this article we present an alternative method for defining the question set used for the induction of acoustic phonetic decision trees. The method is data driven and employs local similarities between the probability density functions of hidden Markov models. We apply the method to monoand multilingual acoustic phonetic modelling, showing that comparable results to the standard method, using...
متن کاملAcoustic Phonetic Modelling using Local Codebook Features
In this article we present an alternative method for defining the question set used for the induction of acoustic phonetic decision trees. The method is data driven and employs local similarities between the probability density functions of hidden Markov models. The method is shown to work at least as well as the standard method using question sets devised by human experts.
متن کاملContinuous local codebook features for multi- and cross-lingual acoustic phonetic modelling
In this paper we present a method for defining the question set for the induction of acoustic phonetic decision trees. The method is data driven resulting in a continuous feature space in contrast to the usual categorical one. We apply the features to a multilingual speech recognition task, outperforming consistently the standard method using IPA-based characteristics. An extension to cross-lin...
متن کاملCodebook Based Face Point Trajectory Synthesis Algo - rithm Using Speech
This paper presents a novel algorithm which generates three-dimensional face point trajectories for a given speech le with or without its text. The proposed algorithm rst employs an oo-line training phase. In this phase, recorded face point trajectories along with their speech data and phonetic labels are used to generate phonetic codebooks. These codebooks consist of both acoustic and visual f...
متن کامل3-D Face Point Trajectory Synthesis Using An Automatically Derived Visual Phoneme Similarity Matrix
This paper presents a novel algorithm which generates three-dimensional face point trajectories for a given speech le with or without its text. The proposed algorithm rst employs an o -line training phase. In this phase, recorded face point trajectories along with their speech data and phonetic labels are used to generate phonetic codebooks. These codebooks consist of both acoustic and visual f...
متن کامل