Asynchronous-transition Hmm for Acoustic Modeling
نویسندگان
چکیده
We propose a new class of hidden Markov model (HMM) which we call Asynchronous-Transition HMM (AT-HMM) to model asynchronous temporal structure of acoustic feature sequences. Conventional HMM models a sequence of feature vectors, while temporally changing patterns of acoustic features do not necessarily synchronize with each other. In this paper, AT-HMMs with and without sequential constraints are discussed. Algorithms for generating context-dependent AT-HMM and for deriving sequentially constrained AT-HMM are provided. A new concept of “state tying across time” is also introduced. Speaker-dependent speech recognition experiments demonstrated error reduction rates of more than 30% and 40% in phoneme and isolated word recognition, respectively, compared with conventional HMMs.
منابع مشابه
Asynchronous-transition HMM
We propose a new class of hidden Markov model (HMM) called asynchronous-transition HMM (AT-HMM). Opposed to conventional HMMs where hidden state transition occurs simultaneously to all features, the new class of HMM allows state transitions asynchronized between individual features to better model asynchronous timings of acoustic feature changes. In this paper, we focus on a particular class of...
متن کاملMaximum entropy direct model as a unified model for acoustic modeling in speech recognition
Traditional statistical models for speech recognition have been dominated by generative models such as Hidden Markov Models (HMMs). We recently proposed a new framework for speech recognition using maximum entropy direct modeling, where the probability of a state or word sequence given an observation sequence is computed directly from the model. In contrast to HMMs, features can be non-independ...
متن کاملBernoulli versus Markov: Investigation of state transition regime in switching-state acoustic models
In this paper, a new acoustic model called time-inhomogeneous hidden Bernoulli model (TI-HBM) is introduced as an alternative to hidden Markov model (HMM) in continuous speech recognition. Contrary to HMM, the state transition process in TI-HBM is not a Markov process, rather it is an independent (generalized Bernoulli) process. This difference leads to elimination of dynamic programming at sta...
متن کاملAdvanced Acoustic Modeling with the Hybrid HMM/BN Framework
Most of the current state-of-the-art speech recognition systems are based on HMMs which usually use mixture of Gaussian functions as state probability distribution model. It is a common practice to use EM algorithm for Gaussian mixture parameter learning. In this case, the learning is done in a ”blind”, data-driven way without taking into account how the speech signal has been produced and whic...
متن کاملImplicit Trajectory Modeling through Gaussian Transition Models for Speech Recognition
It is well known that frame independence assumption is a fundamental limitation of current HMM based speech recognition systems. By treating each speech frame independently, HMMs fail to capture trajectory information in the acoustic signal. This paper introduces Gaussian Transition Models (GTM) to model trajectories implicitly. Comparing to alternative approaches, such as segment modeling and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999