HMM composition of segmental unit input HMM for noisy speech recognition

نویسندگان

Kazumasa Yamamoto

Seiichi Nakagawa

چکیده

For robust speech recognition in noisy environments, various methods have been studied. In this paper, we apply parallel model combination (PMC) for segmental unit input HMM to recognize corrupted speech in additive noise. Since several successive frames are combined and treated as an input vector in segmental unit input modeling, the increased dimension of vector degrades the precision in estimating covariance matrices. Therefore Karhunen-Loeve expansion or LDA is used to reduce the dimension. Thus the inverse transformation of segmental statistics to cepstral domain is needed and correlations between frames have to be taken into account. We expanded the original PMC to segmental unit input HMM. Experimental results showed PMC for segmental unit input HMM proposed here gives better recognition performance than the original PMC.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hidden Markov models merging acoustic and articulatory information to automatic speech recognition

This paper describes a new scheme for robust speech recognition systems where visual information and acoustic features are merged. Using as robust unit the « pseudo-diphone », we compare a global Hidden Markov Model (HMM) and a Master/Slave HMM through a centisecond preprocessing and through a segmental one. We confirm by experimentation the importance of articulatory features in clean and nois...

متن کامل

Improved HMM Separation for Distant-Talking Speech Recognition

In distant-talking speech recognition, the recognition accuracy is seriously degraded by reverberation and environmental noise. A robust speech recognition technique in such environments, HMM separation and composition, has been described in [1]. HMM separation estimates the model parameters of the acoustic transfer function using adaptation data uttered from an unknown position in noisy and re...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Feature Transformation Based on Generalization of Linear Discriminant Analysis

Hidden Markov models (HMMs) have been widely used to model speech signals for speech recognition. However, they cannot precisely model the time dependency of feature parameters. In order to overcome this limitation, several researchers have proposed extensions, such as segmental unit input HMM (Nakagawa & Yamamoto, 1996). Segmental unit input HMM has been widely used for its effectiveness and t...

متن کامل

Noise and room acoustics distorted speech recognition by HMM composition

This paper presents a robust speech recognition method based on the HMM composition for the noisy room acoustics distorted speech. The method realizes an improved user interface such as the user is not encumbered by microphone equipments. The proposed HMM composition is obtained by naturally extending the HMM composition method of an additive noise to that of the convolutional room acoustics di...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1999

HMM composition of segmental unit input HMM for noisy speech recognition

نویسندگان

چکیده

منابع مشابه

Hidden Markov models merging acoustic and articulatory information to automatic speech recognition

Improved HMM Separation for Distant-Talking Speech Recognition

Speech enhancement based on hidden Markov model using sparse code shrinkage

Feature Transformation Based on Generalization of Linear Discriminant Analysis

Noise and room acoustics distorted speech recognition by HMM composition

عنوان ژورنال:

اشتراک گذاری