3. Training of the Hybrid Rbf-hmm System

نویسندگان

  • B. H. Juang
  • S. Katagiri
  • R. Lippmann
  • E. Singer
  • R. Lippmann E. Singer
چکیده

Table 1: Phoneme recognition rates for diierent hybrid RBF-HMM systems; all values in %. In line two the results for a NN with one additional hidden layer of 100 sigmoidal units are depicted. The recognition rates for both training procedures are inferior to the net without hidden layer. This is mainly contributed to the relatively small hidden layer with 100 nodes, which acts like a bottleneck in computing the state probabilities. The information from 400 RBF a-posteriori probabilities is compressed in 100 hidden node scores and expanded to calculate the 169 state probabilities. The number of trainable parameters (weights) for this system is about 57,000 and for the baseline system without hidden layer about 68,000. The enhanced transformation capability from the hidden layer was of no additional use for this task. In a third experiment contextual information from the adjacent feature vectors is used for the calculation of state probabilities in the net. The output nodes refer to the a-posteriori probabilities of the RBF nodes from 3 frames. This expands the hidden layer to 1200 normalized scores and results in about 203,000 trainable parameters. Since the computing of the delta-loudness incorporates the processing of ve frames for every feature vector, the usage of the 3 frame contextual window results in acoustic information from 70ms in total. The phoneme recognition results for this RBF net are printed in line three of table 1. 62.5% of the phonemes in the test data are correctly classiied after the MSE optimization and 63.8% after the MCE-GPD training. The incorporation of contextual information in the estimation of a-posterioris leads to some improvements in performance. The minimum error (MCE) training in the second optimization phase is started after no more increase in recognition performance on the test data for the MSE training occurred. The optimization of the MCE objective function , which is more related to the classiier error rate than the MSE, leads to some additional improvements in performance. The best result (63.8%) for the RBF-HMM is achieved , exploiting contextual information. SCHMMs with the same number of prototypes and model structure were trained with MLE for comparison. The SCHMM phoneme recognition rate for the training data is 58.5% and 57.9% for the test data. The improvement of about 6% for the RBF-HMM is attributed to the discriminative structure based on Bayes probabilities, the discriminative learning techniques and the incorporation of contextual input. 5. CONCLUSIONS …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Hidden Markov Models Speech Recognition Using Radial Basis Function Networks

A high performance speaker-independent isolated-word hybrid speech recognizer was developed which combines Hidden Markov Models (HMMs) and Radial Basis Function (RBF) neural networks. In recognition experiments using a speaker-independent E-set database, the hybrid recognizer had an error rate of 11.5% compared to 15.7% for the robust unimodal Gaussian HMM recognizer upon which the hybrid syste...

متن کامل

Improved Hidden Markov Model Speech Recognition Using Radial Basis Function Networks

A high performance speaker-independent isolated-word hybrid speech recognizer was developed which combines Hidden Markov Models (HMMs) and Radial Basis Function (RBF) neural networks. In recognition experiments using a speaker-independent E-set database, the hybrid recognizer had an error rate of 11.5% compared to 15.7% for the robust unimodal Gaussian HMM recognizer upon which the hybrid syste...

متن کامل

A NN/HMM hybrid for continuous speech recognition with a discriminant nonlinear feature extraction

This paper deals with a hybrid NN/HMM architecture for continuous speech recognition. We present a novel approach to set up a neural linear or nonlinear feature transformation that is used as a preprocessor on top of the HMM system’s RBF-network to produce discriminative feature vectors that are well suited for being modeled by mixtures of Gaussian distributions. In order to omit the computatio...

متن کامل

A hybrid RBF-HMM system for continuous speech recognition

A hybrid system for continuous speech recognition con sisting of a neural network with Radial Basis Functions and Hidden Markov Models is described in this paper together with discriminant training techniques Initially the neural net is trained to approximate a posteriori probabilities of single HMM states These probabilities are used by the Viterbi algorithm to calculate the total scores for t...

متن کامل

A New Fast and Efficient HMM-Based Face Recognition System Using a 7-State HMM Along With SVD Coefficients

In this paper, a new Hidden Markov Model (HMM)-based face recognition system is proposed. As a novel point despite of five-state HMM used in pervious researches, we used 7-state HMM to cover more details. Indeed we add two new face regions, eyebrows and chin, to the model. As another novel point, we used a small number of quantized Singular Values Decomposition (SVD) coefficients as feature...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995