Robustness of Phoneme Classification Using Generative Classifiers: Comparison of the Acoustic Waveform and Plp Representations
نویسندگان
چکیده
The robustness of classification of isolated phoneme segments using generative classifiers is investigated for the acoustic waveform and PLP speech representations. Probabilistic PCA is used to fit a density to each phoneme class followed by maximum likelihood classification. The results show that although PLP performs best in quiet conditions, as the SNR decreases below 0dB acoustic waveforms have a lower classification error. This is the case even though the waveform classifier is trained explicitly only on quiet data and is then modified by a simple transformation to account for the noise, whereas for PLP separate classifiers are trained for each noise condition. Even at −18dB SNR, multiclass performance of classification from waveforms is still significantly better than chance level. In addition the effect of time-alignment is tested and initial solution shown.
منابع مشابه
Robustness of Phoneme Classification Using Support Vector Machines: a Comparison between Plp and Acoustic Waveform Representations
Robustness of phoneme recognition to additive noise is investigated for PLP and acoustic waveform representations of speech using support vector machines (SVMs) combined via error-correcting code methods. While recognition in the PLP domain attains superb accuracy on clean data, it is significantly affected by mismatch between training and testing noise levels. The classification in the high-di...
متن کاملPhoneme Classification in Linear Feature Domains
Phoneme classification is investigated for linear feature domains with the aim of improving robustness to additive noise. In linear feature domains noise adaptation is exact, potentially leading to more accurate classification than representations involving non-linear processing and dimensionality reduction. A generative framework is developed for isolated phoneme classification using linear fe...
متن کاملSpeech Recognition Front End Without Information Loss
Phoneme classification is investigated for linear feature domains with the aim of improving robustness to additive noise. In linear feature domains noise adaptation is exact, potentially leading to more accurate classification than representations involving non-linear processing and dimensionality reduction. A generative framework is developed for isolated phoneme classification using linear fe...
متن کاملTuning support vector machines for robust phoneme classification with acoustic waveforms
This work focuses on the robustness of phoneme classification to additive noise in the acoustic waveform domain using support vector machines (SVMs). We address the issue of designing kernels for acoustic waveforms which imitate the state-ofthe-art representations such as PLP and MFCC and are tuned to the physical properties of speech. For comparison, classification results in the PLP represent...
متن کاملImproving the Robustness of Phoneme Classification Using Hybrid Features
This work is concerned with improving the robustness of phoneme classification to additive noise with hybrid features using support vector machines (SVMs). In particular, the cepstral features are combined with local energy features of acoustic waveform segments to form a hybrid representation. The local energy features are taken into account separately in the SVM kernel, and a simple subtracti...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007