Mel-scaled Wavelet Filter Base Unvoiced Phoneme Re
نویسنده
چکیده
In this paper we propose a filter bank structure derived by using admissible wavelet packet transform. These filters have Mel scale spacing and have an advantage of easy implementation with higher resolution in time-frequency domain because of wavelet transform. The features are obtained by first calculating the energy in each filter band and then applying the Discrete Cosine Transform (DCT) to the energy vector. We evaluate the recognition performance of the features derived from the MelScaled Wavelet Filter (MSWF) bank structure and compare it with that derived from Mel Frequency Cepstral Coefficients (MFCC). Experimental results on the phoneme recognition from the TIMIT database show that, features derived by using MSWF performs better as compared to MFCC features for unvoiced stops and unvoiced fricatives. Further the noise performance of these features are also found to be better as compared to MFCC features.
منابع مشابه
Designing a Speaker-discrim Filter Bank for Speake
A new filter bank approach for speaker recognition front-end is proposed. The conventional mel-scaled filter bank is replaced with a speaker-discriminative filter bank. Filter bank is selected from a library in adaptive basis, based on the broad phoneme class of the input frame. Each phoneme class is associated with its own filter bank. Each filter bank is designed in a way that emphasizes disc...
متن کاملGenetic Optimization of Cepstrum Filterbank for Phoneme Classification
Some of the most commonly used speech representations, such as mel-frequency cepstral coefficients, incorporate biologically inspired characteristics into artificial systems. Recent advances have been introduced modifying the shape and distribution of the traditional perceptually scaled filterbank, commonly used for feature extraction. Some alternatives to the classic mel scaled filterbank have...
متن کاملDesigning a speaker-discriminative adaptive filter bank for speaker recognition
A new filter bank approach for speaker recognition front-end is proposed. The conventional mel-scaled filter bank is replaced with a speaker-discriminative filter bank. Filter bank is selected from a library in adaptive basis, based on the broad phoneme class of the input frame. Each phoneme class is associated with its own filter bank. Each filter bank is designed in a way that emphasizes disc...
متن کاملThe use of wavelet transforms in phoneme recognition
This study investigates the usefulness of wavelet transforms in phoneme recognition. Both discrete wavelet transforms (DWT) and sampled continuous wavelet transforms (SCWT) are tested. The wavelet transform is used as a part of the front-end processor which extracts feature vectors for a speakerindependent HMM-based phoneme recognizer. The results are evaluated on a portion of TIMIT corpus cons...
متن کاملVoiced/Unvoiced and Silent Classification Using HMM Classifier based on Wavelet Packets BTE features
Wavelet Packets Best Tree Encoded (BTE) features is used here as base features for HMM classifier. The research aimed to introduce the newly designed features that are discussed in [1]. The considered problem is Voiced, Unvoiced and Silent classification. Comparison to the 19 filter banks features is provided. Although it is simple and straight forward, BTE makes comparable results to the 19 el...
متن کامل