Multi-scale Fractal Dimension for Speaker Identification System
نویسندگان
چکیده
This paper looks at the extraction of nonlinear turbulence information of the speech signal using Multi-Scale Fractal Dimension (MFD) as the additional feature to the convectional MFCC features. The MFD is estimated using both Box-Counting and Minkowiski-Bouligand dimensions. The proposed framework uses this feature extraction together with sub-band based speaker modeling rather than the wide-band approach. Results showed that the proposed framework with Box-Counting feature extraction improves the performance of the classical wideband approach with up 10% identification rate. Key-Words: Multi-scale fractal dimension, Box-Counting, Minkowiski-Bouligand, speaker identification
منابع مشابه
Improving Speaker Identification Rate Using Fractals [IJCNN1005]
: This paper reports on a text-dependent speaker identification system that combines Mel-frequency cepstral coefficients with non-linear turbulence information extracted using Multi-Scale Fractal Dimension (MFD). The MFD is estimated using Box-Counting and Minkowiski-Bouligand dimension. The proposed framework is implemented in conjunction with sub-band based speaker identification system. Resu...
متن کاملSpeaker Recognition with Spectral Dimension Features of Human Voices for Personal Authentication
Biometric recognition is more and more important due to security applications all over the world. Mobile phone becomes popular in recent years. Therefore, voice recognition on mobile devices for recognizing a speaker’s identity plays a potential role. This paper presents a speaker recognition method which combines a non-linear feature, named spectral dimension (SD), with Mel Frequency Cepstral ...
متن کاملFractal dimension applied to speaker identification
This paper reports the results obtained in a speaker identification system based in Bhattacharrya distance, which combines LPderived cepstral coefficients, with a nonlinear dynamic feature namely fractal dimension. The nonlinear dynamic analysis starts with the phase espace reconstruction, and the fractal dimension of the correspondent attractor trajectory is estimated. This analysis is perform...
متن کاملDimension reduction for speaker identification based on mutual information
Dimension reduction is a necessary step for speech feature extraction in a speaker identification system. Discrete Cosine Transform (DCT) or Principal Component Analysis (PCA) is widely used for dimension reduction. By choosing basis vectors from basis vector pool of DCT or PCA which contribute more to data distribution variance or reconstruction accuracy of speech data set, we can transform th...
متن کاملFractal and Multi-Scale Fractal Dimension analysis: a comparative study of Bouligand-Minkowski method
Shape is one of the most important visual attributes used to characterize objects, playing a important role in pattern recognition. There are various approaches to extract relevant information of a shape. An approach widely used in shape analysis is the complexity, and Fractal Dimension and Multi-Scale Fractal Dimension are both well-known methodologies to estimate it. This papers presents a co...
متن کامل