Improving Speaker Identification Rate Using Fractals [IJCNN1005]

Authors

  • Fulufhelo V. Nelwamondo
  • Unathi Mahola
  • Tshilidzi Marwala
Abstract

This paper reports on a text-dependent speaker identification system that combines Mel-frequency cepstral coefficients with non-linear turbulence information extracted using the Multi-Scale Fractal Dimension (MFD). The MFD is estimated using the Box-Counting and Minkowski-Bouligand dimensions. The proposed framework is implemented in conjunction with a sub-band based speaker identification system. Results show that the proposed framework with Box-Counting feature extraction improves the identification rate of the classical wideband approach by up to 10%. It is further observed that the proposed framework gives an improved Bhattacharyya distance between the impostors' and speakers' speech distributions.
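As a rough illustration of the Box-Counting idea mentioned in the abstract, the sketch below estimates a single box-counting dimension for the graph of a sampled waveform. It is not the authors' implementation: the function name `box_counting_dimension`, the `grid_sizes` parameter, and the choice to fit one global slope (rather than a multi-scale profile over speech frames) are assumptions made for this example.

```python
import math


def box_counting_dimension(signal, grid_sizes=(4, 8, 16, 32, 64)):
    """Estimate the box-counting dimension of a sampled waveform's graph.

    Hypothetical helper: for each k, the unit square is split into a
    k-by-k grid and the boxes touched by the curve are counted. The
    dimension is the slope of log(count) against log(k).
    """
    n = len(signal)
    lo, hi = min(signal), max(signal)
    span = hi - lo
    # Normalise amplitude to [0, 1]; a flat signal maps to all zeros.
    y = [(v - lo) / span if span > 0 else 0.0 for v in signal]

    log_k, log_count = [], []
    for k in grid_sizes:
        # Lowest and highest occupied row in each of the k time columns.
        row_min = [None] * k
        row_max = [None] * k
        for i, v in enumerate(y):
            col = min(i * k // n, k - 1)
            row = min(int(v * k), k - 1)
            if row_min[col] is None:
                row_min[col] = row_max[col] = row
            else:
                row_min[col] = min(row_min[col], row)
                row_max[col] = max(row_max[col], row)
        # The curve is treated as covering every row between min and max.
        count = sum(
            top - bottom + 1
            for bottom, top in zip(row_min, row_max)
            if bottom is not None
        )
        log_k.append(math.log(k))
        log_count.append(math.log(count))

    # Ordinary least-squares slope of log(count) versus log(k).
    m = len(log_k)
    mean_x = sum(log_k) / m
    mean_y = sum(log_count) / m
    num = sum((a - mean_x) * (b - mean_y) for a, b in zip(log_k, log_count))
    den = sum((a - mean_x) ** 2 for a in log_k)
    return num / den
```

Under these assumptions, a smooth ramp should give a value close to 1, while a noisy, irregular waveform should give a value closer to 2. That contrast is the kind of "turbulence" information a fractal feature is meant to capture.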

Similar resources

Inter-Task System Fusion for Speaker Recognition

Fusion is a common approach to improving the performance of speaker recognition systems. Multiple systems using different data, features or algorithms tend to bring complementary contributions to the final decisions being made. It is known that factors such as native language or accent contribute to speaker identity. In this paper, we explore inter-task fusion approaches to incorporating side i...

A method of speaker identification based on phoneme mean F-ratio contribution

This paper proposes a new method for speaker identification, which is based on the non-uniformly distributed speaker information in frequency bands. In order to discard the linguistic information effectively, in this study, we adopt an improved Fisher’s F-ratio called the phoneme mean F-ratio to measure the dependences between frequency components and individual characteristics. Then we adopt an a...

Text Independent Speaker Identification Using Automatic Acoustic Segmentation

This paper describes an acoustic class dependent technique for text independent speaker identification on very short utterances. The technique is based on maximum likelihood estimation of a Gaussian mixture model representation of speaker identity. Gaussian mixtures are noted for their robustness as a parametric model and their ability to form smooth estimates of rather arbitrary underlying den...

Human-like ears versus two-microphone array, which works better for speaker identification?

In this paper we try to answer, with justifications, the question posed in the title. For this purpose we have used a piece of speech recording hardware: an acoustic artificial head, which accurately imitates the human head, shoulders, and outer ears. It offers an excellent level of realism and clarity in audio recording. Special speech corpora are prepared under different noise conditions using the artificial h...

Improving Speaker Identification Performance Under the Shouted Talking Condition Using the Second-Order Hidden Markov Models

Speaker identification systems perform well under the neutral talking condition; however, they suffer sharp degradation under the shouted talking condition. In this paper, the second-order hidden Markov models (HMM2s) have been used to improve the recognition performance of isolated-word text-dependent speaker identification systems under the shouted talking condition. Our results show that HMM...


Publication date: 2006