Optimizing Features Extraction Parameters for Speaker Verification

Authors

  • DONATO IMPEDOVO
  • MARIO REFICE
Abstract

This paper investigates the role of frame length in the computation of Mel Frequency Cepstral Coefficients (MFCCs) in a text-dependent speaker verification system. Variations in subjects' vocal characteristics over time, and the related information conveyed in the MFCCs, significantly degrade verification performance. In our experiments we tested different frame lengths for feature extraction in the training and verification phases, for a set of speakers whose speech productions spanned approximately 3 months. Results show that a suitable combination of frame lengths for the training and testing phases can improve performance: the approach reduces the ER by up to 40% for female speakers and by up to 58% for the male subset.

Key-Words: Speaker Verification, Text Dependent, Mismatch, Frame Length, CD-HMM
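To make the idea of using different frame lengths in training and verification concrete, the following minimal sketch splits the same signal into overlapping frames with two different frame lengths, which is the step that precedes MFCC computation. It is an illustration only, not the authors' implementation: the function name `frame_signal`, the 8 kHz sample rate, the 25 ms and 20 ms frame lengths, and the 10 ms hop are all assumed values that do not come from the abstract.

```python
import math


def frame_signal(samples, sample_rate, frame_ms, hop_ms):
    """Split samples into overlapping frames (hypothetical helper).

    frame_ms is the frame length and hop_ms the frame shift, both in
    milliseconds. Trailing samples that do not fill a frame are dropped.
    """
    frame_len = int(round(sample_rate * frame_ms / 1000))
    hop_len = int(round(sample_rate * hop_ms / 1000))
    if frame_len <= 0 or hop_len <= 0:
        raise ValueError("frame and hop lengths must be positive")
    frames = []
    start = 0
    while start + frame_len <= len(samples):
        frames.append(samples[start:start + frame_len])
        start += hop_len
    return frames


# Assumed test signal: one second of a 440 Hz tone sampled at 8 kHz.
sample_rate = 8000
signal = [math.sin(2 * math.pi * 440 * n / sample_rate)
          for n in range(sample_rate)]

# Assumed settings: a longer frame for training, a shorter one for verification.
train_frames = frame_signal(signal, sample_rate, frame_ms=25, hop_ms=10)
test_frames = frame_signal(signal, sample_rate, frame_ms=20, hop_ms=10)

print(len(train_frames), len(train_frames[0]))
print(len(test_frames), len(test_frames[0]))
```

In a full system each frame would then be windowed and converted to MFCCs, so changing `frame_ms` between the two phases changes the spectral resolution of the features that the speaker model is trained on and tested against.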

Similar resources

Comparative Analysis of Speech Parameters for the Design of Speaker Verification Systems

Speaker verification systems are basically composed of three stages: feature extraction, feature processing, and comparison of the modified features from the speaker's voice and from the voice to be verified. Many features have been used in the first stage, although the current literature has not yet identified the best of them. Based on the biometrics hypothesis, which states that each indivi...

Optimizing feature complementarity by evolution strategy: Application to automatic speaker verification

Conventional automatic speaker verification systems are based on cepstral features such as Mel-scale Frequency Cepstrum Coefficients (MFCC) or Linear Predictive Cepstrum Coefficients (LPCC). Recently published works showed that the use of complementary features can significantly improve system performance. In this paper, we propose to use an evolution strategy to optimize the complementarity of ...

Time-Frequency Representation of Vocal Source Signal for Speaker Verification

We propose an effective feature extraction technique for obtaining essential time-frequency information from the linear prediction (LP) residual signal, which is closely related to the glottal vibration of the individual speaker. With pitch synchronous analysis, wavelet transform is applied to every two pitch cycles of the LP residual signal to generate a new feature vector, called Wavelet Based F...

Cascading appearance-based features for visual speaker verification

The cascading appearance-based (CAB) feature extraction technique has established itself as the state of the art in extracting dynamic visual speech features for speech recognition. In this paper, we will focus on investigating the effectiveness of this technique for the related speaker verification application. By investigating the speaker verification ability of each stage of the cascade we w...

Limited Data Speaker Verification: Fusion of Features

The present work presents an experimental evaluation of speaker verification for different speech feature extraction techniques under the constraint of limited data (less than 15 seconds). State-of-the-art speaker verification techniques provide good performance for sufficient data (greater than 1 minute). It is a challenging task to develop techniques which perform well for speaker verif...

Publication date: 2008