A Study on the Effect of Pitch on LPCC and PLPC Features for Children's ASR in Comparison to MFCC

نویسندگان

  • Shweta Ghai
  • Rohit Sinha
چکیده

In this work, following our previous studies, we study and quantify the effect of pitch on LPCC and PLPC features and explore their efficacy for children’s mismatched ASR in comparison to MFCC. Our analysis shows that, unlike MFCC, LPCC feature has no major influence of pitch variations. On the other hand, similar to MFCC, though PLPC is also found to be significantly effected by pitch variations but comparatively to a lesser extent. However, after explicit pitch normalization of children’s speech, MFCC is found to result in the best children’s speech recognition performance on adults’ speech trained models in comparison to LPCC and PLPC features.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Speaker Recognition using LPCC and MFCC

A person's voice contains various parameters that convey information such as emotion, gender, attitude, health and identity. This report talks about speaker recognition which deals with the subject of identifying a person based on their unique voiceprint present in their speech data. Pre-processing of the speech signal is performed before voice feature extraction. This process ensures the voice...

متن کامل

Improving the Robustness of LPCC Feature Against Impulsive Noise by Applying the FOP Method

Performance of an automatic speech recognition (ASR) system tends to be dramatically degraded in the presence of impulsive noise. In the previous work [1], we proposed flooring the observation probability (FOP) to compensate the adverse effect of impulsive noise on sensitive dimensions of Mel-frequency cepstral coefficient (MFCC) features. Linear prediction cepstral coefficient (LPCC) is anothe...

متن کامل

Feature Extraction and Selection in Speech Emotion Recognition

Speech Emotion Recognition (SER) is a hot research topic in the field of Human Computer Interaction (HCI). In this paper, we recognize three emotional states: happy, sad and neutral. The explored features include: energy, pitch, linear predictive spectrum coding (LPCC), Mel-frequency spectrum coefficients (MFCC), and Mel-energy spectrum dynamic coefficients (MEDC). A German Corpus (Berlin Datab...

متن کامل

A Comparative Study for Identification of Sex of the Speaker With Reference to Bodo Vowels

This work presents an application of Fundamental Frequency (Pitch), Linear Predictive Cepstral Coefficient (LPCC) and Mel Frequency Cepstral Coefficient (MFCC) in identification of sex of the speaker in speech recognition research. The aim of this article is to compare the performance of these three methods for identification of sex of the speakers. A successful speech recognition system can he...

متن کامل

Identification of Sex of the Speaker With Reference To Bodo Vowels: A Comparative Experimental Study

This work presents an application of Fundamental Frequency (Pitch), Linear Predictive Cepstral Coefficient (LPCC) and Mel Frequency Cepstral Coefficient (MFCC) in identification of sex of the speaker in speech recognition research. The aim of this article is to compare the performance of these three methods for identification of sex of the speakers. A successful speech recognition system can he...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011