نتایج جستجو برای: mel frequency cepstral coefficients mfcc

تعداد نتایج: 584588  

2006
Sourabh Ravindran David V. Anderson Malcolm Slaney

In this paper we study the noise-robustness of mel-frequency cepstral coefficients (MFCCs) and explore ways to improve their performance in noisy conditions. Improvements based on a more accurate model of the early auditory system are suggested to make the MFCC features more robust to noise while preserving their class discrimination ability. Speech versus non-speech classification and speech r...

2007
Valiantsina Hubeika Igor Szöke Lukás Burget Jan Cernocký

Gender and age estimation based on Gaussian Mixture Models (GMM) is introduced. Telephone recordings from the Czech SpeechDatEast database are used as training and test data set. Mel-Frequency Cepstral Coefficients (MFCC) are extracted from the speech recordings. To estimate the GMMs’ parameters Maximum Likelihood (ML) training is applied. Consequently these estimations are used as the baseline...

2006
Hiroko Terasawa Malcolm Slaney Jonathan Berger

We describe a perceptual space for timbre, define an objective metric that takes into account perceptual orthogonality and measure the quality of timbre interpolation. We discuss two timbre representations and using these two representations, measure perceived relationships between pairs of sounds on a equivalent range of timbre variety. We determine that a timbre space based on Mel-frequency c...

2002
András Zolnay Ralf Schlüter Hermann Ney

In this paper, a voiced-unvoiced measure is used as acoustic feature for continuous speech recognition. The voiced-unvoiced measure was combined with the standard Mel Frequency Cepstral Coefficients (MFCC) using linear discriminant analysis (LDA) to choose the most relevant features. Experiments were performed on the SieTill (German digit strings recorded over telephone line) and on the SPINE (...

2003
Britta Wrede

The present investigation analyses the behaviour of the first order derivatives of the log-mel-spectrum of vowels which constitutes the basis for the mel-frequency cepstral coefficients (MFCC). The results indicate that the dynamic features when inspected at log-mel-spectra level seem to be less influenced by speaker specific characteristics and degrade less in fast speech. However, when analys...

Journal: :I. J. Speech Technology 2013
Biswajit Das Sandipan Mandal Pabitra Mitra Anupam Basu

The article studies age related variations of speech characteristics of two age groups, in the Bengali language. The study considers 60 speakers in the each age groups, 60– 80 years and 20–40 years, respectively. We have considered different voice source features like fundamental frequency, formant frequencies, jitter, shimmer and harmonic to noise ratio. Cepstral domain feature, Mel Frequency ...

2007
Sandipan Chakroborty Goutam Saha

A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system has been used as a standard acoustic feature set for speech related applications. On a recent contribution by authors...

2017
Mohammad Adiban Hossein Sameti Nooshin Maghsoodi Sajjad Shahsavari

Reliability of Automatic Speaker Verification (ASV) systems has always been a concern in dealing with spoofing attacks. Among these attacks, replay attack is the simplest and the easiest accessible method. This paper describes a replay spoofing detection system applied to ASVspoof2017 corpus. To reach this goal, features such as Constant-Q Cepstral Coefficients (CQCC), Modified Group Delay (MGD...

Journal: :International Journal of Science and Research Archive 2023

Data Science is a fairly novel field, and it predominantly deals with analysis assortment of data. Machine Learning field that goes hand in this regard. Various Algorithms, which are trained on dataset predict results based their training, thus the accuracy model determined by testing dataset. Foreground feature extraction another interesting application. Using data visualization processing, we...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید