mel frequency cepstral coefficients mfcc

نتایج جستجو برای: mel frequency cepstral coefficients mfcc

تعداد نتایج: 584588 فیلتر نتایج به سال:

Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing

2006

Sourabh Ravindran David V. Anderson Malcolm Slaney

In this paper we study the noise-robustness of mel-frequency cepstral coefficients (MFCCs) and explore ways to improve their performance in noisy conditions. Improvements based on a more accurate model of the early auditory system are suggested to make the MFCC features more robust to noise while preserving their class discrimination ability. Speech versus non-speech classification and speech r...

متن کامل

Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System

2007

Valiantsina Hubeika Igor Szöke Lukás Burget Jan Cernocký

Gender and age estimation based on Gaussian Mixture Models (GMM) is introduced. Telephone recordings from the Czech SpeechDatEast database are used as training and test data set. Mel-Frequency Cepstral Coefficients (MFCC) are extracted from the speech recordings. To estimate the GMMs’ parameters Maximum Likelihood (ML) training is applied. Consequently these estimations are used as the baseline...

متن کامل

Determining the Euclidean Distance Between Two Steady State Sounds

2006

Hiroko Terasawa Malcolm Slaney Jonathan Berger

We describe a perceptual space for timbre, define an objective metric that takes into account perceptual orthogonality and measure the quality of timbre interpolation. We discuss two timbre representations and using these two representations, measure perceived relationships between pairs of sounds on a equivalent range of timbre variety. We determine that a timbre space based on Mel-frequency c...

متن کامل

Robust speech recognition using a voiced-unvoiced feature

2002

András Zolnay Ralf Schlüter Hermann Ney

In this paper, a voiced-unvoiced measure is used as acoustic feature for continuous speech recognition. The voiced-unvoiced measure was combined with the standard Mel Frequency Cepstral Coefficients (MFCC) using linear discriminant analysis (LDA) to choose the most relevant features. Experiments were performed on the SieTill (German digit strings recorded over telephone line) and on the SPINE (...

متن کامل

What is in the Dynamic Features: Analysis of the Derivatives of Log-Mel-Spectra

2003

Britta Wrede

The present investigation analyses the behaviour of the first order derivatives of the log-mel-spectrum of vowels which constitutes the basis for the mel-frequency cepstral coefficients (MFCC). The results indicate that the dynamic features when inspected at log-mel-spectra level seem to be less influenced by speaker specific characteristics and degrade less in fast speech. However, when analys...

متن کامل

Effect of aging on speech features and phoneme recognition: a study on Bengali voicing vowels

Journal: :I. J. Speech Technology 2013

Biswajit Das Sandipan Mandal Pabitra Mitra Anupam Basu

The article studies age related variations of speech characteristics of two age groups, in the Bengali language. The study considers 60 speakers in the each age groups, 60– 80 years and 20–40 years, respectively. We have considered different voice source features like fundamental frequency, formant frequencies, jitter, shimmer and harmonic to noise ratio. Cepstral domain feature, Mel Frequency ...

متن کامل

Improved Text-Independent Speaker Identification using Fused MFCC and IMFCC Feature Sets based on Gaussian Filter

2007

Sandipan Chakroborty Goutam Saha

A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system has been used as a standard acoustic feature set for speech related applications. On a recent contribution by authors...

متن کامل

SUT System Description for Anti-Spoofing 2017 Challenge

2017

Mohammad Adiban Hossein Sameti Nooshin Maghsoodi Sajjad Shahsavari

Reliability of Automatic Speaker Verification (ASV) systems has always been a concern in dealing with spoofing attacks. Among these attacks, replay attack is the simplest and the easiest accessible method. This paper describes a replay spoofing detection system applied to ASVspoof2017 corpus. To reach this goal, features such as Constant-Q Cepstral Coefficients (CQCC), Modified Group Delay (MGD...

متن کامل

Analisis Bentuk Pola Suara Menggunakan Ekstraksi Ciri Mel-Frequencey Cepstral Coefficients (MFCC)

Journal: :CogITo Smart Journal 2019

متن کامل

Audio feature extraction: Foreground and Background audio separation using KNN algorithm

Journal: :International Journal of Science and Research Archive 2023

Data Science is a fairly novel field, and it predominantly deals with analysis assortment of data. Machine Learning field that goes hand in this regard. Various Algorithms, which are trained on dataset predict results based their training, thus the accuracy model determined by testing dataset. Foreground feature extraction another interesting application. Using data visualization processing, we...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید