mel frequency cepstral coefficients mfcc

نتایج جستجو برای: mel frequency cepstral coefficients mfcc

تعداد نتایج: 584588 فیلتر نتایج به سال:

Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features

2018

This chapter discusses the use of vocal tract information for recognizing the emotions. Linear prediction cepstral coefficients (LPCC) and mel frequency cepstral coefficients (MFCC) are used as the correlates of vocal tract information. In addition to LPCCs and MFCCs, formant related features are also explored in this work for recognizing emotions from speech. Extraction of the above mentioned ...

متن کامل

Comparison of Parameterization Methods in Recognizing Spoken Arabic Digits

2013

Ali Ganoun

This paper proposes evaluation of sound parameterization methods in recognizing some spoken Arabic words, namely digits from zero to nine. Each isolated spoken word is represented by a single template based on a specific recognition feature, and the recognition is based on the Euclidean distance from those templates. The performance analysis of recognition is based on four parameterization feat...

متن کامل

Mel Frequency Cepstral Coefficients for Music Modeling

2000

Beth Logan

We examine in some detail Mel Frequency Cepstral Coefficients (MFCCs) the dominant features used for speech recognition and investigate their applicability to modeling music. In particular, we examine two of the main assumptions of the process of forming MFCCs: the use of the Mel frequency scale to model the spectra; and the use of the Discrete Cosine Transform (DCT) to decorrelate the Mel-spec...

متن کامل

Voice Recognition and Marking Using Mel-frequency Cepstral Coefficients

Journal: :Sensors and Materials 2020

متن کامل

Using Mel-Frequency Cepstral Coefficients in Missing Data Technique

Journal: :EURASIP Journal on Advances in Signal Processing 2004

متن کامل

Acoustic-phonetic Feature Based Dialect Identification in Hindi Speech

2015

Shweta Sinha Aruna Jain

Every individual has some unique speaking style and this variation influences their speech characteristics. Speakers’ native dialect is one of the major factors influencing their speech characteristics that influence the performance of automatic speech recognition system (ASR). In this paper, we describe a method to identify Hindi dialects and examine the contribution of different acoustic-phon...

متن کامل

A model of dynamic auditory perception and its application to robust word recognition

Journal: :IEEE Trans. Speech and Audio Processing 1997

Brian Strope Abeer Alwan

This paper describes two mechanisms that augment the common automatic speech recognition (ASR) front end and provide adaptation and isolation of local spectral peaks. A dynamic model consisting of a linear filterbank with a novel additive logarithmic adaptation stage after each filter output is proposed. An extensive series of perceptual forward masking experiments, together with previously rep...

متن کامل

Speaker Recognition Improvement for Degraded Human Voice using Modified-MFCC with GMM

Journal: :International Journal of Advanced Computer Science and Applications 2023

Speaker’s audio is one of the unique identities speaker. Nowadays not only humans but machines can also identify by their audio. Machines different properties human voice and classify speaker from speaker’s Speaker recognition still challenging with degraded limited dataset. be identified effectively when feature extraction more accurate. Mel-Frequency Cepstral Coefficient (MFCC) mostly used me...

متن کامل

Language Accent Detection with CNN Using Sparse Data from a Crowd-Sourced Speech Archive

Journal: :Mathematics 2022

The problem of accent recognition has received a lot attention with the development Automatic Speech Recognition (ASR) systems. crux is that conventional acoustic language models adapted to fit standard corpora are unable satisfy requirements for accented speech. In this research, we contribute task group up nine European accents in English and try provide some evidence favor specific hyperpara...

متن کامل

Emotion Recognition and Evaluation of Mandarin Speech Using Weighted D-KNN Classification

2005

Tsang-Long Pao Yu-Te Chen Jun-Heng Yeh Yuan-Hao Chang

In this paper, we proposed a weighted discrete K-nearest neighbor (weighted D-KNN) classification algorithm for detecting and evaluating emotion from Mandarin speech. In the experiments of the emotion recognition, Mandarin emotional speech database used contains five basic emotions, including anger, happiness, sadness, boredom and neutral, and the extracted acoustic features are Mel-Frequency C...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید