آنالیز mfcc

Speaker Identification Based on Log Area Ratio and Gaussian Mixture Models in Narrow-Band Speech: Speech Understanding / Interaction

2004

David Chow Waleed H. Abdulla

Log area ratio coefficients (LAR) derived from linear prediction coefficients (LPC) is a well known feature extraction technique used in speech applications. This paper presents a novel way to use the LAR feature in a speaker identification system. Here, instead of using the mel frequency cepstral coefficients (MFCC), the LAR feature is used in a Gaussian mixture model (GMM) based speaker ident...

متن کامل

Effect of the Front-End Processing on Speaker Verification Performance Using PCA and Scores Level Fusion

2013

Nassim Asbai Messaoud Bengherabi Farid Harizi Abderrahmane Amrouche

This paper evaluates the impact of low-level features on speaker verification performance, with an emphasis on the recently proposed MFCC variant based on asymmetric tapers (MFCC asymmetric from now on) standalone as features or followed by PCA as linear projection technique applied before the GMM-UBM back-end classifier in clean and noisy environments. The performances of the MFCC-asymmetric f...

متن کامل

Parameters for Speaker Identi cation

2001

Nick J.-C. Wang Wei-Ho Tsai Lin-Shan Lee

Eigen-MLLR coe cients are proposed as new feature parameters for speaker-identi cation in this paper. By performing principle component analysis on MLLR parameters among training speakers, the eigen-MLLR coe cients (EMCs) are derived as the coe cients for the eigenvectors. The discriminating function of the new EMC features based on the Fisher criterion is found to be ten times larger than that...

متن کامل

Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition

Journal: :CoRR 2013

Md. Ali Hossain Md. Mijanur Rahman Uzzal Kumar Prodhan Md. Farukuzzaman Khan

This paper is concerned with the development of Back-propagation Neural Network for Bangla Speech Recognition. In this paper, ten bangla digits were recorded from ten speakers and have been recognized. The features of these speech digits were extracted by the method of Mel Frequency Cepstral Coefficient (MFCC) analysis. The mfcc features of five speakers were used to train the network with Back...

متن کامل

University of the Basque Country + Ikerlan System for NIST 2009 Language Recognition Evaluation

2009

M. Penagarikano A. Varona M. Zamalloa L. J. Rodriguez G. Bordel J. P. Uribe

This paper briefly describes the language recognition system developed by the Sofware Technology Working Group (http://gtts.ehu.es) at the University of the Basque Country in collaboration with IKERLAN Technological Research Center, and submitted to the NIST 2009 Language Recognition Evaluation. The system consists of a hierarchical fusion of individual subsystems: two acoustic GLDS-SVM systems...

متن کامل

An Approach to Extract Feature using MFCC

Journal: :IOSR Journal of Engineering 2014

متن کامل

A Robust Front-End Processor combining Mel Frequency Cepstral Coefficient and Sub-band Spectral Centroid Histogram methods for Automatic Speech Recognition

2009

R. Thangarajan A. M. Natarajan

Environmental robustness is an important area of research in speech recognition. Mismatch between trained speech models and actual speech to be recognized is due to factors like background noise. It can cause severe degradation in the accuracy of recognizers which are based on commonly used features like mel-frequency cepstral co-efficient (MFCC) and linear predictive coding (LPC). It is well u...

متن کامل

Automatic Genre Classification Using Fractional Fourier Transform Based Mel Frequency Cepstral Coefficient and Timbral Features

2017

Daulappa Guranna BHALKE Betsy RAJESH Dattatraya Shankar BORMANE

This paper presents the Automatic Genre Classification of Indian Tamil Music andWestern Music using Timbral and Fractional Fourier Transform (FrFT) based Mel Frequency Cepstral Coefficient (MFCC) features. The classifier model for the proposed system has been built using K-NN (K-Nearest Neighbours) and Support Vector Machine (SVM). In this work, the performance of various features extracted fro...

متن کامل

Speech Emotion Recognition Using Residual Phase and MFCC Features

2013

N. J. Nalini S. Palanivel M. Balasubramanian

Abstract--The main objective of this research is to develop a speech emotion recognition system using residual phase and MFCC features with autoassociative neural network (AANN). The speech emotion recognition system classifies the speech emotion into predefined categories such as anger, fear, happy, neutral or sad. The proposed technique for speech emotion recognition (SER) has two phases : Fe...

متن کامل

Effect of Time Derivatives of MFCC Features on HMM Based Speech Recognition System

2013

Sanghamitra V. Arora

In this paper, improvement of an ASR system for Hindi language, based on Vector quantized MFCC as feature vectors and HMM as classifier, is discussed. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is to create delta and delta-delta coefficients and append them to MFCC to create feature vector. This paper focuses on all digits in Hindi (Ze...

متن کامل