ضرایب mfcc

Speaker recognition using pattern recognition neural network and feedforward neural network

2017

Neha Chauhan

Neha Chauhan Birla Institute of Technology, Mesra, Ranchi Abstract— Speaker Recognition is the computing task of validating a user’s claimed identity using speech characteristics. Main objective of speech recognition system is to communication with a device through our voice. Mel frequency Cepstral Coefficient (MFCC) features are combined with pitch and root mean square values and tested for im...

متن کامل

Music Genre Classification Using Convolutional Neural Network2014.10.21.docx

2014

Qiuqiang Kong Xiaohui Feng Yanxiong Li

Feature extraction is a crucial part of many MIR tasks. Many manual-selected features such as MFCC have been applied to music processing but they are not effective for music genre classification. In this work, we present an algorithm based on spectrogram and convolutional neural network (CNN). Compared with MFCC, the spectrogram contains more details of music components such as pitch, flux, etc...

متن کامل

Exploiting frequency-scaling invariance properties of the scale transform for automatic speech recognition

2000

S. Umesh Richard C. Rose Sarangarajan Parthasarathy

An experimental study of the application of scale-transform to improve the performance of speaker independent continuous speech recognition, is presented in this paper. Three major results are described. First, a comparison was made between the scale-transform based magnitude cepstrum coeÆcients (STCC) and mel-scale lter bank cepstrum coeÆcients (MFCC) on a telephone based connected digit recog...

متن کامل

Feature extraction from analytic phase of speech signals for speaker verification

2014

Karthika Vijayan Vinay Kumar K. Sri Rama Murty

The objective of this work is to study the speaker-specific nature of analytic phase of speech signals. Since computation of analytic phase suffers from phase wrapping problem, we have used its derivativethe instantaneous frequency for feature extraction. The cepstral coefficients extracted from smoothed subband instantaneous frequencies (IFCC) are used as features for speaker verification. The...

متن کامل

Feature Extraction and Selection in Speech Emotion Recognition

2012

Yixiong Pan Peipei Shen Liping Shen

Speech Emotion Recognition (SER) is a hot research topic in the field of Human Computer Interaction (HCI). In this paper, we recognize three emotional states: happy, sad and neutral. The explored features include: energy, pitch, linear predictive spectrum coding (LPCC), Mel-frequency spectrum coefficients (MFCC), and Mel-energy spectrum dynamic coefficients (MEDC). A German Corpus (Berlin Datab...

متن کامل

Comparison of Parameterization Methods in Recognizing Spoken Arabic Digits

2013

Ali Ganoun

This paper proposes evaluation of sound parameterization methods in recognizing some spoken Arabic words, namely digits from zero to nine. Each isolated spoken word is represented by a single template based on a specific recognition feature, and the recognition is based on the Euclidean distance from those templates. The performance analysis of recognition is based on four parameterization feat...

متن کامل

Speaker Recognition using MFCC and Improved Weighted Vector Quantization Algorithm

2015

C. Sunitha E. Chandra

Speaker recognition is one of the most essential tasks in the signal processing which identifies a person from characteristics of voices . In this paper we accomplish speaker recognition using Mel-frequency Cepstral Coefficient (MFCC) with Weighted Vector Quantization algorithm. By using MFCC, the feature extraction process is carried out. It is one of the nonlinear cepstral coefficient functio...

متن کامل

Identification of Sex of the Speaker With Reference To Bodo Vowels: A Comparative Experimental Study

2016

B. Sarma P. H. Talukdar

This work presents an application of Fundamental Frequency (Pitch), Linear Predictive Cepstral Coefficient (LPCC) and Mel Frequency Cepstral Coefficient (MFCC) in identification of sex of the speaker in speech recognition research. The aim of this article is to compare the performance of these three methods for identification of sex of the speakers. A successful speech recognition system can he...

متن کامل

Pitch maxima for robust speaker recognition

2003

S. Krishnakumar K. R. Prasanna Kumar N. Balakrishnan

This paper presents a novel approach to the design of a robust speaker recognition system. A noise-free synthesised spectrum is produced from a noisy spectrum. This synthesised spectrum is used for feature extraction. From noisy speech, the pitch is extracted using arobust pitch estimation algorithm. This also helps in identifying the voiced segments of speech which are the only ones considered...

متن کامل

A Comparative Study for Identification of Sex of the Speaker With Reference to Bodo Vowels

2011

M. K. Deka

This work presents an application of Fundamental Frequency (Pitch), Linear Predictive Cepstral Coefficient (LPCC) and Mel Frequency Cepstral Coefficient (MFCC) in identification of sex of the speaker in speech recognition research. The aim of this article is to compare the performance of these three methods for identification of sex of the speakers. A successful speech recognition system can he...

متن کامل