mfcc

Speech recognition of mandarin syllables using both linear predict coding cepstra and Mel frequency cepstra

2007

Tze Fen Li Shui-Ching Chang

This paper is to compare two most common features representing a speech word for speech recognition on the basis of accuracy, computation time, complexity and cost. The two features to represent a speech word are the linear predict coding cepstra (LPCC) and the Mel-frequency cepstrum coefficient (MFCC). The MFCC was shown to be more accurate than the LPCC in speech recognition using the dynamic...

متن کامل

Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures.

Journal: :The Journal of the Acoustical Society of America 2008

Jonathan Darch Ben Milner Saeed Vaseghi

The aim of this work is to develop methods that enable acoustic speech features to be predicted from mel-frequency cepstral coefficient (MFCC) vectors as may be encountered in distributed speech recognition architectures. The work begins with a detailed analysis of the multiple correlation between acoustic speech features and MFCC vectors. This confirms the existence of correlation, which is fo...

متن کامل

Combining Gaussian Mixture Models and Segmental Feature Models for Speaker Recognition

2017

Milana Milosevic Ulrike Glavitsch

In most speaker recognition systems speech utterances are not constrained in content or language. In a text-dependent speaker recognition system lexical content of speech and language are known in advance. The goal of this paper is to show that this information can be used by a segmental features (SF) approach to improve a standard Gaussian mixture model with MFCC features (GMM-MFCC). Speech fe...

متن کامل

Native Language Identification Using Spectral and Source-Based Features

2016

Avni Rajpal Tanvina B. Patel Hardik B. Sailor Maulik C. Madhavi Hemant A. Patil Hiroya Fujisaki

The task of native language (L1) identification from nonnative language (L2) can be thought of as the task of identifying the common traits that each group of L1 speakers maintains while speaking L2 irrespective of the dialect or region. Under the assumption that speakers are L1 proficient, non-native cues in terms of segmental and prosodic aspects are investigated in our work. In this paper, w...

متن کامل

Automated Music Success Prediction

2007

Joshua Teitelbaum Niyant Krishnamurthi Sébastien Beaudet

We investigate the uses and limitations of MFCC analysis for feature extraction from music files in the domain of genre recognition. Intra-genre and Inter-genre classification is explored. We implement a method of genre classification based on MFCC extraction, K-means clustering, and KNN analysis. We demonstrate the efficacy of our method through testing, yielding a 99% accuracy rate.

متن کامل

System for Fusion of Face and Speech Modalities Using DTCWT+QFT and MFCC+RASTA Techniques

Journal: :Indian journal of science and technology 2021

Objectives: The main objective is to propose a multimodal biometric system by forming fusion of Face and Speech modalities using DTCWT+QFT techniques for face MFCC+RASTA Techniques recognitions. experimental results are compared with existing works analysed the performance counterparts. Methods: proposed model, make use DTCWT QFT extract features images perform both. MFCC RASTA implemented spee...

متن کامل

Speaker Recognition Improvement for Degraded Human Voice using Modified-MFCC with GMM

Journal: :International Journal of Advanced Computer Science and Applications 2023

Speaker’s audio is one of the unique identities speaker. Nowadays not only humans but machines can also identify by their audio. Machines different properties human voice and classify speaker from speaker’s Speaker recognition still challenging with degraded limited dataset. be identified effectively when feature extraction more accurate. Mel-Frequency Cepstral Coefficient (MFCC) mostly used me...

متن کامل

Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition.

Journal: :The Journal of the Acoustical Society of America 2004

Mark D Skowronski John G Harris

Mel frequency cepstral coefficients (MFCC) are the most widely used speech features in automatic speech recognition systems, primarily because the coefficients fit well with the assumptions used in hidden Markov models and because of the superior noise robustness of MFCC over alternative feature sets such as linear prediction-based coefficients. The authors have recently introduced human factor...

متن کامل

Vector Quantization Approach for Speaker Recognition using MFCC and Inverted MFCC

2011

Satyanand Singh E. G. Rajan

Front-end or feature extractor is the first component in an automatic speaker recognition system. Feature extraction transforms the raw speech signal into a compact but effective representation that is more stable and discriminative than the original signal. Since the front-end is the first component in the chain, the quality of the later components (speaker modeling and pattern matching) is st...

متن کامل

Fully quantum mechanical energy optimization for protein-ligand structure

Journal: :Journal of computational chemistry 2004

Yun Xiang Da W. Zhang John Z. H. Zhang

We present a quantum mechanical approach to study protein-ligand binding structure with application to a Adipocyte lipid-binding protein complexed with Propanoic Acid. The present approach employs a recently develop molecular fractionation with a conjugate caps (MFCC) method to compute protein-ligand interaction energy and performs energy optimization using the quasi-Newton method. The MFCC met...

متن کامل