mfcc

Speech recognition using reconstructed phase space features

2003

Andrew C. Lindgren, Michael T. Johnson, Richard J. Povinelli

This paper presents a novel method for speech recognition by utilizing nonlinear/chaotic signal processing techniques to extract time-domain based phase space features. By exploiting the theoretical results derived in nonlinear dynamics, a processing space called a reconstructed phase space can be generated where a salient model (the natural distribution of the attractor) can be extracted for s...

Skew Gaussian Mixture Models for Speaker Recognition

2011

Avi Matza

The current paper proposes skew Gaussian mixture models for speaker recognition and an associated algorithm for its training from experimental data. Speaker identification experiments were conducted, in which speakers were modeled using the familiar Gaussian mixture models (GMM), and the new skewGMM. Each model type was evaluated using two sets of feature vectors, the mel-frequency cepstral coe...

Articulatory motivated acoustic features for speech recognition

2005

Daniil Kocharov, András Zolnay, Ralf Schlüter, Hermann Ney

In this paper, we consider the use of multiple acoustic features of the speech signal for continuous speech recognition. A novel articulatory motivated acoustic feature is introduced, namely the spectrum derivative feature. The new feature is tested in combination with the standard Mel Frequency Cepstral Coefficients (MFCC) and the voicedness features. Linear Discriminant Analysis is applied to...

Efficient Speech Recognition System for Isolated Digits

2013

Santosh V. Chapaneri, Deepak J. Jayaswal

In this paper, an efficient speech recognition system is proposed for speaker-independent isolated digits (0 to 9). Using the Weighted MFCC (WMFCC), low computational overhead is achieved since only 13 weighted MFCC coefficients are used. In order to capture the trends of the extracted features, the local and global features are computed using the Improved Features for Dynamic Time Warping (IFD...

Multitaper MFCC and PLP features for speaker verification using i-vectors

Journal: Speech Communication, 2013

Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy

In this paper we study the performance of the low-variance multi-taper Mel-frequency cepstral coefficient (MFCC) and perceptual linear prediction (PLP) features in a state-of-the-art i-vector speaker verification system. The MFCC and PLP features are usually computed from a Hamming-windowed periodogram spectrum estimate. Such a single-tapered spectrum estimate has large variance, which can be red...

Effectiveness of fundamental frequency (F0) and strength of excitation (SOE) for spoofed speech detection

2016

Tanvina B. Patel, Hemant A. Patil

Current countermeasures used in spoof detectors (for speech synthesis (SS) and voice conversion (VC)) are generally phase-based (as vocoders in SS and VC systems lack phase information). These approaches may possibly fail for non-vocoder or unit-selection-based spoofs. In this work, we explore excitation source-based features, i.e., fundamental frequency (F0) contour and strength of excitation (S...

Regularized MVDR spectrum estimation-based robust feature extractors for speech recognition

2013

Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy

In this paper, we present two robust feature extractors that use a regularized minimum variance distortionless response (RMVDR) spectrum estimator instead of the discrete Fourier transform-based direct spectrum estimator, used in many front-ends including the conventional MFCC, for estimating the speech power spectrum. Direct spectrum estimators, e.g., single tapered periodogram, have high vari...

Fractionation of peptide with disulfide bond for quantum mechanical calculation of interaction energy with molecules.

Journal: The Journal of Chemical Physics, 2004

X. H. Chen, D. W. Zhang, J. Z. H. Zhang

We present a computational study of a recently developed molecular fractionation with conjugated caps (MFCC) method for application to peptide/protein that has disulfide bonds. Specifically, we employ the MFCC approach to generate peptide fragments in which a disulfide bond is cut and a pair of conjugated caps are inserted. The method is tested on two peptides interacting with a water molecule....

Early MFCC and HPCP Fusion for Robust Cover Song Identification

2017

Christopher J. Tralie

While most schemes for automatic cover song identification have focused on note-based features such as HPCP and chord profiles, a few recent papers surprisingly showed that local self-similarities of MFCC-based features also have classification power for this task. Since MFCC and HPCP capture complementary information, we design an unsupervised algorithm that combines normalized, beat-synchronou...

Relative phase information for detecting human speech and spoofed speech

2015

Longbiao Wang, Yohei Yoshida, Yuta Kawakami, Seiichi Nakagawa

The detection of human and spoofed (synthetic/converted) speech has started to receive more attention. In this study, relative phase information extracted from a Fourier spectrum is used to detect human and spoofed speech. Because original/natural phase information is almost entirely lost in spoofed speech using current synthesis/conversion techniques, a modified group delay based feature, the ...
