Search results for: MFCC analysis

Number of results: 42970

2010
Frank Seide Pei Zhao

Missing Feature Theory (MFT), a powerful systematic framework for robust speech recognition, to date has not been optimally applied to linear-transform based features like MFCC or HLDA, which are necessary for state-of-the-art recognition accuracy, due to the intractable multivariate integral in bounded marginalization. This paper seeks to enable more optimal use of MFT with MFCC features/diago...

2009
Ben P. Milner Jonathan Darch Ibrahim Almajai

The aim of this work is to reconstruct clean speech solely from a stream of noise-contaminated MFCC vectors, as may be encountered in distributed speech recognition systems. Speech reconstruction is performed using the ETSI Aurora back-end speech reconstruction standard which requires MFCC vectors, fundamental frequency and voicing information. In this work, fundamental frequency and voicing ar...

2006
Babak Nasersharif Ahmad Akbari

The Mel-frequency cepstral coefficients (MFCC) are the most widely used and successful features for speech recognition, but their performance degrades in the presence of additive noise. In this paper, we propose a noise compensation method for Mel filter bank energies and hence for MFCC features. This compensation method includes two steps: Mel sub-band spectral subtraction and then compression of Mel-Sub-ba...
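
As a rough illustration of the first step named in this abstract, the sketch below applies plain spectral subtraction to Mel filter bank energies and then converts the result to cepstra with a log and an unnormalized DCT-II. It is not the authors' method: the noise estimate taken from the leading frames, the over-subtraction factor `alpha`, the floor `beta`, and both function names are assumptions made for this example, and the abstract's second step is truncated and therefore not reproduced.

```python
import numpy as np

def mel_subband_spectral_subtraction(mel_energies, noise_frames=5, alpha=1.0, beta=0.01):
    """Subtract a per-band noise estimate from Mel filter bank energies.

    mel_energies: array of shape (n_frames, n_bands), non-negative.
    Assumption: the first `noise_frames` frames contain noise only.
    """
    mel_energies = np.asarray(mel_energies, dtype=float)
    noise = mel_energies[:noise_frames].mean(axis=0)
    cleaned = mel_energies - alpha * noise
    # Floor at a fraction of the noisy energy so the later log stays defined.
    return np.maximum(cleaned, beta * mel_energies)

def energies_to_mfcc(mel_energies, n_ceps=13):
    """Log-compress Mel energies and apply an unnormalized DCT-II."""
    mel_energies = np.asarray(mel_energies, dtype=float)
    n_bands = mel_energies.shape[1]
    log_e = np.log(mel_energies + 1e-10)
    k = np.arange(n_ceps)[:, None]
    n = np.arange(n_bands)[None, :]
    dct = np.cos(np.pi * k * (2 * n + 1) / (2 * n_bands))
    return log_e @ dct.T

# Hypothetical input: 10 frames, 4 Mel bands, speech starting at frame 5.
mel = np.full((10, 4), 2.0)
mel[5:] += 3.0
compensated = mel_subband_spectral_subtraction(mel)
mfcc = energies_to_mfcc(compensated, n_ceps=3)
```

The floor keeps a small fraction of the noisy energy instead of clipping to zero, a common way to avoid taking the log of non-positive values after subtraction.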

2000
Kuo-Hwei Yuo Tai-Hwei Hwang Hsiao-Chuan Wang

This paper presents a method that combines the techniques of temporal trajectory filtering and projection measure for robust speaker identification. The proposed robust feature, called Relative Autocorrelation Sequence Mel-scale Frequency Cepstral Coefficients (RAS-MFCC), is derived based on filtering the temporal trajectories of short-time one-sided autocorrelation sequences. This filtering pr...

Journal: Modares Electrical Engineering
Ayuob Jafari (Islamic Azad University, Qazvin Branch), Farshad Almasganj (Amirkabir University of Technology), Maryam Nabi Bidhendi (Amirkabir University of Technology)

This paper presents a new method for increasing the accuracy of speech recognition systems by combining feature vectors obtained from nonlinear modeling of the reconstructed phase space of the speech signal with the conventional features obtained from frequency-domain analysis. According to the currently accepted theory, if a sufficient number of dimensions is chosen for reconstructing the phase space of the signal, this space fully represents the dynamics of the system that generated it, and therefore...

2005
Xu Shao

This thesis is concerned with reconstructing an intelligible time-domain speech signal from speech recognition features, such as Mel-frequency cepstral coefficients (MFCCs), in a distributed speech recognition (DSR) environment. The initial reconstruction methods in this thesis require, in addition to MFCC vectors, fundamental frequency and voicing information. In the later parts of the thesis t...

2017
Ahmed Kamil Hasan Al-Ali Bouchra Senadji Ganesh Naik

We propose a system to handle real environmental noise and channel mismatch in forensic speaker verification. The method suppresses various types of real environmental noise using the independent component analysis (ICA) algorithm. The enhanced speech signal is applied to mel frequency cepstral coefficients (MFCC) or MFCC feature warping to extract the essential characteristics o...

2001
Nick J.-C. Wang Wei-Ho Tsai Lin-Shan Lee

Eigen-MLLR coefficients are proposed as new feature parameters for speaker identification in this paper. By performing principal component analysis on MLLR parameters among training speakers, the eigen-MLLR coefficients (EMCs) are derived as the coefficients for the eigenvectors. The discriminating function of the new EMC features based on the Fisher criterion is found to be ten times larger than that...

2012
Huan Zhao Yufeng Xiao

Motivated by the nonlinear characteristics of the speech signal, this paper presents a novel robust MFCC extraction method using sample-ISOMAP. ISOMAP is a nonlinear dimensionality reduction method based on manifold theory; it can reveal the meaningful low-dimensional structure hidden in high-dimensional observations. In the proposed method, ISOMAP is first applied for calculating the...

2006
Yasunari Obuchi Nobuo Hataoka

In a microphone array system, feature combination in the MFCC domain can improve speech recognition accuracy. Multiple microphones provide different feature parameters such as MFCCs even if they have similar speech and noise signals, because of the phase difference and transmission characteristics. In this paper, we investigate how the recognition performance changes when we average multiple MF...
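
The feature combination mentioned in this abstract can be pictured, in its simplest form, as a frame-by-frame average of the MFCC matrices from each microphone. The sketch below assumes the channels are already time-aligned and framed identically; the function name `average_mfcc_channels` and the toy inputs are hypothetical and are not taken from the paper.

```python
import numpy as np

def average_mfcc_channels(channel_mfccs):
    """Average MFCC matrices from several microphones, frame by frame.

    channel_mfccs: sequence of arrays, each of shape (n_frames, n_ceps).
    Assumption: all channels are time-aligned and share one shape;
    np.stack raises ValueError if the shapes differ.
    """
    stacked = np.stack([np.asarray(m, dtype=float) for m in channel_mfccs])
    return stacked.mean(axis=0)

# Hypothetical two-microphone input: 3 frames, 2 coefficients each.
mic_a = np.zeros((3, 2))
mic_b = np.full((3, 2), 2.0)
combined = average_mfcc_channels([mic_a, mic_b])
```

Because MFCCs are a linear transform of log Mel energies, averaging in the MFCC domain is equivalent to taking the geometric mean of the per-channel Mel energies, not their arithmetic mean.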
