MFCC coefficients

Spoken Digits Recognition using Weighted MFCC and Improved Features for Dynamic Time Warping

2012

Santosh V. Chapaneri B. H. Juang O. W. Kwon K. Chan

In this paper, we propose novel techniques for feature parameter extraction based on MFCC and feature recognition using the dynamic time warping algorithm for application in speaker-independent isolated digit recognition. Using the proposed Weighted MFCC (WMFCC), we achieve low computational overhead for the feature recognition stage since we use only 13 weighted MFCC coefficients instead of the c...
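For readers unfamiliar with the recognition stage mentioned in this abstract, the following is a minimal sketch of plain dynamic time warping used for template matching. It is not the paper's WMFCC method or its improved features; the function names `dtw_distance` and `recognize`, the Euclidean local distance, and the toy one-dimensional "feature vectors" are all assumptions made for illustration.

```python
import math

def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    Illustrative only: uses a Euclidean local distance and the basic
    (unconstrained) step pattern, not any weighting from the paper.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # local frame-to-frame distance
            cost[i][j] = d + min(cost[i - 1][j],      # advance in a only
                                 cost[i][j - 1],      # advance in b only
                                 cost[i - 1][j - 1])  # advance in both
    return cost[n][m]

def recognize(query, templates):
    """Return the label of the template with the smallest DTW cost (hypothetical helper)."""
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))

# Toy usage with made-up one-dimensional frames standing in for MFCC vectors.
templates = {"one": [[0.0], [1.0]], "two": [[5.0], [6.0]]}
print(recognize([[0.1], [0.9]], templates))
```

In a real system each frame would be a vector of cepstral coefficients (for example 13 per frame, as the abstract mentions), and the templates would come from recorded reference utterances.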


Chip design of MFCC extraction for speech recognition

Journal: Integration 2002

Jia-Ching Wang Jhing-Fa Wang Yu-Sheng Weng

The mel frequency cepstral coefficient (MFCC) is one of the most important features required among various kinds of speech applications. In this paper, the first chip for speech features extraction based on MFCC algorithm is proposed. The chip is implemented as an intellectual property, which is suitable to be adopted in a speech recognition system on a chip. The computational complexity and me...


Robust speaker identification based on perceptual log area ratio and Gaussian mixture models

2004

David Chow Waleed H. Abdulla

This paper presents a new feature for speaker identification called perceptual log area ratio (PLAR). PLAR is closely related to the log area ratio (LAR) feature. PLAR is derived from the perceptual linear prediction (PLP) rather than the linear predictive coding (LPC). The PLAR feature derived from PLP is more robust to noise than the LAR feature. In this paper, PLAR, LAR and MFCC features wer...


Formant Prediction from MFCC Vectors

2004

Jonathan Darch Ben Milner Xu Shao

This work proposes a novel method of predicting formant frequencies from a stream of mel-frequency cepstral coefficients (MFCC) feature vectors. Prediction is based on modelling the joint density of MFCC vectors and formant vectors using a Gaussian mixture model (GMM). Using this GMM and an input MFCC vector, two maximum a posteriori (MAP) prediction methods are developed. The first method pred...


Tandem processing of fepstrum features

2008

Vivek Tyagi

In our previous work [1, 2], we introduced Fepstrum, an improved modulation spectrum estimation technique that overcomes certain theoretical as well as practical shortcomings in previously published modulation spectrum related techniques [7, 8, 9]. In this paper, we provide further extensive ASR results using the Tandem processed Fepstrum features over the TIMIT corpus. The results are c...


Modified Mel-frequency Cepstrum Coefficient

2003

Li Tan Montri Karnjanadecha

This paper describes the principle of MFCC feature extraction and the knowledge of human auditory masking effect in order to introduce a modified-MFCC feature extraction that can improve the robustness of speech recognition systems.


Study on Gender Identification Based on Audio Recordings Using Gaussian Mixture Model and Mel Frequency Cepstrum Coefficient Technique

Journal: International Journal of Innovative Computing 2021

Speaker recognition is the ability to identify a speaker's characteristics from spoken language. The purpose of this study is to identify the gender of speakers in audio recordings. The objective is to evaluate the accuracy rate of the technique in differentiating gender and also to determine its performance in classifying gender even when using self-acquired audio recordings. Audio forensics uses voice recordings as part of the evidence to solve cases. This study was mainly conducted to provide easie...


Detecting Infants' Needs Through Analysis of Their Crying Sounds

Thesis: Islamic Azad University - Islamic Azad University, Shahrood Branch - Faculty of Electrical and Electronics Engineering 1393 SH

Sirous Talebi, Hossein Marvi, Nasrin Salehi

An infant's cry is one of its most important channels of communication with the surrounding world, through which it can express many of its needs. Analysis of an infant's cry can also reveal hypothyroidism, so that treatment can be planned from early childhood. This project attempts to detect a set of basic infant needs, such as hunger, discomfort, tiredness, the need to burp, and bloating, by analyzing their cries and to...


MSP-MFCC: Energy-Efficient MFCC Feature Extraction Method With Mixed-Signal Processing Architecture for Wearable Speech Recognition Applications

Journal: IEEE Access 2020


A New Method for Text-Independent Speaker Recognition in Noisy Environments

Journal: Intelligent Methods in the Electrical Industry 2014

Hamid Mahmoodian, Nona Heydari Esfahani

This paper addresses noise-robust, text-independent speaker recognition. The proposed method works by removing silence from sentences and segmenting them into smaller units, each containing several phonemes and at least one vowel, in order to extract long-term features such as entropy. A high-energy vowel is identified in each speech segment to extract the fundamental frequency and formants. By applying a clustering method, the short-term features, that is, the coefficients ...
