نتایج جستجو برای: mel frequency cepstral coefficient mfcc

تعداد نتایج: 644930  

2011
Cini Kurian Kannan Balakrishnan

Malayalam is one of the 22 scheduled languages in India with more than 130 million speakers. This paper presents a report on the development of a speaker independent, continuous transcription system for Malayalam. The system employs Hidden Markov Model (HMM) for acoustic modeling and Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. It is trained with 21 male and female speakers...

2014
K.Murali Krishna M.Vanitha Lakshmi

The process of converting an acoustic waveform into the text resembling the information, conveyed by the speaker is termed as speech recognition. Nowadays, normally Hidden Markov Model (HMM) based speech recognizer with Mel Frequency Cepstral Coefficient (MFCC) feature extraction is used. The proposed speech feature vector is generated by projecting an observed vector onto an Integrated Phoneme...

2001
Conrad Sanderson Kuldip K. Paliwal

In this paper we propose a new fusion technique, termed Joint Cohort Normalization Fusion, where the information fusion is done prior to the likelihood ratio test in a speaker verification system. The performance of the technique is compared against two popular types of fusion: feature vector concatenation and expert opinion fusion, for fusion of Mel Frequency Cepstral Coefficients (MFCC), MFCC...

2001
Conrad Sanderson Kuldip K. Paliwal

In this paper we have studied two information fusion approaches, namely feature vector concatenation and decision fusion, for the task of reducing error rates in a speaker verification system used in mismatched conditions. Three types of features are fused: Mel Frequency Cepstral Coefficients (MFCC), MFCC with Cepstral Mean Subtraction (CMS) and Maximum Auto-Correlation Values (MACV). We have u...

2010
Atanas Ouzounov

In the study, the effectiveness of combinations of cepstral features, channel compensation techniques, and different local distances in the Dynamic Time Warping (DTW) algorithm is experimentally evaluated in the text-dependent speaker identification task. The training and the testing has been done with noisy telephone speech (short phrases in Bulgarian with length of about 2 seconds) selected f...

2012
Sangjun Park Jungpyo Hong Byung - Ok Kang Yun - keun Lee Minsoo Hahn

In this paper, an algorithm for detecting and attenuating puff noises frequently generated under the mobile environment is proposed. As a baseline system, puff detection system is designed based on Gaussian Mixture Model (GMM), and 39th Mel Frequency Cepstral Coefficient (MFCC) is extracted as feature parameters. To improve the detection performance, effective acoustic features for puff detecti...

Journal: :Artif. Intell. Research 2016
Ta-Wen Kuan An-Chao Tsai Po-Hsun Sung Jhing-Fa Wang Hsien-Shun Kuo

An auditory-based feature extraction algorithm naming the Basilar-membrane Frequency-band Cepstral Coefficient (BFCC) is proposed to increase the robustness for automatic speech recognition. Compared to Fourier spectrogram based of the MelFrequency Cepstral Coefficient (MFCC) method, the proposed BFCC method engages an auditory spectrogram based on a gammachirp wavelet transform to simulate the...

2009
Sandipan Chakroborty

requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system has been used as a standard acoustic feature set for speech related applications. On a recent contribution by authors, it has been shown that the Inverted Mel-Frequency Ce...

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه بیرجند - دانشکده برق و کامپیوتر 1393

تشخیص جنسیت با استفاده از سیگنال گفتار احمد عطاران چکیده: طبقه بندیجنسیت درگفتار و بازشناسی گوینده به اندازه طبقه بندی احساسات گفتار مفید است زیرا هنگامی که مدلهای صوتی(آکوستیک) جداگانه برای مردان و زنان به کارگرفته شود کارایی بهتری خواهد داشت. با توجه به اینکه سکوت بین زن و مرد مشترک است بنا بر این سکوت از ابتدا حذف می گردد. این امر باعث کاهش حجم بار محاسباتی اضافی و همچنین افزای...

Journal: :CoRR 2010
Pawan Kumar Astik Biswas A. N. Mishra Mahesh Chandra

This paper introduces and motivates the use of hybrid robust feature extraction technique for spoken language identification (LID) sys tem. The speech recognizers use a parametric form of a signal to get the most important distinguishable features of speech signal for recognition task. In this paper Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) alon...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید