coefficient mfcc

نتایج جستجو برای: coefficient mfcc

تعداد نتایج: 170818 فیلتر نتایج به سال:

Tandem processing of fepstrum features

2008

Vivek Tyagi

In our previous work [1, 2], we have introduced Fepstrum an improved modulation spectrum estimation technique that overcomes certain theoretical as well as practical shortcomings in the previously published modulation spectrum related techniques[7, 8, 9]. In this paper, we provide further extensive ASR results using the Tandem processed Fepstrum features over the TIMIT corpus. The results are c...

متن کامل

Modified Mel-frequency Cepstrum Coefficient

2003

Li Tan Montri Karnjanadecha

This paper describes the principle of MFCC feature extraction and the knowledge of human auditory masking effect in order to introduce a modified-MFCC feature extraction that can improve the robustness of speech recognition systems.

متن کامل

MSP-MFCC: Energy-Efficient MFCC Feature Extraction Method With Mixed-Signal Processing Architecture for Wearable Speech Recognition Applications

Journal: :IEEE Access 2020

متن کامل

COVID-19 Detection Model with Acoustic Features from Cough Sound and Its Application

Journal: :Applied sciences 2023

Contrary to expectations that the coronavirus pandemic would terminate quickly, number of people infected with virus did not decrease worldwide and coronavirus-related deaths continue occur every day. The standard COVID-19 diagnostic test technique used today, PCR testing, requires professional staff equipment, which is expensive takes a long time produce results. In this paper, we propose feat...

متن کامل

Emotion recognition in spontaneous speech using GMMs

2006

Daniel Neiberg Kjell Elenius Kornel Laskowski

Automatic detection of emotions has been evaluated using standard Mel-frequency Cepstral Coefficients, MFCCs, and a variant, MFCC-low, calculated between 20 and 300 Hz, in order to model pitch. Also plain pitch features have been used. These acoustic features have all been modeled by Gaussian mixture models, GMMs, on the frame level. The method has been tested on two different corpora and langu...

متن کامل

Predicting Formant Frequencies from MFCC Vectors

2005

Jonathan Darch Ben P. Milner Xu Shao Saeed Vaseghi Qin Yan

This work proposes a novel method of predicting formant frequencies from a stream of mel-frequency cepstral coefficients (MFCC) feature vectors. Prediction is based on modelling the joint density of MFCCs and formant frequencies using a Gaussian mixture model (GMM). Using this GMM and an input MFCC vector, two maximum a posteriori (MAP) prediction methods are developed. The first method predict...

متن کامل

Speaker Identification by Combining Various Vocal Tract and Vocal Source Features

2014

Yuta Kawakami Longbiao Wang Atsuhiko Kai Seiichi Nakagawa

Previously, we proposed a speaker recognition system using a combination of MFCC-based vocal tract feature and phase information which includes rich vocal source information. In this paper, we investigate the efficiency of combination of various vocal tract features (MFCC and LPCC) and vocal source features (phase and LPC residual) for normal-duration and short-duration utterance. The Japanese ...

متن کامل

A Novel Series Arc Fault Detection Method Based on Mel-Frequency Cepstral Coefficients and Fully Connected Neural Network

Journal: :IEEE Access 2022

Arc faults pose challenges to electric safety, which can cause serious fire hazards. However, the commonly used arc fault detection method is prone nuisance tripping. This paper proposed a hybrid based on improved Mel-Frequency Ceptral Coefficients (MFCC) for preprocessing and neural network model identification called ARC_MFCC. As per IEC 62606, twelve different loads/scenarios are considered ...

متن کامل

Emotion Recognition in Spontaneous Speech

2006

Daniel Neiberg Kjell Elenius Inger Karlsson Kornel Laskowski

Automatic detection of emotions has been evaluated using standard Mel-frequency Cepstral Coefficients, MFCCs, and a variant, MFCC-low, that is calculated between 20 and 300 Hz in order to model pitch. Plain pitch features have been used as well. These acoustic features have all been modeled by Gaussian mixture models, GMMs, on the frame level. The method has been tested on two different corpora...

متن کامل

Effectiveness of LP Based Features for Identification of Professional Mimics in Indian Languages

2006

Hemant A. Patil P. K. Dutta T. K. Basu

Automatic Speaker Recognition (ASR) is an economic tool for voice biometrics because of availability of low cost and powerful processors. For an ASR system to be successful in practical environments, it must have high mimic resistance, i.e., the system should not be defeated by determined mimics which may be either identical twins or professional mimics. In this paper, we demonstrate the effect...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید