Robust feature vector compression algorithm for distributed speech recognition

نویسندگان

  • Imre Kiss
  • Pekka Kapanen
چکیده

In this paper we propose an algorithm for efficient compression of feature extracted parameters used in speech recognition. The algorithm provides a compression ratio of roughly 1:10 and causes negligible or no loss in recognition performance. It is also shown to be robust against enviromental noise. Combined with an appropriate framing structure, a complete system is obtained, which can be used for implementing speech recognition applications e.g. in a cellular mobile environment. The system achieves a gross bitrate as low as 4200 bps.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...

متن کامل

A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition

We propose a novel packetization and variable bitrate compression scheme for DSR source coding, based on the Group of Pictures concept from video coding. The proposed algorithm simultaneously packetizes and further compresses source coded features using the high interframe correlation of speech, and is compatible with a variety of VQ-based DSR source coders. The algorithm approximates vector qu...

متن کامل

Noise reduction for noise robust feature extraction for distributed speech recognition

This paper describes the noise robust feature extraction meth­ ods developed by France Telecom and Alcatel for the noise robust front-end standardisation of ETSI Aurora. It is shown that both noise reduction methods give a substantial im­ provement when compared to a standard MFCC feature ex­ traction algorithm for speech recognition in noisy environ­ ments. In addition, blind equalisation and ...

متن کامل

A Low-resource, Miniature of the Etsi Distributed Speech R

The purpose of this work is to demonstrate that distributed speech recognition front-ends can be deployed in environments which providefor very little power and CPU resources, with possibly no degradation of speech recognition quality when compared to standard floatingpoint implementations. The ETSI distributed speech recognition front-end standard is implemented on an ultra low-power miniature...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999