Comparing MFCC and MPEG-7 Audio Features for Feature Extraaction, Maximum Likelihood HMM and Entropic Prior HMM for Sports Audio Classification

نویسندگان

  • Ziyou Xiong
  • Regunathan Radhakrishnan
  • Ajay Divakaran
  • Thomas S. Huang
چکیده

We present a comparison of 6 methods for classification of sports audio. For the feature extraction we have two choices: MPEG-7 audio features and Mel-scale Frequency Cepstrum Coefficients (MFCC). For the classificaiton we also have two choices: Maximum Likelihood Hidden Markov Models (ML-HMM) and Entropic Prior HMM(EP-HMM). EP-HMM, in turn, have two variations: with and without trimming of the model parameters. We thus have 6 possible methods, each of which corresponds to a combination. Our results show that all the combinations achieve classification accuracy of around 90% with the best and the second best being MPEG-7 features with EP-HMM and MFCC with ML-HMM.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing MFCC and MPEG-7 audio features for feature extraction, maximum likelihood HMM and entropic prior HMM for sports audio classification

We present a comparison of 6 methods for classification of sports audio. For the feature extraction we have two choices: MPEG-7 audio features and Mel-scale Frequency Cepstrum Coefficients(MFCC). For the classification we also have two choices: Maximum Likelihood Hidden Markov Models(ML-HMM) and Entropic Prior HMM(EP-HMM). EP-HMM, in turn, have two variations: with and without trimming of the m...

متن کامل

Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework

We developed a unified framework to extract highlights from three sports: baseball, golf and soccer by detecting some of the common audio events that are directly indicative of highlights. We used MPEG-7 audio features and entropic prior Hidden Markov Models(HMM) as the audio features and classifier respectively to recognize these common audio events. Together with preand post-processing techni...

متن کامل

Comparison of MPEG-7 basis projection features and MFCC applied to robust speaker recognition

Our purpose is to evaluate the efficiency of MPEG-7 basis projection (BP) features vs. Mel-scale Frequency Cepstrum Coefficients (MFCC) for speaker recognition in noisy environments. The MPEG-7 feature extraction mainly consists of a Normalized Audio Spectrum Envelope (NASE), a basis decomposition algorithm and a spectrum basis projection. Prior to the feature extraction the noise reduction alg...

متن کامل

Noise Diagnostics of Scooter Faults by Using MPEG-7 Audio Features and Intelligent Classification Techniques

A scooter fault diagnostic system that makes use of feature extraction and intelligent classification algorithms is presented in this paper. Sound features based on MPEG (Moving Picture Experts Group)-7 coding standard and several other features in the time and frequency domains are extracted from noise data and preprocessed prior to classification. Classification algorithms including the Neare...

متن کامل

A multidomain approach for automatic home environmental sound classification

This article presents a multidomain approach which addresses the problem of automatic home environmental sound recognition. The proposed system will be part of a human activity monitoring system which will be based on heterogeneous sensors. This work concerns the audio classification component and its primary role is to detect anomalous sound events. We compare the discriminative capabilities o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004