Discriminative Keyword Selection Using Support Vector Machines

نویسندگان

  • William M. Campbell
  • Fred Richardson
چکیده

Many tasks in speech processing involve classification of long term characteristics of a speech segment such as language, speaker, dialect, or topic. A natural technique for determining these characteristics is to first convert the input speech into a sequence of tokens such as words, phones, etc. From these tokens, we can then look for distinctive phrases, keywords, that characterize the speech. In many applications, a set of distinctive keywords may not be known a priori. In this case, an automatic method of building up keywords from short context units such as phones is desirable. We propose a method for construction of keywords based upon Support Vector Machines. We cast the problem of keyword selection as a feature selection problem for n-grams of phones. We propose an alternating filter-wrapper method that builds successively longer keywords. Application of this method to language recognition and topic recognition tasks shows that the technique produces interesting and significant qualitative and quantitative results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Recognition using keyword Hidden Markov Models and Support vector machines

New approaches to speaker and background model training have given rise to many recent developments in speaker recognition. Recently, various text-dependent approaches have surfaced, including a keyword Hidden Markov Models (HMM) approach [1]. This approach also deviates from the traditional bag-offrames approach by taking into account relationships in time among acoustic features for different...

متن کامل

A Comparative Study on Feature Selection for E

This paper explores the application of feature selection by the Correlation based Feature Selection (CFS) algorithm on the problem of classification of E.coli promoters using neural networks, Support Vector Machines (SVM) and Extreme Learning Machines (ELM). It was found that even though in general the classification accuracy can be reduced by a statistically significant amount, in real terms t...

متن کامل

Discriminative Training and Support V Language Call Ro

In natural language call routing, callers are routed to desired departments based on natural spoken responses to an open-ended “How may I direct your call?” prompt. Natural language call classification can be performed using support vector machines (SVMs) or the popular vector-based model used in information retrieval. We recently demonstrate how discriminative training is powerful to improve a...

متن کامل

An Efficient Method for Variables Selection Using SVM-Based Criteria

The problem of feature selection for Support Vector Machines (SVMs) classification is investigated in the linear two classes case. We suggest a new method of feature selection based on ranking scores derived from SVMs. We analyze the retraining effects on the ranking rules based on these scores. Our features selection algorithm consists in a forward selection strategy according to the decreasin...

متن کامل

Discriminative Clustering Based Feature Selection and Nonparametric Bayes Error Minimization and Support Vector Machines (SVMs)

In recent years feature selection is an eminent task in knowledge discovery database (KDD) that selects appropriate features from massive amount of high-dimensional data. In an attempt to establish theoretical justification for feature selection algorithms, this work presents a theoretical optimal criterion, specifically, the discriminative optimal criterion (DoC) for feature selection. Computa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007