phoneme recognition

Search results for: phoneme recognition

Number of results: 254307

An evaluation of using mutual information for selection of acoustic-features representation of phonemes for speech recognition

2002

Mohamed Kamal Omar, Ken Chen, Mark Hasegawa-Johnson, Yigal Brandman

This paper addresses the problem of finding a subset of the acoustic feature space that best represents the phoneme set used in a speech recognition system. A maximum mutual information approach is presented for selecting acoustic features to be combined together to represent the distinctions among the phonemes. The overall phoneme recognition accuracy is slightly increased for the same length ...

Tandem representations of spectral envelope and modulation frequency features for ASR

2009

Samuel Thomas, Sriram Ganapathy, Hynek Hermansky

We present a feature extraction technique for automatic speech recognition that uses Tandem representation of short-term spectral envelope and modulation frequency features. These features, derived from sub-band temporal envelopes of speech estimated using frequency domain linear prediction, are combined at the phoneme posterior level. Tandem representations derived from these phoneme posterior...

Recognition of spontaneous conversational speech using long short-term memory phoneme predictions

2010

Martin Wöllmer, Florian Eyben, Björn W. Schuller, Gerhard Rigoll

We present a novel continuous speech recognition framework designed to unite the principles of triphone and Long Short-Term Memory (LSTM) modeling. The LSTM principle allows a recurrent neural network to store and to retrieve information over long time periods, which was shown to be well-suited for the modeling of co-articulation effects in human speech. Our system uses a bidirectional LSTM netw...

Phoneme recognition using acoustic events

Journal: CoRR, 1994

Kai Hübener, Julie Carson-Berndsen

This paper presents a new approach to phoneme recognition using nonsequential sub-phoneme units. These units are called acoustic events and are phonologically meaningful as well as recognizable from speech signals. Acoustic events form a phonologically incomplete representation as compared to distinctive features. This problem may partly be overcome by incorporating phonological constraints. Cu...

Speech Data Clustering Based on Phoneme Error Trend for Unsupervised Acoustic Model Adaptation

2012

Taichi Asami, Satoshi Kobashikawa, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi

Unsupervised cluster adaptive training of acoustic models offers promise in improving recognition accuracy, especially for speech recognition systems that store massive sets of speech samples from unknown people. How to classify the variety of acoustic characteristics is an important problem in adaptation sample clustering. We propose a novel speech sample clustering method that focuses on the ...

Hidden Markov Model-Based Modelling of Context-Dependent Phonemes Using Decision Tree-Based State Clustering

2002

H. A. Engelbrecht, J. A. du Preez

This paper discusses hidden Markov model-based context-dependent phoneme modelling and its associated problems, particularly data insufficiency and unseen triphones. The implementation of decision tree-based state clustering, a technique suitable for solving these problems, is discussed. This technique was first proposed in 1994 by Young, Woodland and Odell [1]. A triphone-based phoneme recogn...

Discriminative training for continuous speech recognition

1995

Wolfgang Reichl, Günther Ruske

Discriminative training techniques for Hidden Markov Models were recently proposed and successfully applied for automatic speech recognition. In this paper a discussion of the Minimum Classification Error and the Maximum Mutual Information objective is presented. An extended reestimation formula is used for the HMM parameter update for both objective functions. The discriminative training method...

Reinforcement learning for phoneme recognition

1999

Akira Ichikawa, Tomoyuki Shimizu, Yasuo Horiuchi

In a spontaneous spoken dialogue understanding system, real-time response and robustness to the environment are required. To realize these requirements, we adopted a multi-agent system architecture. In this paper, we propose a reinforcement learning method for a phoneme recognizing agent as a sample agent, and adopt a continuous dynamic programming technique to deal with continuous phoneme reco...

Multiple Reduced Phoneme Sets for Second Language Speech Recognition

2015

Xiaoyun Wang

This paper describes a novel method to improve the performance of second language speech recognition when the mother tongue of users is known. Considering that second language speech usually includes less fluent pronunciation and more frequent pronunciation mistakes, I propose using a reduced phoneme set generated by a phonetic decision tree (PDT)-based top-down sequential splitting method inst...

Why not model spoken word recognition instead of phoneme monitoring?

Journal: Behavioral and Brain Sciences, 2000
