Broad Phonetic Classification of ASR using Visual Based Features
نویسندگان
چکیده
منابع مشابه
Broad phonetic classification using discriminative Bayesian networks
We present an approach to broad phonetic classification, defined as mapping acoustic speech frames into broad (or clustered) phonetic categories. Our categories consist of silence, general voiced, general unvoiced, mixed sounds, voiced closure, and plosive release, and are sufficiently rich to allow accurate time-scaling of speech signals to improve their intelligibility in, e.g. voice-mail app...
متن کاملAcoustic Parameters versus Phonetic Features in Asr
By mapping acoustic parameters onto phonetic features, it is possible to explicitly address the linguistic information in the signal. For the experiments presented in this paper, we mapped cepstral parameters onto two sets of phonetic features, one based on the IPA chart and the other on SPE. As a result, the phoneme identification rates in a hidden Markov modelling framework increase from 15.6...
متن کاملDo phonetic features help to improve consonant identification in ASR?
The hidden Markov modelling experiments presented in this paper show that consonant identification results can be improved substantially if a neural network is used to extract linguistically relevant information from the acoustic signal before applying hidden Markov modelling. The neural network – or in this case a combination of two Kohonen networks – takes 12 mel-frequency cepstral coefficien...
متن کاملA speechreading aid based on phonetic ASR
Manual Cued Speech (MCS) is an effective method of communication by the deaf and hearing-impaired. We first describe our work on assessing the feasibility of automatic determination and presentation of cues without intervention by the speaker. The conclusions of this study are then applied to the design and implementation of a prototype automatic cueing system using HMM-based automatic speech r...
متن کاملStatistical dialect classification based on mean phonetic features
Our paper describes work done on a text-dependent method for automatic utterance classi cation and dialect model selection using mean cepstral and duration features on a per phoneme basis. From transcribed dialect data, we build a linear discriminant to separate the dialects in feature space. This method is potentially much faster than our previous selection algorithm. We have been able to achi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Egyptian Journal of Language Engineering
سال: 2020
ISSN: 2356-8216
DOI: 10.21608/ejle.2020.24358.1003