Modelling Accents for Automatic Speech Recognition
نویسندگان
چکیده
Accent is cited as an issue for speech recognition systems. If they are to be widely deployed, Automatic Speech Recognition (ASR) systems must deliver consistently high performance across user populations. Hence the development of accentrobust ASR is of significant importance. This research investigates techniques for compensating for the effects of accents on performance of Hidden Markov Model (HMM) based ASR systems. Recently, HMM systems based on Deep Neural Networks (DNNs) have achieved superior performance to more traditional systems based on Gaussian Mixture Models (GMMs), due to the discriminative nature of DNNs. Our research confirms, this by showing that a DNN system outperforms the GMM system even after an accent-dependent acoustic model was selected using Accent Identification (AID), followed by speaker adaptation. The average performance of the DNN system over all accent groups is maximized when either accent diversity is highest, or data from “difficult” accent-groups is included in the training set.
منابع مشابه
Speech Recognition of South African English Accents
Declaration By submitting this thesis electronically, I declare that the entirety of the work contained therein is my own, original work, that I am the sole author thereof (save to the extent explicitly otherwise stated), that reproduction and publication thereof by Stellenbosch University will not infringe any third party rights and that I have not previously in its entirety or in part submitt...
متن کاملAcoustic modelling of English-accented and Afrikaans-accented South African English
In this paper we investigate whether it is possible to combine speech data from two South African accents of English in order to improve speech recognition in any one accent. Our investigation is based on Afrikaans-accented English and South African English speech data. We compare three acoustic modelling approaches: separate accent-specific models, accentindependent models obtained by straight...
متن کاملTowards a Localised German Automatic Speech Recognition
Spoken languages are often rich in regional accents and dialects. These local variations often pose challenges to automatic speech recognition. In this study, we analyse the influence of German regional accents on the performance of a large vocabulary continuous speech recogniser trained on standard German data. The experiments show a large variation in the error rate over different regions. We...
متن کاملAccent Classification among Punjabi , Urdu , Pashto , Saraiki and Sindhi Accents of Urdu Language
Automatic Speech Recognition (ASR) is a key component in Human Computer Interaction (HCI) applications. Stability of ASR systems largely depends on accent, gender, age of speakers, background noise and channel variations. In this paper, a study has been conducted to classify five different accents of Urdu language spoken in Pakistan i.e. Punjabi, Urdu, Pashto, Saraiki and Sindhi. Speech data ha...
متن کاملSignal-based accent and phrase marking using the fujisaki model
Automatic prosodic marking is very important in speech signal processing, since its results are required in many subsections, e.g. speech synthesis and speech recognition. The most important prosodic features on the linguistic level are the marking of accents and phrases. In this paper, we develop an automatic algorithm for marking accents and phrases which analyzes the F0 contour by using the ...
متن کامل