Grammatical error detection from English utterances spoken by Japanese
نویسندگان
چکیده
This paper describes methods to recognize English utterances by Japanese learners as accurately as possible and detects grammatical errors from the transcription of the utterances. This method is a building block for the voice-interactive Computer-Assisted Language Learning (CALL) system that enables a learner to make conversation practice with a computer. A difficult point for development of such a system is that the utterances made by the learners contain grammatical mistakes, which are not assumed to happen in an ordinary speech recognizer. To realize generation of accurate transcription including grammatical mistakes, we employed a language model based on an N-gram trained by generated texts. The text generation is based on grammatical error rules that reflect tendency of grammatical mistakes made by Japanese learners. The experimental results showed that the proposed method improved recognition accuracy compared with the conventional recognition and error detection method.
منابع مشابه
Automatic Error Detection in the Japanese Learners' English Spoken Data
This paper describes a method of detecting grammatical and lexical errors made by Japanese learners of English and other techniques that improve the accuracy of error detection with a limited amount of training data. In this paper, we demonstrate to what extent the proposed methods hold promise by conducting experiments using our learner corpus, which contains information on learners’ errors.
متن کاملClassification of lexical stress using spectral and prosodic features for computer-assisted language learning systems
We present a system for detection of lexical stress in English words spoken by English learners. This system was designed to be part of the EduSpeak R © computer-assisted language learning (CALL) software. The system uses both prosodic and spectral features to detect the level of stress (unstressed, primary or secondary) for each syllable in a word. Features are computed on the vowels and inclu...
متن کاملAutomatic Prediction of Intelligibility of Spoken Words in Japanese Accented English
This study examines automatic prediction of the words that will be unintelligible if they are spoken by Japanese speakers of English. In our previous study [1], 800 English utterances spoken by Japanese speakers, which contained 6,063 words, were presented to 173 American listeners and correct perception rate was obtained for each spoken word. By using the results, in this study, we define the ...
متن کاملAutomatic detection of the words that will become unintelligible through Japanese accented pronunciation of English
This study examines automatic detection of the words that will be unintelligible if they are spoken by Japanese speakers of English. In our previous study [1], 800 English utterances spoken by Japanese speakers, which contained 6,063 words, were presented to 173 American listeners and correct perception rate was obtained for each spoken word. By using the results, in this study, we define the w...
متن کاملData Driven Grammatical Error Detection in Transcripts of Children's Speech
We investigate grammatical error detection in spoken language, and present a data-driven method to train a dependency parser to automatically identify and label grammatical errors. This method is agnostic to the label set used, and the only manual annotations needed for training are grammatical error labels. We find that the proposed system is robust to disfluencies, so that a separate stage to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010