Grammatical error detection from English utterances spoken by Japanese

Authors

Takuya Anzai

Seongjun Hahm

Akinori Ito

Masashi Ito

Shozo Makino

Abstract

This paper describes methods for recognizing English utterances by Japanese learners as accurately as possible and for detecting grammatical errors in the transcriptions of those utterances. The method is a building block for a voice-interactive Computer-Assisted Language Learning (CALL) system that enables a learner to practice conversation with a computer. A difficulty in developing such a system is that learners' utterances contain grammatical mistakes, which an ordinary speech recognizer does not anticipate. To generate accurate transcriptions that include grammatical mistakes, we employed an N-gram language model trained on generated texts. The text generation is based on grammatical error rules that reflect the tendencies of grammatical mistakes made by Japanese learners. The experimental results showed that the proposed method improved recognition accuracy compared with the conventional recognition and error detection method.
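
The idea of training an N-gram model on rule-generated text can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the entries in `ERROR_RULES`, the seed sentence, and the helper names `generate_variants` and `bigram_counts` are hypothetical and are not taken from the paper, which does not list its rules or tooling in this abstract. The sketch applies one rule at a time to a correct sentence and then collects bigram counts over the resulting texts.

```python
from collections import Counter

# Hypothetical error rules, for illustration only (not the paper's rule set).
# Each correct token maps to variants a learner might produce; "" means deletion.
ERROR_RULES = {
    "a": ["", "the"],
    "the": ["", "a"],
    "goes": ["go"],
}


def generate_variants(sentence):
    """Return the sentence plus every variant made by applying one rule once."""
    tokens = sentence.split()
    variants = [tokens]
    for i, tok in enumerate(tokens):
        for repl in ERROR_RULES.get(tok, []):
            # Substitute the token, or drop it when the replacement is empty.
            new = tokens[:i] + ([repl] if repl else []) + tokens[i + 1:]
            variants.append(new)
    return [" ".join(v) for v in variants]


def bigram_counts(sentences):
    """Count bigrams over the sentences, with start and end markers."""
    counts = Counter()
    for s in sentences:
        toks = ["<s>"] + s.split() + ["</s>"]
        counts.update(zip(toks, toks[1:]))
    return counts


corpus = generate_variants("he goes to the park")
counts = bigram_counts(corpus)
```

Because the generated corpus contains both "he goes" and "he go", a model estimated from these counts gives nonzero probability to the erroneous form, which is what lets a recognizer transcribe a mistake instead of silently correcting it. A real system would also need smoothing and a much larger seed corpus.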

Similar resources

Automatic Error Detection in the Japanese Learners' English Spoken Data

This paper describes a method of detecting grammatical and lexical errors made by Japanese learners of English and other techniques that improve the accuracy of error detection with a limited amount of training data. In this paper, we demonstrate to what extent the proposed methods hold promise by conducting experiments using our learner corpus, which contains information on learners’ errors.

Classification of lexical stress using spectral and prosodic features for computer-assisted language learning systems

We present a system for detection of lexical stress in English words spoken by English learners. This system was designed to be part of the EduSpeak® computer-assisted language learning (CALL) software. The system uses both prosodic and spectral features to detect the level of stress (unstressed, primary or secondary) for each syllable in a word. Features are computed on the vowels and inclu...

Automatic Prediction of Intelligibility of Spoken Words in Japanese Accented English

This study examines automatic prediction of the words that will be unintelligible if they are spoken by Japanese speakers of English. In our previous study [1], 800 English utterances spoken by Japanese speakers, which contained 6,063 words, were presented to 173 American listeners and correct perception rate was obtained for each spoken word. By using the results, in this study, we define the ...

Automatic detection of the words that will become unintelligible through Japanese accented pronunciation of English

This study examines automatic detection of the words that will be unintelligible if they are spoken by Japanese speakers of English. In our previous study [1], 800 English utterances spoken by Japanese speakers, which contained 6,063 words, were presented to 173 American listeners and correct perception rate was obtained for each spoken word. By using the results, in this study, we define the w...

Data Driven Grammatical Error Detection in Transcripts of Children's Speech

We investigate grammatical error detection in spoken language, and present a data-driven method to train a dependency parser to automatically identify and label grammatical errors. This method is agnostic to the label set used, and the only manual annotations needed for training are grammatical error labels. We find that the proposed system is robust to disfluencies, so that a separate stage to...

Publication date: 2010
