Towards Automatic Error Type Classification of Japanese Language Learners' Writings
Abstract
Learner corpora are receiving special attention as an invaluable source of educational feedback and are expected to improve teaching materials and methodology. However, they include various types of incorrect sentences. Error type classification is an important task for learner corpora because it clarifies for learners why a given sentence is judged incorrect, which helps them avoid repeating the same errors. To address this issue, we defined a set of error type criteria and automatically classified the errors in sentences from the NAIST Goyo Corpus into error types, achieving an accuracy of 77.6%. We also carried out an inter-corpus evaluation of our system on the Lang-8 corpus of learner Japanese and achieved an accuracy of 42.3%. To assess these accuracies, we also investigated classification by human judgement and compared the classifications made by the machine with those made by humans.
Similar resources
Impact of Grouping Type in Descriptive Collaborative Writings on Iranian EFL Learners' Written Grammatical Accuracy
The current study was an attempt to investigate the impact of grouping type on the grammatical accuracy of Iranian EFL learners in collaborative writing. After the Michigan Test of English Language Proficiency was administered, 64 available female university students participated in this study and were assigned to two groups, heterogeneous and homogeneous. The treatment process lasted 12 weeks o...
Exploring EFL Learners' Beliefs toward Communicative Language Teaching: A Case Study of Iranian EFL Learners
Although Communicative Language Teaching (CLT) has been widely advocated by a considerable number of applied linguists and English language teachers, its implementation in English as a Foreign Language (EFL) contexts has encountered a number of difficulties. Reviewing the literature suggests that one of the reasons for unsuccessful implementation of CLT may be neglect of learners' beliefs in t...
The Effect of Learner Corpus Size in Grammatical Error Correction of ESL Writings
English as a Second Language (ESL) learners’ writings contain various grammatical errors. Previous research on automatic error correction for ESL learners’ grammatical errors deals with restricted types of learners’ errors. Some types of errors can be corrected by rules using heuristics, while others are difficult to correct without statistical models using native corpora and/or learner corpora...
Writing assistants and automatic lexical error correction: word combinatorics
Genuine lexical writing assistants that attempt to detect lexical errors such as miscollocations are traditionally less common in Computer Assisted Language Learning than spell and grammar checkers. However, there is empirical evidence of the importance of capturing and correcting miscollocations in the writings of language learners, and therefore an increasing number of proposals deal with th...
The Overview of the SST Speech Corpus of Japanese Learner English and Evaluation Through the Experiment on Automatic Detection of Learners' Errors
This paper gives an overview of the speech corpus of Japanese learner English compiled by the National Institute of Information and Communications Technology, describing its data collection procedure and annotation schemes, including error tagging. We have collected 1,200 interviews over three years. One of the most distinctive features of this corpus is that it contains rich information on learners’ ...