Using Mostly Native Data to Correct Errors in Learners' Writing: A Meta-Classifier Approach
نویسنده
چکیده
We present results from a range of experiments on article and preposition error correction for non-native speakers of English. We first compare a language model and errorspecific classifiers (all trained on large English corpora) with respect to their performance in error detection and correction. We then combine the language model and the classifiers in a meta-classification approach by combining evidence from the classifiers and the language model as input features to the metaclassifier. The meta-classifier in turn is trained on error-annotated learner data, optimizing the error detection and correction performance on this domain. The meta-classification approach results in substantial gains over the classifieronly and language-model-only scenario. Since the meta-classifier requires error-annotated data for training, we investigate how much training data is needed to improve results over the baseline of not using a meta-classifier. All evaluations are conducted on a large errorannotated corpus of learner English.
منابع مشابه
Using Mostly Native Data to Correct Errors in Learners' Writing
We present results from a range of experiments on article and preposition error correction for non-native speakers of English. We first compare a language model and errorspecific classifiers (all trained on large English corpora) with respect to their performance in error detection and correction. We then combine the language model and the classifiers in a meta-classification approach by combin...
متن کاملNative Language Interference in Writing: A case study of Thai EFL learners
AbstractThe interference of the native language in acquiring a foreign language is unavoidable. In an attempt to explore the phenomenon why this occurs, the study was conducted in English as a foreign language writing. The study also investigated how the native language interference occurred in the writing process. In fact, this qualitative study explored the reasons and the process of na...
متن کاملNative Language Interference in Writing: A case study of Thai EFL learners
AbstractThe interference of the native language in acquiring a foreign language is unavoidable. In an attempt to explore the phenomenon why this occurs, the study was conducted in English as a foreign language writing. The study also investigated how the native language interference occurred in the writing process. In fact, this qualitative study explored the reasons and the process of na...
متن کاملUsing Statistical Techniques and Web Search to Correct ESL Errors
In this paper we present a system for automatic correction of errors made by learners of English. The system has two novel aspects. First, machine-learned classifiers trained on large amounts of native data and a very large language model are combined to optimize the precision of suggested corrections. Second, the user can access real-life web examples of both their original formulation and the...
متن کاملTraining Paradigms for Correcting Errors in Grammar and Usage
This paper proposes a novel approach to the problem of training classifiers to detect and correct grammar and usage errors in text by selectively introducing mistakes into the training data. When training a classifier, we would like the distribution of examples seen in training to be as similar as possible to the one seen in testing. In error correction problems, such as correcting mistakes mad...
متن کامل