A Naive Bayes classifier for automatic correction of preposition and determiner errors in ESL text
نویسندگان
چکیده
This is the report for the CNGL ILT3 team entry to the HOO shared task. A Naive-Bayes-based classifier was used in the task which involved error detection and correction in ESL exam scripts. Our system placed 11th out of 14 teams for the detection and recognition tasks and 11th out of 13 teams for the correction task on the based on f-score for both preposition and determiner errors.
منابع مشابه
A Classifier-Based Approach to Preposition and Determiner Error Correction in L2 English
In this paper, we present an approach to the automatic identification and correction of preposition and determiner errors in nonnative (L2) English writing. We show that models of use for these parts of speech can be learned with an accuracy of 70.06% and 92.15% respectively on L1 text, and present first results in an error detection task for L2 writing.
متن کاملA New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier
With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...
متن کاملUsing Error-Annotated ESL Data to Develop an ESL Error Correction System
This paper presents research on building a model of grammatical error correction, for preposition errors in particular, in English text produced by language learners. Unlike most previous work which trains a statistical classifier exclusively on well-formed text written by native speakers, we train a classifier on a large-scale, error-tagged corpus of English essays, relying on contextual and g...
متن کاملMemory-based Grammatical Error Correction
We describe the ’TILB’ team entry for the CONLL-2013 Shared Task. Our system consists of five memory-based classifiers that generate correction suggestions for center positions in small text windows of two words to the left and to the right. Trained on the Google Web 1T corpus, the first two classifiers determine the presence of a determiner or a preposition between all words in a text. The sec...
متن کاملMemory-based text correction for preposition and determiner errors
We describe the Valkuil.net team entry for the HOO 2012 Shared Task. Our systems consists of four memory-based classifiers that generate correction suggestions for middle positions in small text windows of two words to the left and to the right. Trained on the Google 1TB 5gram corpus, the first two classifiers determine the presence of a determiner or a preposition between all words in a text i...
متن کامل