Leveraging Native Data to Correct Preposition Errors in Learners' Dutch
نویسندگان
چکیده
We address the task of automatically correcting preposition errors in learners’ Dutch by modelling preposition usage in native language. Specifically, we build two models exploiting a large corpus of Dutch. The first is a binary model for detecting whether a preposition should be used at all in a given position or not. The second is a multiclass model for selecting the appropriate preposition in case one should be used. The models are tested on native as well as learners data. For the latter we exploit a crowdsourcing strategy to elicit native judgements. On native test data the models perform very well, showing that we can model preposition usage appropriately. However, the evaluation on learners’ data shows that while detecting that a given preposition is wrong is doable reasonably well, detecting the absence of a preposition is a lot more difficult. Observing such results and the data we deal with, we envisage various ways of improving performance, and report them in the final section of this article.
منابع مشابه
Learning spatial prepositions by Iranian EFL learners
The aim of the present study is threefold. The first is that whether there is any difference betweendifferent proficiency level language learners 'use of spatial prepositions. The Second aim is toreveal that if the native language of the participants has any effect on applying the appropriateprepositions and also to find which spatial preposition is difficult to acquire. The present paperexamin...
متن کاملUsing the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors
This paper presents an algorithm for correcting language errors typical of second-language learners. We focus on preposition errors, which are very common among second-language learners but are not addressed well by current commercial grammar correctors and editing aids. The algorithm takes as input a sentence containing a preposition error (and possibly other errors as well), and outputs the c...
متن کاملUsing an Error-Annotated Learner Corpus to Develop an ESL/EFL Error Correction System
This paper presents research on building a model of grammatical error correction, for preposition errors in particular, in English text produced by language learners. Unlike most previous work which trains a statistical classifier exclusively on well-formed text written by native speakers, we train a classifier on a large-scale, error-tagged corpus of English essays written by EFL learners, rel...
متن کاملAcquisition of English Relative Clauses by Adult Persian Learners: Focus on Resumptive Pronouns
Tsimpli and Dimitrakopoulou (2007) observed that uninterpretable features are unavailable in second language (L2) acquisition after the critical period. In this paper, we verify this claim by providing evidence from Persian speaking learners of English as an L2 on the status of resumptive pronouns (RPs) as uniterpretable features. Unlike English which does not allow RPs, Persian shows various b...
متن کاملA Corpus-based Analysis of Collocational Errors in the Iranian EFL Learners' Oral Production
Collocations are one of the areas generally considered problematic for EFL learners. Iranian learners of English like other EFL learners face various problems in producing oral collocations. An analysis of learners' spoken interlanguage both indicates the scope of the problem and the necessity to spend more time and energy by learners on mastering collocations. The present study specifically f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016