Discriminating Non-Native English with 350 Words

نویسندگان

  • John C. Henderson
  • Guido Zarrella
  • Craig Pfeifer
  • John D. Burger
چکیده

This paper describes MITRE’s participation in the native language identification (NLI) task at BEA-8. Our best effort performed at an accuracy of 82.6% in the eleven-way NLI task, placing it in a statistical tie with the best performing systems. We describe the variety of machine learning approaches that we explored, including Winnow, language modeling, logistic regression and maximum-entropy models. Our primary features were word and character n-grams. We also describe several ensemble methods that we employed for combining these base systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Investigation of Assessment Literacy Among Native and Non-Native English Teachers

The current study aimed at examining the relationship between English language teachers’ assessment literacy and their teaching experience. In other words, it intended to inspect the relationship between native and non-native English language teachers’ assessment literacy and their teaching experience. To achieve such goals, 100 native and non-native English teachers from ESL and EFL contexts w...

متن کامل

A Comparative Analysis of Epistemic and Root Modality in Two selected English Books in the Field of Applied Linguistics Written by English Native and Iranian Non-native Writers

Academic discourse has always been the focus of many linguists, especially those who have been involved with English for Academic Purposes (EAP) and discourse analysis. Persuasion, as part of rhetorical structure of academic writing, is partly achieved by employing modality markers.  Adopting a descriptive design, the present study was carried out to compare the use of modality markers in terms...

متن کامل

The Role of Phonotactics in the Segmentation of Native and Non- Native Continuous Speech

Previous research has shown that listeners make use of their knowledge of phonotactic constraints to segment speech into individual words. The present study investigates the influence of phonotactics when segmenting a non-native language. German and English listeners detected embedded English words in nonsense sequences. German listeners also had knowledge of English, but English listeners had ...

متن کامل

The Use of Lexical Bundles in Native and Non-native Post-graduate Writing: The Case of Applied Linguistics MA Theses

Connor et al. (2008) mention “specifying textual requirements of genres” (p.12) as one of the reasons which have motivated researchers in the analysis of writing. Members of each genre should be able to produce and retrieve these textual requirements appropriately to be considered communicatively proficient. One of the textual requirements of genres is regularities of specific forms and content...

متن کامل

Native and Non-Native Teachers’ Changing Beliefs about Teaching English as an International Language

In view of the paucity of evidence on teachers’ conceptions of teaching English an International Language (EIL), the present study used panel discussions to investigate the beliefs of 10 native and 10 non-native English-speaking teachers about their roles in teaching English in the EIL contexts and the perceptions of EIL. The findings revealed that some aspects of teachers’ beliefs about their ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013