Detecting English-French Cognates Using Orthographic Edit Distance

نویسندگان

  • Qiongkai Xu
  • Albert Chen
  • Chang Li
چکیده

Identification of cognates is an important component of computer assisted second language learning systems. We present a simple rule-based system to recognize cognates in English text from the perspective of the French language. At the core of our system is a novel similarity measure, orthographic edit distance, which incorporates orthographic information into string edit distance to compute the similarity between pairs of words from different languages. As a result, our system achieved the best results in the ALTA 2015 shared task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Detection of Orthographics Cues for Cognate Recognition

Present-day machine translation technologies crucially depend on the size and quality of lexical resources. Much of recent research in the area has been concerned with methods to build bilingual dictionaries automatically. In this paper we propose a methodology for the automatic detection of cognates between two languages based solely on the orthography of words. From a set of known cognates, t...

متن کامل

Cognate Priming 1 Orthography Plays a Critical Role in Cognate Priming: Evidence From

A series of three experiments was carried out in order to better characterize the representations that support long-term cognate priming. In Experiment 1, robust priming was obtained between orthographically similar French/English cognates in bilingual speakers, and this priming was mediated, in part, by orthographic codes given that priming for these items was dramatically reduced following a ...

متن کامل

Orthography plays a critical role in cognate priming: evidence from French/English and Arabic/French cognates.

A series of three experiments was carried out in order to better characterize the representations that support long-term cognate priming. In Experiment 1, robust priming was obtained between orthographically similar French/English cognates in bilingual speakers, and this priming was mediated, in part, by orthographic codes, given that priming for these items was dramatically reduced following a...

متن کامل

Cross-Language Distributions of High Frequency and Phonetically Similar Cognates

The coinciding form and meaning similarity of cognates, e.g. 'flamme' (French), 'Flamme' (German), 'vlam' (Dutch), meaning 'flame' in English, facilitates learning of additional languages. The cross-language frequency and similarity distributions of cognates vary according to evolutionary change and language contact. We compare frequency and orthographic (O), phonetic (P), and semantic similari...

متن کامل

A tool for detecting French - English cognates and false friends

Résumé. Les congénères sont des mots qui ont au moins un sens en commun entre deux langues en plus d‘avoir une orthographie semblable. La reconnaissance de ce type de mots permet aux apprenants de langue seconde ou étrangère d‘enrichir plus rapidement leur vocabulaire et d‘améliorer leur compréhension écrite. Toutefois, les faux amis sont des paires de mots qui à l‘écrit ont des similarités, ma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015