Handling Japanese Homophone Errors in Revision Support System for Japanese Texts; REVISE

نویسنده

  • Masahiro Oku
چکیده

Japanese texts frequently suffer from the homophone errors caused by the KANA-KANJI conversion needed to input the text. It is critical, therefore, for Japanese revision support systems to detect and to correct homophone errors. This paper proposes a method for detecting and correcting Japanese homophone errors in compound nouns. This method can not only detect Japanese homophone errors in compound nouns, but also can find the correct candidates for the detected errors automatically. Finding the correct candidates is one superiority of this method over existing methods. The basic idea of this method is that a compound noun component places some restrictions on the semantic categories of the adjoining words. The method accurately determines that a homophone is misused in a compound noun if one or both of its neighbors is not a member of the semantic set defined by the homophone. Also, the method successfully indicates the correct candidates for the detected homophone errors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detection of Japanese Homophone Errors by a Decision List Including a Written Word as a Default Evidence

In this paper, we propose a practical method to detect Japanese homophone errors in Japanese texts. It is very important to detect homophone errors in Japanese revision systems because Japanese texts suffer from homophone errors frequently. In order to detect homophone errors, we have only to solve the homophone problem. We can use the decision list to do it because the homophone problem is equ...

متن کامل

Is a FAN always FUN? Phonological and orthographic effects in bilingual visual word recognition.

A visual semantic categorization task in English was performed by native English speakers (Experiment 1) and late bilinguals whose first language was Japanese (Experiment 2) or Spanish (Experiment 3). In the critical conditions, the target word was a homophone of a correct category exemplar (e.g., A BODY OF WATER--SEE; cf. SEA) or a word that differed from the correct exemplar by a phonological...

متن کامل

Building a Corpus of Manually Revised Texts from Discourse Perspective

This paper presents building a corpus of manually revised texts which includes both before and after-revision information. In order to create such a corpus, we propose a procedure for revising a text from a discourse perspective, consisting of dividing a text to discourse units, organising and reordering groups of discourse units and finally modifying referring and connective expressions, each ...

متن کامل

The KEY to the ROCK: near-homophony in nonnative visual word recognition.

To test the hypothesis that native language (L1) phonology can affect the lexical representations of nonnative words, a visual semantic-relatedness decision task in English was given to native speakers and nonnative speakers whose L1 was Japanese or Arabic. In the critical conditions, the word pair contained a homophone or near-homophone of a semantically associated word, where a near-homophone...

متن کامل

Performance of Japanese Quails (Coturnix coturnix japonica) on Floor and Cage Rearing System in Sylhet, Bangladesh: Comparative Study

A total number of 66 day old Japanese quail chicks divided into 2 treatment groups (33 in each treatment) with 3 replications in each having 11 birds (male, 5 and female, 6) were reared on floor and in cage system for a period of 5 weeks to know the effect of rearing system on growth performance and carcass characteristics. At the age of 35 days, average body weight and feed intake were 102.15 ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994