A Framework for Cross-Language Information Access: Application to English and Japanese

نویسندگان

  • Gareth J. F. Jones
  • Nigel Collier
  • Tetsuya Sakai
  • Kazuo Sumita
  • Hideki Hirakawa
چکیده

Internet search engines allow access to online information from all over the world. However, there is currently a general assumption that users are uent in the languages of all documents that they might search for. This has for historical reasons usually been a choice between English and the locally supported language. Given the rapidly growing size of the Internet, it is likely that future users will need to access information in languages in which they are not uent or have no knowledge of at all. This paper shows how information retrieval and machine translation can be combined in a cross-language information access framework to help overcome the language barrier. We present encouraging preliminary experimental results using English queries to retrieve documents from the standard Japanese language BMIRJ2 retrieval test collection. We outline the scope and purpose of cross-language information access and provide an example application to suggest that technology already exists to provide e ective and potentially useful applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Examining the difficulty pathways of can-do statements from a localized version of the CEFR

The Japanese adaptation of the Common European Framework of Reference (CEFR-J) is a tailored version of the Common European Framework of Reference (CEFR), designed to better meet the needs of Japanese learners of English. The CEFR-J, like the CEFR, uses illustrative  descriptors  known  as  can-do  statements,  that  describe  achievement  goals  for five  skills  (listening,  reading,  spoken ...

متن کامل

Building Linked Open Data of Life Science Dictionary

There is a growing need for efficient and integrated access to databases provided by diverse institutions. Using a linked data design pattern allows the diverse data on the Internet to be linked effectively and accessed efficiently by computers. In addition, providing a dictionary to translate words into another language in Resource Description Framework (RDF) is useful to cross a language barr...

متن کامل

BRIDJE over a language barrier: cross-language information access by integrating translation and retrieval

This paper describes two new features of the BRIDJE system for cross-language information access. The first feature is the partial disambiguation function of the Bi-directional Retriever, which can be used for search request translation in cross-language IR. Its advantage over a “black-box” machine translation approach is consistent across five test collections and across two language permutati...

متن کامل

Preferred Lexical Access Route in Persian Learners of English: Associative, Semantic or Both

Background: Words in the Mental Lexicon (ML) construct semantic field through associative and/ or semantic connections, with a pervasive native speaker preference for the former. Non-native preferences, however, demand further inquiry. Previous studies have revealed inconsistent Lexical Access (LA) patterns due to the limitations in the methodology and response categorization. Objectives: To f...

متن کامل

A Comparison of ESLE Web-based English Vocabulary Learning Application with Traditional Desktop English Vocabulary Learning Application: Exceptional learner parents’ point of view

The aim of this study was to compare the Exceptional Student Learning English (ESLE) web application and traditional application and the evaluation of the ESLE app mainly from the exceptional student parents' perspective. To this end, five exceptional student parents with their exceptional children were selected among 30 parents in Isfahan in Isfahan province. Open-ended questionnaires were sen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computers and the Humanities

دوره 35  شماره 

صفحات  -

تاریخ انتشار 2001