Toshiba BRIDJE at NTCIR-6 CLIR

نویسندگان

  • Tetsuya Sakai
  • Makoto Koyama
  • Tatsuya Izuha
  • Akira Kumano
  • Toshihiko Manabe
  • Tomoharu Kokubu
چکیده

At NTCIR-6 CLIR, Toshiba participated in the Monolingual and Bilingual IR tasks covering three topic languages (Japanese, English and Chinese) and one document language (Japanese). For Stage 1 (which is the usual ad hoc task using the new NTCIR6 topics), we submitted two DESCRIPTION runs and two TITLE runs for each topic language. Our first search strategy is Selective Sampling with Memory Resetting, and our second one is the Head/Lead method, which uses the Selective Sampling run as one of the components for data fusion. According to the Relaxed and Rigid Mean Average Precision statistics released by the organisers, we are the top performer in all six subtasks. For Stage 2 (which reused the NTCIR-3, 4 and 5 test collections), we repeated our two Stage 1 strategies in order to enable analysis across all four test collections. Moreover, we conducted some unofficial true relevance feedback experiments by exploiting the graded relevance data provided in the test collections. Our automatic run results show that the Head/Lead method slightly but consistently improves performance, while our unofficial “interactive” run results suggest that graded-relevance metrics favour graded relevance feedback while Average Precision favours binary relevance feedback. In addition, our significance tests suggest that the NTCIR-6 Japanese test collection is “harder” than previous collections.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Toshiba BRIDJE at NTCIR-4 CLIR: Monolingual/Bilingual IR and Flexible Feedback

Toshiba participated in the Monolingual/Bilingual tasks at NTCIR-4 CLIR using our CLIR system called BRIDJE. We submitted 24 runs covering three topic languages (Japanese, English and Chinese) and two document languages (Japanese and English) and achieved the highest performances in the E-J-D, CJ-D, C-J-T, E-E-D, J-E-D, J-E-T subtasks. We had 12 more runs which we were not allowed to submit due...

متن کامل

Toshiba BRIDJE at NTCIR-5 CLIR: Evaluation using Geometric Means

Toshiba participated in the Monolingual and Bilingual IR tasks at NTCIR-5 CLIR using the BRIDJE system. We submitted 24 runs covering three topic languages (Japanese, English and Chinese) and two document languages (Japanese and English), and achieved the highest performances in the E-J-T, EJ-D, C-J-T, C-J-D, J-E-T and J-E-D subtasks. This paper (re-)examines Partial Disambiguation and the Pivo...

متن کامل

Toshiba BRIDJE at NTCIR-5 CLIR

Toshiba participated in the Monolingual and Bilingual IR tasks at NTCIR-5 CLIR using the BRIDJE system. We submitted 24 runs covering three topic languages (Japanese, English and Chinese) and two document languages (Japanese and English), and achieved the highest performances in the E-J-T, EJ-D, C-J-T, C-J-D, J-E-T and J-E-D subtasks. This paper (re-)examines Partial Disambiguation and the Pivo...

متن کامل

Toshiba BRIDJE at NTCIR-6 CLIR: The Head/Lead Method and Graded Relevance Feedback

At NTCIR-6 CLIR, Toshiba participated in the Monolingual and Bilingual IR tasks covering three topic languages (Japanese, English and Chinese) and one document language (Japanese). For Stage 1 (which is the usual ad hoc task using the new NTCIR6 topics), we submitted two DESCRIPTION runs and two TITLE runs for each topic language. Our first search strategy is Selective Sampling with Memory Rese...

متن کامل

Toshiba KIDS at NTCIR-3: Japanese and English-Japanese IR

Toshiba participated in the Japanese monolingual track and the English-Japanese cross-language track at NTCIR-3, and achieved the highest retrieval performances among the DESCRIPTION runs. Our presubmission experiments using the KIDS system focused primarily on effective Japanese monolingual retrieval, and addressed several problems including term selection enhancement and ranked output combina...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007