User Satisfaction Task: A Proposal for NTCIR-7
Abstract
Good test collections, coupled with good evaluation metrics, are very useful for evaluating Information Access systems efficiently. But useful to whom? The in vitro (or Cranfield) evaluation paradigm has been criticised, mainly because of the absence of the user. On the other hand, user-in-the-loop evaluations are expensive, unrepeatable and often inconclusive. In light of this, we propose a new task for NTCIR that aims to directly measure the correlation between user satisfaction and evaluation metric values. To this end, we plan to reuse NTCIR-5 and NTCIR-6 Japanese monolingual newspaper test collections from the crosslingual task. Our final goal is to design new evaluation metrics that accurately approximate user satisfaction scores.
Similar resources
A Simple Baseline Method for NTCIR-7 MuST T2N Task -Yokohama National University at NTCIR-7 MuST T2N-
We participated in the free task and the T2N task of NTCIR-7 MuST. In this paper, we will report our participation in the T2N task. The system we prepared was a very simple and straightforward one. It will serve as a baseline for the T2N task. It consists of the following four modules: i) Element expression extractor, ii) Element expression combiner, iii) Date information canonicalizer, and iv)...
Overview of the NTCIR-10 1CLICK-2 Task
This is an overview of the NTCIR-10 1CLICK-2 task (the second One Click Access task). Given a search query, 1CLICK aims to satisfy the user with a single textual output instead of a ranked list of URLs. Systems are expected to present important pieces of information first and to minimize the amount of text the user has to read. We designed English and Japanese 1CLICK tasks, in which 10 research...
Experiments in Finding Chinese and Japanese Answer Documents at NTCIR-7
We describe evaluation experiments conducted by submitting retrieval runs for the natural language Simplified Chinese, Traditional Chinese and Japanese questions of the Information Retrieval for Question Answering (IR4QA) Task of the Advanced Crosslingual Information Access (ACLIA) Task Cluster of the 7th NII Test Collection for IR Systems Workshop (NTCIR-7). In a sampling experiment, we found ...
Overview of the VisEx task at NTCIR-9
Interactive Visual Exploration (VisEx) is a pilot task at NTCIR-9 for establishing an efficient and effective framework for objectively evaluating interactive and explorative information access environments. It aims to acquire more useful and richer evaluation data based on empirical user studies, by adopting a common framework for the environments and conducting sophisticated experiments. Four...
Context-Aware Recommender Systems: A Review of the Structure Research
Recommender systems are a branch of information retrieval and matching systems that, by identifying the interests and needs of the user, help users find the desired information or service among a massive selection of choices. In recent years, recommender systems have applied descriptive information about the user, such as location, time, and task, in order to produce re...