Resource-limited information retrieval in Web-based environments

Authors

  • Daan Velthausz
  • Henk Eertink
Abstract

Commercial use of the Internet is increasing. It is therefore likely that information providers will increasingly charge end-users for the information they provide on such networks, and the need will arise for information retrieval procedures that take such costs into account. Currently, however, little support exists for cost-effective information retrieval in a networked environment with multiple information providers. In this paper we present a framework for resource-limited information retrieval that enables a user to search for relevant information under time and cost constraints, e.g. to handle information needs such as "retrieve the five most relevant images containing white monkeys as fast as possible, but within 1 minute, for less than $4".

We focus in our work on the 'retrieval strategy'. In our terminology, this strategy is the functional entity that is responsible for handling the query and that decides which information providers are queried and which objects (if any) are retrieved. We present such a strategy for resource-limited information retrieval (see also Velthausz, Eertink, Verhoosel, & Schot, 1997).

We assume that the objects are modelled using the ADMIRE information model (Velthausz, Bal & Eertink, 1996). This model facilitates the aggregation and propagation of information that characterises reachable information objects. The composite relationships in the object hierarchy enable a bottom-up propagation and aggregation of the characterisations of lower-layer objects. This information can subsequently be used to estimate the relevance of unexplored information objects. The use of summarised information to describe particular aspects of the lower-layer nodes in a hierarchy has also been reported by Garcia-Molina, Gravano & Shivakumar (1996) for the content characterisation of (hierarchical) databases containing textual documents. We have adapted some of their ideas for our prototype for web-based information.
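
The bottom-up aggregation described above can be illustrated with a small sketch. It is not the ADMIRE model itself: the `Node` class, the use of raw keyword counts as the characterisation, and the cosine measure for the relevance estimate are all assumptions made for illustration.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass, field
from math import sqrt


@dataclass
class Node:
    """Hypothetical information object; not the actual ADMIRE schema."""
    name: str
    terms: Counter = field(default_factory=Counter)    # the object's own keyword counts
    children: list[Node] = field(default_factory=list)  # composite relationships


def aggregate(node: Node) -> Counter:
    """Sum the node's keyword counts with those of all its descendants."""
    summary = Counter(node.terms)
    for child in node.children:
        summary += aggregate(child)
    return summary


def cosine(query: Counter, summary: Counter) -> float:
    """Cosine similarity between two sparse keyword vectors."""
    dot = sum(weight * summary[term] for term, weight in query.items())
    norm = sqrt(sum(v * v for v in query.values())) * sqrt(
        sum(v * v for v in summary.values())
    )
    return dot / norm if norm else 0.0


# Assumed toy hierarchy: one site with two pages.
site = Node("site", children=[
    Node("page_a", Counter({"monkey": 2, "white": 1})),
    Node("page_b", Counter({"car": 3})),
])
summary = aggregate(site)
estimate = cosine(Counter({"white": 1, "monkey": 1}), summary)
```

Here `estimate` is computed from the summary held at the top node only, so the two pages do not have to be visited to obtain a first relevance estimate for the subtree.
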
In the prototype environment, we assume that the information provider supplies the characterisation of the information (either generated automatically from text files or, currently, written by hand for multimedia information). The retrieval algorithm currently used in our prototype is the well-known vector-based keyword text-retrieval algorithm, with our own adaptations for time and cost constraints. These adaptations are based on Russell and Wefald's decision-theoretic metareasoning (Russell & Wefald, 1991). In this paper, we first explain the context of our work. Subsequently, we explain how our strategy works in principle. Then, we show how the strategy can be applied to an ADMIRE-based information model that is generated from a WWW site.
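
To make the idea of retrieval under time and cost constraints concrete, the following sketch selects objects for the example query from the abstract (at most five results, within 60 seconds, for less than $4). It is a plain greedy heuristic, not the metareasoning-based strategy of the paper. The `Candidate` fields, the `select` function, and all numbers are hypothetical, and retrievals are assumed to run one after another so that their times add up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical description of a retrievable object."""
    name: str
    relevance: float  # estimated relevance in [0, 1]
    seconds: float    # estimated retrieval time
    price: float      # estimated charge in dollars


def select(candidates, max_results, time_budget, cost_budget):
    """Greedily pick the most relevant candidates that still fit both budgets."""
    chosen = []
    time_left, money_left = time_budget, cost_budget
    for cand in sorted(candidates, key=lambda c: c.relevance, reverse=True):
        if len(chosen) == max_results:
            break
        if cand.seconds <= time_left and cand.price <= money_left:
            chosen.append(cand)
            time_left -= cand.seconds
            money_left -= cand.price
    return chosen


# Assumed estimates for four candidate images.
pool = [
    Candidate("img1", 0.9, 30.0, 2.5),
    Candidate("img2", 0.8, 40.0, 1.0),
    Candidate("img3", 0.6, 20.0, 1.0),
    Candidate("img4", 0.3, 5.0, 0.25),
]
picked = select(pool, max_results=5, time_budget=60.0, cost_budget=4.0)
```

In this example `img2` is skipped because it no longer fits the remaining time after `img1` has been chosen. A decision-theoretic strategy would instead weigh the expected gain of each retrieval against its time and cost before committing to it.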

Similar resources

Assessing the Internal Structure of the Ellis Information Retrieval Model in Order to Present the Persian Norm of Web Retrieval Tools

Introduction: This study evaluated the internal structure of the Ellis information-seeking model in the student community, with the aim of presenting the Persian norm. Methods: This is a descriptive-analytical study conducted as a cross-sectional survey in the second semester of the academic year 1399-1400. The population comprised 280 graduate students at Ahvaz Jundishapur University of Medical Scien...

Google for the Linguist on a Budget

In this paper, we present GLB, yet another open source and free system to create and exploit linguistic corpora gathered from the web. A simple, robust web crawl algorithm, a multi-dimensional information retrieval tool, and a crude parallelization mechanism are proposed, especially for researchers working in resource-limited environments.

Adaptive Information Analysis in Higher Education Institutes

Information integration plays an important role in academic environments, since it provides a comprehensive view of education data and enables managers to analyze and evaluate the effectiveness of education processes. However, the problem with traditional information integration is the lack of personalization due to weak information resources or the unavailability of analysis functionality. In this ...

Chaotic Genetic Algorithm based on Explicit Memory with a new Strategy for Updating and Retrieval of Memory in Dynamic Environments

Many of the problems considered in optimization and learning assume that solutions exist in a dynamic environment. Hence, algorithms are required that dynamically adapt to the problem's conditions and search new conditions. In most cases, using information from the past allows quick adaptation after changes. This is the idea underlying the use of memory in this field, which involves key design issue...

Comparison of Information Retrieval Capabilities in Library Software of Payam, Voyager and Aleph

The purpose of this study was to compare the information retrieval capabilities of the web-based library software Payam with those of Voyager and ALEPH. A checklist was designed that included six main traits for evaluating and comparing 73 scales. Data were collected through expert observation of each software's OPAC and analyzed using descriptive statistics. Findings show the preferences in search capabilities i...

Publication date: 1997