An Efficient Approach to Vague Joins in the Vague Query System
نویسندگان
چکیده
This article presents problems arisen when processing complex vague joins in the Vague Query System (VQS) and introduces a new solution to efficiently solve these problems. Join operation is one of the most expensive ones in database management systems. In context of the VQS, vague joins are prohibitively expensive in terms of IO-cost and CPUcost because they must undergo intermediate processing steps with the sheer volume of multidimensional data in multiple feature spaces. The VQS has been introduced to deal with the problem of empty result sets in the conventional Database Management Systems (DBMSs). It extends the query facility of the conventional DBMSs with similarity search capabilities. Concretely, when available data in a conventional database do not match a user’s query precisely, the corresponding DBMS will only return an empty result set to the user. This limits the applicability of the conventional DBMSs to domains where only crisp answers are meaningful. In many other application domains, however, users also expect not only the crisp results returned but also some other results that are relevant or close to the query in a certain sense. Such applications frequently appear in the real-world domains as image/multimedia processing, CAD systems, GIS, tourist information systems, time-series databases, digital libraries, information retrieval (IR), e-commerce, etc. The VQS, however, is a Flexible Query Answering System (FQAS). It has been designed to work ”on top” of the conventional DBMSs in order to return to users tuples of the query relation/view that do not match the query criteria exactly. The main features of the VQS are to employ the concept of NCR-Tables (Numeric Coordination Representation Tables) and to introduce a new query language called the Vague Query Language (VQL) that is an extended version of the SQL. NCR-Tables store (multidimensional) semantic metadata of attributes of the query relation/view. The VQL has introduced a new operator ”IS” (”similar to”) to formulate similar (vague) queries in the system. Unfortunately, the operator ”IS” cannot be used in a join condition. An extension of the VQL had been carried out to realize vague joins in the VQS. Nevertheless, for the sake of reducing costs, users can only get the good matches, not the best ones and the new system gives no guarantee about the goodness of the results. This paper will propose a new approach to those vague joins. The new approach not only significantly reduces the costs (I/Oand CPU-cost), but also returns to users the best match (or approximate nearest neighbors with a certain tolerant error e) in the query relation/view. Experimental results will show performance of the proposed approach. 1 Johannes Kepler University of Linz Altenberger Strasse 69, A-4040, Linz AUSTRIA/Europe [email protected], [email protected], and [email protected]
منابع مشابه
Efficient Vague Joins Processing in the VQS
Joins are among the most expensive operations in database management systems (DBMSs). In the context of the VQS (Vague Query System), which is a flexible query answering system built on top of DBMSs to provide them with similarity search/retrieval capabilities, vague joins are prohibitively expensive in terms of both IO-cost and CPU-cost because they must undergo intermediate processing steps w...
متن کاملA modified branch and bound algorithm for a vague flow-shop scheduling problem
Uncertainty plays a significant role in modeling and optimization of real world systems. Among uncertain approaches, fuzziness describes impreciseness while for ambiguity another definition is required. Vagueness is a probabilistic model of uncertainty being helpful to include ambiguity into modeling different processes especially in industrial systems. In this paper, a vague set based on dista...
متن کاملVague Language and Interpersonal Communication: An Analysis of Adolescent Intercultural Conversation
This paper is concerned with the analysis of the spoken language of teenagers, taken from a newly developed specialised corpus the British and Taiwanese Teenage Intercultural Communication Corpus (BATTICC). More specifically, the study employs a discourse analytical approach to examine vague language in an intercultural context among a group of British and Taiwanese adolescents, paying particul...
متن کاملVASA: An algebra for vague spatial data in databases
Many geographical applications deal with objects in space that cannot be adequately described by determinate, crisp spatial concepts because of their intrinsically indeterminate and vague nature. Geographical information systems and spatial database systems are currently unable to cope with this kind of data. To support the efficient representation, querying, and manipulation of vague spatial d...
متن کاملΕ-isa: an Incremental Lower Bound Approach for Efficiently Finding Approximate Nearest Neighbor of Complex Vague Queries
In our context, a complex vague query means a multifeature nearest neighbor query. Answering such queries requires the system to search on some feature spaces individually and then combine the searching results to find the final answers. The feature spaces are commonly multidimensional spaces and may consist of a vast amount of data. Therefore searching costs, including IO-cost and CPU-cost, ar...
متن کامل