Inexact Retrieval of Multiway Spatial Joins
نویسندگان
چکیده
Inexact retrieval of multiway spatial joins refers to the cases where (i) there is limited time for query processing and the goal is to retrieve the best possible solutions within this limit (ii) there is unlimited time and the goal is to retrieve a single exact solution, if such a solution exists, or the best approximate one otherwise. The first case is motivated by the high cost of join processing in realtime systems involving large amounts of multimedia data, while the second one is motivated by applications that require “negative” examples. This papers proposes several algorithms for query processing under theses conditions. For the limited-time case we develop some non-deterministic search heuristics that can quickly retrieve good solutions. However, these heuristics are not guaranteed to find the best solutions, even without a time limit. Therefore, for the unlimited-time case we propose systematic search algorithms tailored specifically for the efficient retrieval of a single solution. Both types of algorithms are integrated with R-trees in order to prune the search space. Our proposal is evaluated with extensive experimental comparison. Contact Author: Dimitris Papadias Tel: ++852-23586971 http://www.cs.ust.hk/~dimitris/ Fax: ++852-23581477 E-mail: [email protected] The Hong Kong University of Science & Technology Technical Report Series Department of Computer Science Inexact Retrieval of Multiway Spatial Joins Dimitris Papadias, Dinos Arkoumanis 2 Technical Report HKUST-CS00-05 December, 2000
منابع مشابه
Approximate Processing of Multiway Spatial Joins in Very Large Databases
Existing work on multiway spatial joins focuses on the retrieval of all exact solutions with no time limit for query processing. Depending on the query and data properties, however, exhaustive processing of multiway spatial joins can be prohibitively expensive due to the exponential nature of the problem. Furthermore, if there do not exist any exact solutions, the result will be empty even thou...
متن کاملSearch algorithms for multiway spatial joins
This papers deals with multiway spatial joins when (i) there is limited time for query processing and the goal is to retrieve the best possible solutions within this limit (ii) there is unlimited time and the goal is to retrieve a single exact solution, if such a solution exists, or the best approximate one otherwise. The first case is motivated by the high cost of join processing in real-time ...
متن کاملSharesSkew: An Algorithm to Handle Skew for Joins in MapReduce
In this paper, we investigate the problem of computing a multiway join in one round of MapReduce when the data may be skewed. We optimize on communication cost, i.e., the amount of data that is transferred from the mappers to the reducers. We identify join attributes values that appear very frequently, Heavy Hitters (HH). We distribute HH valued records to reducers avoiding skew by using an ada...
متن کاملProcessing Sliding Window Multi-Joins in Continuous Queries over Data Streams
We study sliding window multi-join processing in continuous queries over data streams. Several algorithms are reported for performing continuous, incremental joins, under the assumption that all the sliding windows fit in main memory. The algorithms include multiway incremental nested loop joins (NLJs) and multi-way incremental hash joins. We also propose join ordering heuristics to minimize th...
متن کاملAn image retrieval model based on fuzzy triples
This paper proposes an image retrieval model for in dexing and retrieving images with fuzzy triples. Th e well known triple framework based on 2 dimensional (2D) strings is a novel index mechanism in that it allows us to simply specify the spatial structure o f images, guaranteeing fast retrieval time. However , there are two significant drawbacks in this framewo rk; one is that it can not acco...
متن کامل