A Framework for Re-optimizing Repetitive Queries
نویسندگان
چکیده
In this paper, we develop a comprehensive framework forre-optimization of a large and useful set of queries, called repetitive queries. Repetitive queries refer tothosequeries that are likely to be used repeatedly or frequently in the future. Theydeserve more optimization efforts than ordinary ad hoc queries. In this research, weidentify statistics, called sufficient statistics,that are sufficient to compute the exact frequency distributions of the intermediate results of all plans ofa query. We present two innovative techniques to conductreoptimization, an eager and a lazy re-optimization. The eager approach gathersall the sufficient statistics for a query at once and generates the best plan. The lazy reoptimization gathers only the statistics that are needed to correctlarge estimation errors found in the plan and generates a revised plan. We further adapt the two basictechniques to constantly changing environmentsby continuously monitoring and revising the plans, called adaptive re-optimization. The adaptive re-optimization is devised to detectand remedy potential sub-optimality in the plans in a timely manner for the entire lifetime of the query. Our work realizes the promise made by the query optimizers, namely, executing queries in the optimal fashions, at least for the repetitive queries.
منابع مشابه
A Framework for Re-Optimization of Repetitive Queries
Optimizing executions of queries is the ultimate goal of the query optimizer. Unfortunately, due to the complexities of queries, accuracy of statistics, validities of assumptions, etc., query optimizers often cannot find the best execution plans in their search spaces, conveniently called the optimal plans, for the queries. In this paper, we develop a comprehensive framework for re-optimization...
متن کاملRelational Databases Query Optimization using Hybrid Evolutionary Algorithm
Optimizing the database queries is one of hard research problems. Exhaustive search techniques like dynamic programming is suitable for queries with a few relations, but by increasing the number of relations in query, much use of memory and processing is needed, and the use of these methods is not suitable, so we have to use random and evolutionary methods. The use of evolutionary methods, beca...
متن کاملSample Trace: Deriving Fast Approximation for Repetitive Queries
Repetitive queries refer to those queries that are likely to be executed repeatedly in the future. Queries such as those used to generate periodic reports, perform routine summarization and data analysis belong to this category. Repetitive queries can constitute a large portion of the daily activities of a database system, and thus deserve extra optimization efforts. In this paper, we propose t...
متن کاملParallel computation framework for optimizing trailer routes in bulk transportation
We consider a rich tanker trailer routing problem with stochastic transit times for chemicals and liquid bulk orders. A typical route of the tanker trailer comprises of sourcing a cleaned and prepped trailer from a pre-wash location, pickup and delivery of chemical orders, cleaning the tanker trailer at a post-wash location after order delivery and prepping for the next order. Unlike traditiona...
متن کاملDistributed Query Scheduling Service: An Architecture and Its Implementation
We present the systematic design and development of a distributed query scheduling service (DQS) in the context of DIOM, a distributed and interoperable query mediation system [26]. DQS consists of an extensible architecture for distributed query processing, a three-phase optimization algorithm for generating e cient query execution schedules, and a prototype implementation. Functionally, two i...
متن کامل