Query cost estimation through remote system contention states analysis over the Internet
نویسندگان
چکیده
Query processing over the Internet involving autonomous data sources is a major task in data integration. It requires the estimated costs of possible query plans in order to select the best one with the minimum cost. In this context, the cost of a query is affected by three factors: network congestion, server contention state, and complexity of the query. In this paper, we study the effects of both the network congestion and server contention state on the cost of a query. We refer to these two factors together as system contention states. We present a new approach to determining the system contention states by clustering the costs of a sample query. We construct two cost formulas for each of the system contention states respectively using the multiple regression process. When a new query is submitted, its system contention state is estimated first using either the time slides method or the statistical method. The cost of the query is then calculated using the corresponding cost formulas. The estimated cost of the query is further adjusted to improve its accuracy. Our experiments show that our methods can produce quite accurate cost estimates of the submitted queries to remote data sources over the Internet.
منابع مشابه
Determining Remote System Contention States in Query Processing over the Internet
In the environment of data integration over the Internet, three major factors affect the cost of a query: network congestion situation, server contention states (workload), and data/query complexity. In this paper, we concentrate on system contention states. For a remote data source, we first determine the total number of contention states of the system through applying clustering techniques to...
متن کاملDeveloping Cost Models with Qualitative Variables for Dynamic Multidatabase Environments
A major challenge for global query optimization in a multidatabase system (MDBS) is lack of local cost information at the global level due to local autonomy. A number of methods to derive local cost models have been suggested recently. However, these methods are only suitable for a static multidatabase environment. In this paper, we propose a new multi-states query sampling method to develop lo...
متن کاملRun Time Optimizations of Join Queries for Distributed Databases over the Internet
A new probe based run time optimization technique is developed and demonstrated in the context of an Internet based distributed database environment More and more common are database systems which are distributed across servers communicating via the Internet where a query at a given site might require data from remote sites Optimizing the response time of such queries is a challenging task due ...
متن کاملAn Adaptive Probe-Based Technique to Optimize Join Queries in Distributed Internet Databases
An adaptive probe based optimization technique is developed and demonstrated in the context of an Internet based distributed database environment More and more common are database sys tems which are distributed across servers communicating via the Internet where a query at a given site might require data from remote sites Optimizing the response time of such queries is a chal lenging task due t...
متن کاملQoS-based Data Access and Placement for Federated Information Systems
A wide variety of applications require access to multiple heterogeneous, distributed data sources. By transparently integrating such diverse data sources, underlying differences in DBMSs, languages, and data models can be hidden and users can use a single data model and a single highlevel query language to access the unified data through a global schema. To address the needs of such federated i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Web Intelligence and Agent Systems
دوره 2 شماره
صفحات -
تاریخ انتشار 2004