On Estimating the Cardinality of Aggregate Views

نویسندگان

  • Paolo Ciaccia
  • Matteo Golfarelli
  • Stefano Rizzi
چکیده

Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. While the warehouse is under development and data are not available yet, the approaches based on accessing data cannot be adopted. This paper proposes an approach to estimate the cardinality of views based on a-priori information derived from the application domain. We face the problem by first computing satisfactory bounds for the cardinality, then by capitalizing on these bounds to determine a good probabilistic estimate for it. Bounds are determined by using, besides the functional dependencies expressed by the multidimensional scheme, additional domain-derived information in the form of cardinality constraints which may bound either the cardinality of a given view or the ratio between the cardinalities of two given views. In particular, we propose a bounding strategy which achieves an effective trade-off between the tightness of the bounds produced and the computational complexity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bounding the cardinality of aggregate views through domain-derived constraints

Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. This paper proposes an approach based on cardinality constraints, derived a-priori from the application domain, which may bound either the cardinality of a view or the ratio between the cardinalities of two views. We face the problem by first computing satisfactory bounds for ...

متن کامل

Using Domain-Derived Constraints

Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. While the warehouse is under development and data are not available yet, the approaches based on accessing data cannot be adopted. This paper reports on the progress of an ongoing research aimed at devising a comprehensive approach to estimate the cardinality of views based on...

متن کامل

Techniques for logical design and ef fi cient querying of data warehouses

Sommario Logical design of data warehouses (DW) encompasses the sequence of steps which, given a core work-load, determine the logical scheme for the DW. A key step in logical design is view materialization. In this paper we propose an original approach to materialization in which the workload is characterized by the presence of complex queries represented by Nested Generalized Projection/Selec...

متن کامل

Estimating the Parameters for Linking Unstandardized References with the Matrix Comparator

This paper discusses recent research on methods for estimating configuration parameters for the Matrix Comparator used for linking unstandardized or heterogeneously standardized references. The matrix comparator computes the aggregate similarity between the tokens (words) in a pair of references. The two most critical parameters for the matrix comparator for obtaining the best linking results a...

متن کامل

Estimating the second virial coefficients of some real gas mixtures and related thermodynamic views

Using the Gaussian 2003 software and MP2 /6 – 311+ G method for the C2H4 : O2, CO:Cl2 andCO2:CO2 pairs and MP2/6-311++G** method for the CO2:H2O pair and B3lyp/6-31G methodfor the O2:O2 pair the optimized interaction energies between two considered pair molecules ofstudied gases(C2H4:O2, CO:Cl2, CO2:H2O, O2:O2 and CO2:CO2 pairs) as a function of thedistances between the centers of two considere...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001