On Estimating the Cardinality of Aggregate Views
نویسندگان
چکیده
Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. While the warehouse is under development and data are not available yet, the approaches based on accessing data cannot be adopted. This paper proposes an approach to estimate the cardinality of views based on a-priori information derived from the application domain. We face the problem by first computing satisfactory bounds for the cardinality, then by capitalizing on these bounds to determine a good probabilistic estimate for it. Bounds are determined by using, besides the functional dependencies expressed by the multidimensional scheme, additional domain-derived information in the form of cardinality constraints which may bound either the cardinality of a given view or the ratio between the cardinalities of two given views. In particular, we propose a bounding strategy which achieves an effective trade-off between the tightness of the bounds produced and the computational complexity.
منابع مشابه
Bounding the cardinality of aggregate views through domain-derived constraints
Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. This paper proposes an approach based on cardinality constraints, derived a-priori from the application domain, which may bound either the cardinality of a view or the ratio between the cardinalities of two views. We face the problem by first computing satisfactory bounds for ...
متن کاملUsing Domain-Derived Constraints
Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. While the warehouse is under development and data are not available yet, the approaches based on accessing data cannot be adopted. This paper reports on the progress of an ongoing research aimed at devising a comprehensive approach to estimate the cardinality of views based on...
متن کاملTechniques for logical design and ef fi cient querying of data warehouses
Sommario Logical design of data warehouses (DW) encompasses the sequence of steps which, given a core work-load, determine the logical scheme for the DW. A key step in logical design is view materialization. In this paper we propose an original approach to materialization in which the workload is characterized by the presence of complex queries represented by Nested Generalized Projection/Selec...
متن کاملEstimating the Parameters for Linking Unstandardized References with the Matrix Comparator
This paper discusses recent research on methods for estimating configuration parameters for the Matrix Comparator used for linking unstandardized or heterogeneously standardized references. The matrix comparator computes the aggregate similarity between the tokens (words) in a pair of references. The two most critical parameters for the matrix comparator for obtaining the best linking results a...
متن کاملEstimating the second virial coefficients of some real gas mixtures and related thermodynamic views
Using the Gaussian 2003 software and MP2 /6 – 311+ G method for the C2H4 : O2, CO:Cl2 andCO2:CO2 pairs and MP2/6-311++G** method for the CO2:H2O pair and B3lyp/6-31G methodfor the O2:O2 pair the optimized interaction energies between two considered pair molecules ofstudied gases(C2H4:O2, CO:Cl2, CO2:H2O, O2:O2 and CO2:CO2 pairs) as a function of thedistances between the centers of two considere...
متن کامل