Hierarchical Compact Cube for Range-Max Queries
نویسندگان
چکیده
A range-max query finds the maximum value over all selected cells of an on-line analytical processing (OLAP) data cube where the selection is specified by ranges of contiguous values for each dimension. One of the approaches to process such queries is to precompute a prefix cube (PC), which is a cube of the same dimensionality and size as the original data cube, but with some pre-computed results stored in each cell. In this paper, we propose a new cube representation called Hierarchical Compact Cube, which is an hierarchical structure that stores not only the maximum value of all the children sub-cubes, but also stores one of the locations of the maximum values among the children sub-cubes. The storage requirement is much less than the prefix cube methods. Furthermore, both of our analysis and experiment results show that the average query time using our method is bounded by a constant independent on the number of data in the data cube, N. For a fixed dimension, the average update cost of our new structure in the worst case is also relatively low. It is only O(log N).
منابع مشابه
An Efficient Processing of Range-MIN/MAX Queries over Data Cube
On-Line Analytical Processing (OLAP) has become a crucial element of decision support systems. Since historical, summarized and consolidated data is used in OLAP, the concept of data cube is often used to provide multidimensional views for such information. Among range-aggregates that are typical operations over the data cube, we in this paper focus on efficient processing of range-MAX and rang...
متن کاملTechniques for Speeding up Range-Max Queries in OLAP Data Cubes
A range-max query obtains the maximum over all selected cells of a data cube where the selection is speci ed by providing ranges of values for numeric dimensions. Our general approach to speeding up range-max queries is to precompute and store certain key information of the data cube. In [HAMS97], we gave a tree algorithm based on precomputed max over balanced hierarchical tree structures; a br...
متن کاملA Clustered Dwarf Structure to Speed Up Queries on Data Cubes
Dwarf is a highly compressed structure, which compresses the cube by eliminating the semantic redundancies while computing a data cube. Although it has high compression ratio, Dwarf is slower in querying and more difficult in updating due to its structure characteristics. We all know that the original intention of data cube is to speed up the query performance, so we propose two novel clusterin...
متن کاملVariable Sized Partitions for Range Query Algorithms
A range query applies an aggregation operation over all selected cells of an OLAP data cube where selection is specified by the range of contiguous values for each dimension. Many works have focused on efficiently computing range sum or range max queries. Most of these algorithms use a uniformly partitioning scheme for the data cube. In this paper, we improve on query costs of some of these exi...
متن کاملA Parallel and Distributed Method for Computing High Dimensional MOLAP
Data cube has been playing an essential role in fast OLAP(on-line analytical processing) in many multidimensional data warehouse. We often execute range queries on aggregate cube computed by pre-aggregate technique in MOLAP. For the cube with d dimensions, it can generate 2 cuboids. But in a high-dimensional data warehouse (such as the applications of bioinformatics and statistical analysis, et...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000