Hierarchical Compact Cube for Range-Max Queries

نویسندگان

Sin Yeung Lee

Tok Wang Ling

Hua-Gang Li

چکیده

A range-max query finds the maximum value over all selected cells of an on-line analytical processing (OLAP) data cube where the selection is specified by ranges of contiguous values for each dimension. One of the approaches to process such queries is to precompute a prefix cube (PC), which is a cube of the same dimensionality and size as the original data cube, but with some pre-computed results stored in each cell. In this paper, we propose a new cube representation called Hierarchical Compact Cube, which is an hierarchical structure that stores not only the maximum value of all the children sub-cubes, but also stores one of the locations of the maximum values among the children sub-cubes. The storage requirement is much less than the prefix cube methods. Furthermore, both of our analysis and experiment results show that the average query time using our method is bounded by a constant independent on the number of data in the data cube, N. For a fixed dimension, the average update cost of our new structure in the worst case is also relatively low. It is only O(log N).

متن کامل

منابع مشابه

An Efficient Processing of Range-MIN/MAX Queries over Data Cube

On-Line Analytical Processing (OLAP) has become a crucial element of decision support systems. Since historical, summarized and consolidated data is used in OLAP, the concept of data cube is often used to provide multidimensional views for such information. Among range-aggregates that are typical operations over the data cube, we in this paper focus on efficient processing of range-MAX and rang...

متن کامل

Techniques for Speeding up Range-Max Queries in OLAP Data Cubes

A range-max query obtains the maximum over all selected cells of a data cube where the selection is speci ed by providing ranges of values for numeric dimensions. Our general approach to speeding up range-max queries is to precompute and store certain key information of the data cube. In [HAMS97], we gave a tree algorithm based on precomputed max over balanced hierarchical tree structures; a br...

متن کامل

A Clustered Dwarf Structure to Speed Up Queries on Data Cubes

Dwarf is a highly compressed structure, which compresses the cube by eliminating the semantic redundancies while computing a data cube. Although it has high compression ratio, Dwarf is slower in querying and more difficult in updating due to its structure characteristics. We all know that the original intention of data cube is to speed up the query performance, so we propose two novel clusterin...

متن کامل

Variable Sized Partitions for Range Query Algorithms

A range query applies an aggregation operation over all selected cells of an OLAP data cube where selection is specified by the range of contiguous values for each dimension. Many works have focused on efficiently computing range sum or range max queries. Most of these algorithms use a uniformly partitioning scheme for the data cube. In this paper, we improve on query costs of some of these exi...

متن کامل

A Parallel and Distributed Method for Computing High Dimensional MOLAP

Data cube has been playing an essential role in fast OLAP(on-line analytical processing) in many multidimensional data warehouse. We often execute range queries on aggregate cube computed by pre-aggregate technique in MOLAP. For the cube with d dimensions, it can generate 2 cuboids. But in a high-dimensional data warehouse (such as the applications of bioinformatics and statistical analysis, et...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2000

Hierarchical Compact Cube for Range-Max Queries

نویسندگان

چکیده

منابع مشابه

An Efficient Processing of Range-MIN/MAX Queries over Data Cube

Techniques for Speeding up Range-Max Queries in OLAP Data Cubes

A Clustered Dwarf Structure to Speed Up Queries on Data Cubes

Variable Sized Partitions for Range Query Algorithms

A Parallel and Distributed Method for Computing High Dimensional MOLAP

عنوان ژورنال:

اشتراک گذاری