One-Pass Wavelet Synopses for Maximum-Error Metrics

نویسندگان

  • Panagiotis Karras
  • Nikos Mamoulis
چکیده

We study the problem of computing waveletbased synopses for massive data sets in static and streaming environments. A compact representation of a data set is obtained after a thresholding process is applied on the coefficients of its wavelet decomposition. Existing polynomial-time thresholding schemes that minimize maximum error metrics are disadvantaged by impracticable time and space complexities and are not applicable in a data stream context. This is a cardinal issue, as the problem at hand in its most practically interesting form involves the time-efficient approximation of huge amounts of data, potentially in a streaming environment. In this paper we fill this gap by developing efficient and practicable wavelet thresholding algorithms for maximum-error metrics, for both a static and a streaming case. Our algorithms achieve near-optimal accuracy and superior runtime performance, as our experiments show, under frugal space requirements in both contexts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Workload-Based Wavelet Synopses

This paper introduces workload-based wavelet synopses, which exploit query workload information to significantly boost accuracy in approximate query processing. We show that wavelet synopses can adapt effectively to workload information, and that they have significant advantages over previous approaches. An important aspect of our approach is optimizing synopses constructions toward error metri...

متن کامل

A Fast Approximation Scheme for Probabilistic Wavelet Synopses

Several studies have demonstrated the effectiveness of Haar wavelets in reducing large amounts of data down to compact wavelet synopses that can be used to obtain fast, accurate approximate query answers. While Haar wavelets were originally designed for minimizing the overall root-mean-squared (i.e., L2-norm) error in the data approximation, the recently-proposed idea of probabilistic wavelet s...

متن کامل

Building Data Synopses Within a Known Maximum Error Bound

The constructions of Haar wavelet synopses for large data sets have proven to be useful tools for data approximation. Recently, research on constructing wavelet synopses with a guaranteed maximum error has gained attention. Two relevant problems have been proposed: One is the size bounded problem that requires the construction of a synopsis of a given size to minimize the maximum error. Another...

متن کامل

Improved implementation and experimental evaluation of the max-error optimized wavelet synopses

This paper provides an improved implementation of an algorithm for building wavelet synopses for max-error metrics, recently introduced by Garofalakis and Kumar (GK) [4]. Given a storage space of size M , the GK algorithm finds a wavelet synopsis of size M , which minimizes the max (absolute or relative) error, measured over the data values, with respect to any other wavelet synopsis of size M ...

متن کامل

Efficient Haar+ Synopsis Construction for the Maximum Absolute Error Measure

Several wavelet synopsis construction algorithms were previously proposed based on dynamic programming for unrestricted Haar wavelet synopses as well as Haar synopses. However, they find an optimal synopsis for every incoming value in each node of a coe cient tree, even if di↵erent incoming values share an identical optimal synopsis. To alleviate the limitation, we present novel algorithms, whi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005