Efficient Summing over Sliding Windows

نویسندگان

  • Ran Ben-Basat
  • Gil Einziger
  • Roy Friedman
  • Yaron Kassner
چکیده

This paper considers the problem of maintaining statistic aggregates over the last W elements of a data stream. First, the problem of counting the number of 1’s in the last W bits of a binary stream is considered. A lower bound of Ω( 1 +logW ) memory bits forW -additive approximations is derived. This is followed by an algorithm whose memory consumption is O( 1 + logW ) bits, indicating that the algorithm is optimal and that the bound is tight. Next, the more general problem of maintaining a sum of the last W integers, each in the range of {0, 1, . . . , R}, is addressed. The paper shows that approximating the sum within an additive error of RW can also be done using Θ( 1 + logW ) bits for = Ω ( 1 W ) . For = o ( 1 W ) , we present a succinct algorithm which uses B · (1 + o(1)) bits, where B = Θ ( W log ( 1 W )) is the derived lower bound. We show that all lower bounds generalize to randomized algorithms as well. All algorithms process new elements and answer queries in O(1) worst-case time. 1998 ACM Subject Classification E.1 [Data Structures] Lists, stacks, and queues

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Network Measurements through Approximated Windows

Many networking applications require timely access to recent network measurements,which can be captured using a sliding window model. Maintaining such measurements isa challenging task due to the fast line speed and scarcity of fast memory in routers. Inthis work, we study the efficiency factor that can be gained by approximating the windowsize. That is, we allow the algorit...

متن کامل

Mining Recent Frequent Itemsets in Sliding Windows over Data Streams

This paper considers the problem of mining recent frequent itemsets over data streams. As the data grows without limit at a rapid rate, it is hard to track the new changes of frequent itemsets over data streams. We propose an efficient one-pass algorithm in sliding windows over data streams with an error bound guarantee. This algorithm does not need to refer to obsolete transactions when 316 C....

متن کامل

Framework for real-time clustering over sliding windows

Clustering queries over sliding windows require maintaining cluster memberships that change as windows slide. To address this, the Generic 2-phase Continuous Summarization framework (G2CS) utilizes a generation based window maintenance approach where windows are maintained over different time intervals. It provides algorithm independent and efficient sliding mechanisms for clustering queries wh...

متن کامل

Querying Sliding Windows Over Online Data Streams

A data stream is a real-time, continuous, ordered sequence of items generated by sources such as sensor networks, Internet traffic flow, credit card transaction logs, and on-line financial tickers. Processing continuous queries over data streams introduces a number of research problems, one of which concerns evaluating queries over sliding windows defined on the inputs. In this paper, we descri...

متن کامل

On Indexing Sliding Windows over Online Data Streams

We consider indexing sliding windows in main memory over on-line data streams. Our proposed data structures and query semantics are based on a division of the sliding window into sub-windows. By classifying windowed operators according to their method of execution, we motivate the need for two types of windowed indices: those which provide a list of attribute values and their counts for answeri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016