Universal Sketches for the Frequency Negative Moments and Other Decreasing Streaming Sums
نویسندگان
چکیده
Given a stream with frequencies fd, for d ∈ [n], we characterize the space necessary for approximating the frequency negative moments Fp = ∑ |fd|, where p < 0 and the sum is taken over all items d ∈ [n] with nonzero frequency, in terms of n, , andm = ∑ |fd|. To accomplish this, we actually prove a much more general result. Given any nonnegative and nonincreasing function g, we characterize the space necessary for any streaming algorithm that outputs a (1± )-approximation to ∑ g(|fd|), where again the sum is over items with nonzero frequency. The storage required is expressed in the form of the solution to a relatively simple nonlinear optimization problem, and the algorithm is universal for (1± )-approximations to any such sum where the applied function is nonnegative, nonincreasing, and has the same or smaller space complexity as g. This partially answers an open question of Nelson (IITK Workshop Kanpur, 2009). 1998 ACM Subject Classification F.1.2 Models of Computation, F.2.1 Numerical Algorithms and Problems
منابع مشابه
Tight Lower Bound for Linear Sketches of Moments
The problem of estimating frequency moments of a data stream has attracted a lot of attention since the onset of streaming algorithms [AMS99]. While the space complexity for approximately computing the p moment, for p ∈ (0, 2] has been settled [KNW10], for p > 2 the exact complexity remains open. For p > 2 the current best algorithm uses O(n1−2/p logn) words of space [AKO11,BO10], whereas the l...
متن کاملAsymptotic Behavior of Weighted Sums of Weakly Negative Dependent Random Variables
Let be a sequence of weakly negative dependent (denoted by, WND) random variables with common distribution function F and let be other sequence of positive random variables independent of and for some and for all . In this paper, we study the asymptotic behavior of the tail probabilities of the maximum, weighted sums, randomly weighted sums and randomly indexed weighted sums of heavy...
متن کاملGeneralizing the Layering Method of Indyk and Woodruff: Recursive Sketches for Frequency-Based Vectors on Streams
In their ground-breaking paper, Indyk and Woodruff (STOC 05) showed how to compute the k-th frequency moment Fk (for k > 2) in space O(poly-log(n,m) · n1− 2 k ), giving the first optimal result up to poly-logarithmic factors in n and m (here m is the length of the stream and n is the size of the domain.) The method of Indyk and Woodruff reduces the problem of Fk to the problem of computing heav...
متن کاملZero-One Laws for Sliding Windows and Universal Sketches
Given a stream of data, a typical approach in streaming algorithms is to design a sophisticated algorithm with small memory that computes a specific statistic over the streaming data. Usually, if one wants to compute a different statistic after the stream is gone, it is impossible. But what if we want to compute a different statistic after the fact? In this paper, we consider the following fasc...
متن کاملStrong Laws for Weighted Sums of Negative Dependent Random Variables
In this paper, we discuss strong laws for weighted sums of pairwise negatively dependent random variables. The results on i.i.d case of Soo Hak Sung [9] are generalized and extended.
متن کامل