Understanding Hierarchical Methods for Differentially Private Histograms
نویسندگان
چکیده
In recent years, many approaches to differentially privately publish histograms have been proposed. Several approaches rely on constructing tree structures in order to decrease the error when answer large range queries. In this paper, we examine the factors affecting the accuracy of hierarchical approaches by studying the mean squared error (MSE) when answering range queries. We start with one-dimensional histograms, and analyze how the MSE changes with different branching factors, after employing constrained inference, and with different methods to allocate the privacy budget among hierarchy levels. Our analysis and experimental results show that combining the choice of a good branching factor with constrained inference outperform the current state of the art. Finally, we extend our analysis to multidimensional histograms. We show that the benefits from employing hierarchical methods beyond a single dimension are significantly diminished, and when there are 3 or more dimensions, it is almost always better to use the Flat method instead of a hierarchy.
منابع مشابه
DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing
Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Releasing synthetic data that mimic original data with Differential privacy provides a promising way for privacy preserving data sharing and analytics while providing a rigorous privacy guarantee. However, to this date there is no open-source tools that allow users to genera...
متن کاملDifferentially Private Synthesization of Multi-Dimensional Data using Copula Functions
Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Most of the existing techniques that generate differentially private histograms or synthetic data only work well for single dimensional or low-dimensional histograms. They become problematic for high dimensional and large domain data due to increased perturbation error and c...
متن کاملDifferentially Private Projected Histograms: Construction and Use for Prediction
Privacy concerns are among the major barriers to efficient secondary use of information and data on humans. Differential privacy is a relatively recent measure that has received much attention in machine learning as it quantifies individual risk using a strong cryptographically motivated notion of privacy. At the core of differential privacy lies the concept of information dissemination through...
متن کاملEnd-to-End Differentially-Private Parameter Tuning in Spatial Histograms
ABSTRACT Dierentially-private histograms have emerged as a key tool for location privacy. While past mechanisms have included theoretical & experimental analysis, it has recently been observed that much of the existing literature does not fully provide dierential privacy. e missing component, private parameter tuning, is necessary for rigorous evaluation of these mechanisms. Instead works fr...
متن کاملISPE: Adaptive Differentially Private Data Release and Query Estimation
Although the mechanism of differential privacy provides a strong guarantee for privacy protection, it remains a key open problem to find efficient algorithms for non-interactive differentially private data release while maintaining good utility. In this paper, we propose an adaptive framework, called ISPE, to release differentially private histogram data through an interactive differentially pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- PVLDB
دوره 6 شماره
صفحات -
تاریخ انتشار 2013