Optimizing Histogram Queries under Differential Privacy
نویسندگان
چکیده
Differential privacy is a robust privacy standard that hasbeen successfully applied to a range of data analysis tasks.But despite much recent work, optimal strategies for answer-ing a collection of related queries are not known.We propose the matrix mechanism, a new algorithm foranswering a workload of predicate counting queries. Givena workload, the mechanism requests answers to a differentset of queries, called a query strategy, which are answeredusing the standard Laplace mechanism. Noisy answers tothe workload queries are then derived from the noisy answersto the strategy queries. This two stage process can result ina more complex correlated noise distribution that preservesdifferential privacy but increases accuracy.We provide a formal analysis of the error of query answersproduced by the mechanism and investigate the problem ofcomputing the optimal query strategy in support of a givenworkload. We show this problem can be formulated as arank-constrained semidefinite program. Finally, we analyzetwo seemingly distinct techniques, whose similar behavior isexplained by viewing them as instances of the matrix mech-anism.
منابع مشابه
Differential Privacy and the Fat-Shattering Dimension of Linear Queries
In this paper, we consider the task of answering linear queries under the constraint of differential privacy. This is a general and wellstudied class of queries that captures other commonly studied classes, including predicate queries and histogram queries. We show that the accuracy to which a set of linear queries can be answered is closely related to its fat-shattering dimension, a property t...
متن کاملOn Linear Refinement of Differential Privacy-Preserving Query Answering
Recent work showed the necessity of incorporating a user’s background knowledge to improve the accuracy of estimates from noisy responses of histogram queries. Various types of constraints (e.g., linear constraints, ordering constraints, and range constraints) may hold on the true (non-randomized) answers of histogram queries. So the idea was to apply the constraints over the noisy responses an...
متن کاملDPCube: Differentially Private Histogram Release through Multidimensional Partitioning
Differential privacy is a strong notion for protecting individual privacy in privacy preserving data analysis or publishing. In this paper, we study the problem of differentially private histogram release for random workloads. We study two multidimensional partitioning strategies including: 1) a baseline cell-based partitioning strategy for releasing an equi-width cell histogram, and 2) an inno...
متن کاملLow-Rank Mechanism: Optimizing Batch Queries under Differential Privacy
Differential privacy is a promising privacy-preserving paradigm for statistical query processing over sensitive data. It works by injecting random noise into each query result, such that it is provably hard for the adversary to infer the presence or absence of any individual record from the published noisy results. The main objective in differentially private query processing is to maximize the...
متن کاملLow Rank Mechanism for Optimizing Batch Queries under Differential Privacy
Differential privacy is a promising privacy-preserving paradigm for statistical query processing over sensitive data. It works by injecting random noise into each query result, such that it is provably hard for the adversary to infer the presence or absence of any individual record from the published noisy results. The main objective in differentially private query processing is to maximize the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/0912.4742 شماره
صفحات -
تاریخ انتشار 2009