Inference using noisy degrees: Differentially Private $\beta$-model and synthetic graphs
نویسندگان
چکیده
Releasing the exact degree sequence of a graph for analysis may violate privacy. However, the degree sequence of a graph is an important summary statistic that is used in many statistical models. Hence a natural starting point is to release a private version of the degree sequence. A graphical degree partition is a monotonic degree sequence such that there exists a simple graph realizing the sequence. Ensuring graphicalness of the released degree partition is a desirable property for many statistical inference procedures. We present an algorithm to release a graphical degree partition of a graph under the framework of differential privacy. Unlike previous algorithms, our algorithm allows an analyst to perform meaningful statistical inference from the released degree partition. We focus on the statistical inference tasks of existence of maximum likelihood estimates, parameter estimation and goodness of fit testing for the random graph model where the degree partition is a sufficient statistic, called the beta model. We show the usefulness of our algorithm for performing statistical inference for the beta model by evaluating it empirically on simulated and real datasets. As the degree partition is graphical, our algorithm can also be used to release synthetic graphs.
منابع مشابه
Differentially Private Graphical Degree Sequences and Synthetic Graphs
Releasing the exact degree sequence of a graph for analysis may violate privacy. However, the degree sequence of a graph is an important summary statistic that is used in many statistical models. Hence a natural starting point is to release a private version of the degree sequence. A graphical degree partition is a monotonic degree sequence such that there exists a simple graph realizing the se...
متن کاملDifferentially Private Learning of Undirected Graphical Models Using Collective Graphical Models
We investigate the problem of learning discrete, undirected graphical models in a differentially private way. We show that the approach of releasing noisy sufficient statistics using the Laplace mechanism achieves a good trade-off between privacy, utility, and practicality. A naive learning algorithm that uses the noisy sufficient statistics “as is” outperforms general-purpose differentially pr...
متن کاملDifferentially Private Exponential Random Graphs
We propose methods to release and analyze synthetic graphs in order to protect privacy of individual relationships captured by the social network. Proposed techniques aim at fitting and estimating a wide class of exponential random graph models (ERGMs) in a differentially private manner, and thus offer rigorous privacy guarantees. More specifically, we use the randomized response mechanism to r...
متن کاملA Two-Phase Algorithm for Differentially Private Frequent Subgraph Mining
Mining frequent subgraphs from a collection of input graphs is an important task for exploratory data analysis on graph data. However, if the input graphs contain sensitive information, releasing discovered frequent subgraphs may pose considerable threats to individual privacy. In this paper, we study the problem of frequent subgraph mining (FSM) under the rigorous differential privacy model. W...
متن کاملDenoising and Untangling Graphs Using Degree Priors
This paper addresses the problem of untangling hidden graphs from a set of noisy detections of undirected edges. We present a model of the generation of the observed graph that includes degree-based structure priors on the hidden graphs. Exact inference in the model is intractable; we present an efficient approximate inference algorithm to compute edge appearance posteriors. We evaluate our mod...
متن کامل