Releasing Microdata: Disclosure Risk Estimation, Data Masking and Assessing Utility
Abstract
Statistical agencies release sample microdata from social surveys under different modes of access. These range from Public Use Files (PUF), in the form of tables or highly perturbed datasets, to Microdata Under Contract (MUC) for researchers and licensed institutions, where levels of protection are less severe. In addition, statistical agencies often have on-site datalabs where registered researchers can access unperturbed statistical data. Statistical agencies will generally set up a panel of experts, the Microdata Review Panel (MRP), which has the authority to release microdata. To make informed decisions about the release of microdata, the MRP needs objective disclosure risk measures to determine tolerable risk thresholds according to the access mode. The panel also needs to monitor the application of data masking techniques and to ensure the quality and utility of the released microdata.
Similar resources
Global Measures of Data Utility for Microdata Masked for Disclosure Limitation
When releasing microdata to the public, data disseminators typically alter the original data to protect the confidentiality of database subjects’ identities and sensitive attributes. However, such alteration negatively impacts the utility (quality) of the released data. In this paper, we present quantitative measures of data utility for masked microdata, with the aim of improving disseminators’...
Multiplicative noise for masking numerical microdata with constraints
Before releasing databases which contain sensitive information about individuals, statistical agencies have to apply Statistical Disclosure Limitation (SDL) methods to such data. The goal of these methods is to minimize the risk of disclosure of the confidential information and at the same time provide legitimate data users with accurate information about the population of interest. SDL methods...
Microdata Protection
Governmental, public, and private organizations are more and more frequently required to make data available for external release in a selective and secure fashion. Most data are today released in the form of microdata, reporting information on individual respondents. The protection of microdata against improper disclosure is therefore an issue that has become increasingly important and will co...
Disclosure risk assessment in statistical microdata protection via advanced record linkage
The performance of Statistical Disclosure Control (SDC) methods for microdata (also called masking methods) is measured in terms of the utility and the disclosure risk associated with the protected microdata set. Empirical disclosure risk assessment based on record linkage stands out as a realistic and practical disclosure risk assessment methodology which is applicable to every conceivable maski...
A Theoretical Comparison of Data Masking Techniques for Numerical Microdata
In this study, we perform a comprehensive theoretical evaluation of masking techniques for numerical microdata. The objective of this comparison is to establish the extent to which existing techniques can satisfy disclosure risk, data utility, ease of implementation, and ease of use requirements. This evaluation allows data providers to select from these techniques to account for the demands of...