Maximum Utility-Minimum Information Loss Table Server Design for Statistical Disclosure Control of Tabular Data
نویسنده
چکیده
Statistical agencies typically serve a diverse group of end users with varying information needs. Accommodating the conflicting needs for information in combination with stringent rules for statistical disclosure limitation (SDL) of statistical information creates a special challenge. We provide a generic table server design for SDL of tabular data to meet this challenge. Our table server design works equally well with counts data and magnitude data, and is compatible with commonly used cell perturbation methods and cell suppression methods used for the statistical disclosure control of sensitive tabular data. We demonstrate the scope and the effectiveness of our table server design on counts and magnitude data by using a simplified controlled tabular adjustment procedure proposed by Dandekar (2003). In addition to ad hoc queries, the information compiled using our table server design could be used to capture multi-way interactions of counts data and magnitude data either in a static environment or in dynamic mode.
منابع مشابه
On Assessing the Disclosure Risk of Controlled Adjustment Methods for Statistical Tabular Data
Minimum distance controlled tabular adjustment is a recent perturbative approach for statistical disclosure control in tabular data. Given a table to be protected, it looks for the closest safe table, using some particular distance. Controlled adjustment is known to provide high data utility. However, the disclosure risk has only been partially analyzed using theoretical results from optimizati...
متن کاملAssessing the Information Loss of Controlled Adjustment Methods in Two-Way Tables
Minimum distance controlled tabular adjustment (CTA) is a perturbative technique of statistical disclosure control for tabular data. Given a table to be protected, CTA looks for the closest safe table by solving an optimization problem using some particular distance in the objective function. CTA has shown to exhibit a low disclosure risk. The purpose of this work is to show that CTA also provi...
متن کاملStatistical Disclosure Control Methods for Census Frequency Tables
This paper provides a review of common statistical disclosure control (SDC) methods implemented at Statistical Agencies for standard tabular outputs containing whole population counts from a Census (either enumerated or based on a register). These methods include record swapping on the microdata prior to its tabulation and rounding of entries in the tables after they are produced. The approach ...
متن کاملWorking Paper ENGLISH ONLY UNITED NATIONS ECONOMIC COMMISSION FOR EUROPE (UNECE) CONFERENCE OF EUROPEAN STATISTICIANS EUROPEAN COMMISSION STATISTICAL OFFICE OF THE EUROPEAN
Minimum distance controlled tabular adjustment (CTA) is a recent perturbative approach for statistical disclosure control in tabular data. CTA looks for the closest safe table, using some particular distance. In this talk we provide empirical results to assess the disclosure risk of the method. A set of 33 instances from the literature and four different attacker scenarios are considered. The r...
متن کاملA CTA Model Based on the Huber Function
Minimum distance controlled tabular adjustment (CTA) is an emerging perturbative method of statistical disclosure control for tabular data. The goal of CTA is to find the closest safe table to some original tabular data with sensitive information. Closeness is usually measured by 1 or 2 distances. Distance 1 provides solutions with a smaller 0 norm than 2 (i.e., with a lesser number of changes ...
متن کامل