An AI-Based Causal Strategy for Securing Statistical Databases Using Micro-aggregation
نویسندگان
چکیده
Although Artificial Intelligent (AI) techniques have been used in various applications, their use in maintaining security in Statistical DataBases (SDBs) has not been reported. This paper presents results, that to the best of our knowledge is pioneering, by which concepts from causal networks are used to secure SDBs. We consider the MicroAggregation Problem (MAP) in secure SDBs which involves partitioning a set of individual records in a micro-data file into a number of mutually exclusive and exhaustive groups. This problem, which seeks for the best partition of the micro-data file, is known to be NP-hard, and has been tackled using many heuristic solutions. In this paper, we would like to demonstrate that in the process of developing MicroAggregation Techniques (MATs), it is expedient to incorporate AI-based causal information about the dependence between the random variables in the micro-data file. This can be achieved by pre-processing the micro-data before invoking any MAT, in order to extract the useful dependence information from the joint probability distribution of the variables in the micro-data file, and then accomplishing the microaggregation on the “maximally independent” variables. Our results, on artificial life data sets, show that including such information will enhance the process of determining how many variables are to be used, and which of them should be used in the microaggregation process.
منابع مشابه
A Fixed Structure Learning Automaton Micro-aggregation Technique for Secure Statistical Databases
We consider the problem of securing statistical databases and, more specifically, the micro-aggregation technique (MAT), which coalesces the individual records in the micro-data file into groups or classes, and on being queried, reports, for the all individual values, the aggregated means of the corresponding group. This problem is known to be NP-hard and has been tackled using many heuristic s...
متن کاملEnhancing Micro-Aggregation Technique by Utilizing Dependence-Based Information in Secure Statistical Databases
We consider the Micro-Aggregation Problem (MAP) in secure statistical databases which involves partitioning a set of individual records in a micro-data file into a number of mutually exclusive and exhaustive groups. This problem, which seeks for the best partition of the micro-data file, is known to be NP-hard, and has been tackled using many heuristic solutions. In this paper, we would like to...
متن کاملA Model for Representing Statistical Objects
In this paper the structure and the semantic properties of the entities stored in databases, whose data are only aggregate-type data, are defined and discussed. This choice is justified by the wide spread use of aggregate data without the corresponding raw data (i.e. micro-data, such as census data). Aggregate data are often derived by applying statistical aggregation (e.g. sum, count) and stat...
متن کاملInvestigating causal linkages and strategic mapping in the balanced scorecard: A case study approach in the banking industry sector
One of the main challenges of strategic management is implementing the strategies. Designing the strategy map in Balanced Scorecard framework to determine the causality between strategic objectives is one of the most important issues in implementing the strategies. In designing the strategy map with intuition and judgment, the link between strategic objectives is not clear and it is not obvious...
متن کاملProtecting Micro-data by Micro-aggregation: the Experience in Eurostat
A natural strategy to protect the confidentiality of individual data is to aggregate them at the lowest possible level. Some studies realised in Eurostat on this topic will be presented: properties of classifications in clusters of fixed sizes, micro-aggregation as a generic method to protect the confidentiality of individual data, application to the Community Innovation Survey. The work perfor...
متن کامل