Privacy-preserving naive Bayes classification on distributed data via semi-trusted mixers
نویسندگان
چکیده
Distributed data mining applications, such as those dealing with health care, finance, counter-terrorism and homeland defense, use sensitive data from distributed databases held by different parties. This comes into direct conflict with an individual’s need and right to privacy. It is thus of great importance to develop adequate security techniques In this paper, we consider privacy-preserving naive Bayes classifier for horizontally partitioned distributed data and propose a two-party protocol and a multi-party protocol to achieve it. Our multi-party protocol is built on the semi-trusted mixer model, in which each data site sends messages to two semi-trusted mixers, respectively, which run our two-party protocol and then broadcast the classification result. This model facilitates both trust management and implementation. Security analysis has showed that our two-party protocol is a private protocol and our multi-party protocol is a private protocol as long as the two mixers do not conclude. & 2008 Published by Elsevier B.V. D E 63
منابع مشابه
Performance Analysis of Privacy Preserving Naïve Bayes Classifiers for Distributed Databases
The problem of secure and fast distributed classification is an important one. The main focus of the paper is on privacy preserving distributed classification rule mining. This research paper addresses the performance analysis of privacy preserving Naïve Bayes classifiers for horizontal and vertical partitioned databases. The Naïve Bayes classifier is a simple but efficient baseline classifier....
متن کاملThird Party Privacy Preserving Protocol for Perturbation Based Classification of Vertically Fragmented Data Bases
Privacy is become major issue in distributed data mining. In the literature we can found many proposals of privacy preserving which can be divided into two major categories that is trusted third party and multiparty based privacy protocols. In case of trusted third party models the conventional asymmetric cryptographic based techniques will be used and in case of multi party based protocols dat...
متن کاملPrivacy Preserving Naïve Bayes Classifier for Horizontally Distribution Scenario Using Un-trusted Third Party
The aim of the classification task is to discover some kind of relationship between the input attributes and the output class, so that the discovered knowledge can be used to predict the class of a new unknown tuple. The problem of secure distributed classification is an important one. In many situations, data is split between multiple organizations. These organizations may want to utilize all ...
متن کاملPrivacy Preserving Näıve Bayes Classifier for Vertically Partitioned Data
Privacy-Preserving Data Mining – developing models without seeing the data – is receiving growing attention. This paper assumes a privacy-preserving distributed data mining scenario: data sources collaborate to develop a global model, but must not disclose their data to others. Näıve Bayes is often used as a baseline classifier, consistently providing reasonable classification performance. This...
متن کاملPrivacy Preserving Naive Bayes Classifier for Horizontally Partitioned Data
The problem of secure distributed classification is an important one. In many situations, data is split between multiple organizations. These organizations may want to utilize all of the data to create more accurate predictive models while revealing neither their training data / databases nor the instances to be classified. The Naive Bayes Classifier is a simple but efficient baseline classifie...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Inf. Syst.
دوره 34 شماره
صفحات -
تاریخ انتشار 2009