Scalable Privacy-Preserving Data Mining with Asynchronously Partitioned Datasets

Authors

  • Hiroaki Kikuchi
  • Daisuke Kagawa
  • Anirban Basu
  • Kazuhiko Ishii
  • Masayuki Terada
  • Sadayuki Hongo
Abstract

In the Naïve Bayes classification problem using a vertically partitioned dataset, the conventional scheme to preserve privacy of each partition uses a secure scalar product and is based on the assumption that the data is synchronised amongst common unique identities. In this paper, we attempt to discard this assumption in order to develop a more efficient and secure scheme to perform classification with minimal disclosure of private data. Our proposed scheme is based on the work by Vaidya and Clifton [1], which uses commutative encryption to perform secure set intersection so that the parties with access to the individual partitions have no knowledge of the intersection. The evaluations presented in this paper are based on experimental results, which show that our proposed protocol scales well with large sparse datasets.
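The abstract names commutative encryption as the building block for secure set intersection. The following is only a minimal illustrative sketch of that general idea, not the protocol from the paper or from [1]. It assumes an exponentiation cipher of the Pohlig-Hellman/SRA kind over a demo-sized prime; the names `P`, `keygen`, `hash_to_group`, `encrypt`, and `intersection_size` are hypothetical. Both parties are simulated in one process, and the sketch reveals the intersection size to whoever compares the ciphertexts, so it does not reproduce the stronger hiding property the abstract describes.

```python
import hashlib
import secrets
from math import gcd

# Assumption: a demo-sized prime modulus (2**127 - 1 is a Mersenne prime).
# A real deployment would need a much larger, carefully chosen group.
P = 2**127 - 1


def keygen() -> int:
    """Pick a secret exponent coprime to P - 1, so encryption is a permutation."""
    while True:
        k = secrets.randbelow(P - 3) + 2  # k in [2, P - 2]
        if gcd(k, P - 1) == 1:
            return k


def hash_to_group(identifier: str) -> int:
    """Map an identifier to an integer in [1, P - 1] via SHA-256."""
    digest = hashlib.sha256(identifier.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % (P - 1) + 1


def encrypt(value: int, key: int) -> int:
    """Commutative encryption: E_k(x) = x^k mod P, so E_a(E_b(x)) == E_b(E_a(x))."""
    return pow(value, key, P)


def intersection_size(ids_a, ids_b) -> int:
    """Simulate both parties locally and count the identifiers they share."""
    key_a, key_b = keygen(), keygen()
    # Each party encrypts its own hashed identifiers with its own key.
    enc_a = [encrypt(hash_to_group(i), key_a) for i in set(ids_a)]
    enc_b = [encrypt(hash_to_group(i), key_b) for i in set(ids_b)]
    # Each party then encrypts the other's ciphertexts with its own key.
    enc_ab = {encrypt(c, key_b) for c in enc_a}
    enc_ba = {encrypt(c, key_a) for c in enc_b}
    # Equal double ciphertexts correspond to equal identifiers.
    return len(enc_ab & enc_ba)
```

For example, `intersection_size(["u1", "u2", "u3"], ["u2", "u3", "u4"])` counts the two shared identifiers without either simulated party decrypting the other's values. Because the encryption commutes, the order in which the two keys are applied does not matter, which is what lets the doubly encrypted values be compared directly.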

Similar resources

Privacy Preserving Data mining with Reduced Communication overhead

In today's world privacy and security are more essential elements when data is shared. A fruitful direction for future data mining research will be the development of techniques that incorporate privacy concerns. Most of the methods use random permutation techniques to mask the data, for preserving the privacy of sensitive data. The approaches for privacy preserving data mining suffer from high...

Data Anonymization of Vertically Partitioned Data Using Mapreduce on Cloud

In the world of computers, cloud services, on large scale, are being offered by service providers. User wishes to share some private information that has been stored on the cloud server due to various reasons such as data analysis, data mining and so on. These things bring up a concern about privacy. Privacy preservation may be attained by Anonymization data sets via normalization for satisfyin...

Privacy Preserving Naïve Bayes Classifier for Vertically Partitioned Data

Privacy-Preserving Data Mining – developing models without seeing the data – is receiving growing attention. This paper assumes a privacy-preserving distributed data mining scenario: data sources collaborate to develop a global model, but must not disclose their data to others. Naïve Bayes is often used as a baseline classifier, consistently providing reasonable classification performance. This...

A Model Based Framework for Privacy Preserving Clustering Using SOM

Privacy has become an important issue in the progress of data mining techniques. Many laws are being enacted in various countries to protect the privacy of data. This privacy concern has been addressed by developing data mining techniques under a framework called privacy preserving data mining. Presently there are two main approaches popularly used -data perturbation and secure multiparty compu...

Journal title:
  • IEICE Transactions

Volume 96-A  Issue 

Pages  -

Publication date 2011