Article

Privacy leakage in multi-relational databases via pattern based semi-supervised learning

Authors:

Hui Xiong,

Michael Steinbach,

Vipin Kumar

CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management

Pages 355 - 356

https://doi.org/10.1145/1099554.1099664

Published: 31 October 2005

Abstract

In multi-relational databases, a view, which is a context- and content-dependent subset of one or more tables (or other views), is often used to preserve privacy by hiding sensitive information. However, recent developments in data mining present a new challenge for database security even when traditional database security techniques, such as database access control, are employed. This paper presents a data mining framework using semi-supervised learning that demonstrates the potential for privacy leakage in multi-relational databases. Many different types of semi-supervised learning techniques, such as the K-nearest neighbor (KNN) method, can be used to demonstrate privacy leakage. However, we also introduce a new approach to semi-supervised learning, hyperclique pattern based semi-supervised learning (HPSL), which differs from traditional semi-supervised learning approaches in that it considers the similarity among groups of objects instead of only pairs of objects. Our experimental results show that both the KNN and HPSL methods have the ability to compromise database security, although HPSL is better at this privacy violation than the KNN method.
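The contrast between pairwise and group-based similarity can be illustrated with a small sketch. The abstract does not spell out the HPSL procedure, so what follows is only one plausible reading, not the authors' implementation. It assumes each object visible through a view is a set of attribute ids, and it scores a group of objects by h-confidence, the measure used to define hyperclique patterns (support of the whole pattern divided by the largest support of any single member), with objects playing the role of items. The function names, the toy data, the group size, and the threshold are all hypothetical.

```python
from itertools import combinations


def h_confidence(group, objects):
    """h-confidence of a group of objects, each described by a set of attribute ids.

    Treating objects as items, this is the number of attributes shared by
    every object in the group divided by the largest attribute count of
    any single object in the group.
    """
    attr_sets = [objects[name] for name in group]
    shared = set.intersection(*attr_sets)
    largest = max(len(s) for s in attr_sets)
    return len(shared) / largest if largest else 0.0


def infer_labels(objects, labels, threshold=0.6, size=3):
    """Guess hidden labels for unlabeled objects (illustrative assumption).

    An unlabeled object receives a class label when it forms a group of
    `size` objects with labeled members of that class whose h-confidence
    reaches `threshold`. The best-scoring class wins.
    """
    inferred = {}
    classes = sorted(set(labels.values()))
    for target in sorted(o for o in objects if o not in labels):
        best_score, best_class = 0.0, None
        for klass in classes:
            members = sorted(o for o, c in labels.items() if c == klass)
            for peers in combinations(members, size - 1):
                score = h_confidence((target, *peers), objects)
                if score > best_score:
                    best_score, best_class = score, klass
        if best_score >= threshold:
            inferred[target] = best_class
    return inferred


# Toy data: attribute sets exposed by a view; the class label is the hidden field.
objects = {
    "a": {1, 2, 3, 4},
    "b": {1, 2, 3, 5},
    "c": {6, 7, 8},
    "d": {6, 7, 9},
    "x": {1, 2, 3},    # label hidden by the view
    "y": {6, 7, 10},   # label hidden by the view
    "z": {10, 11},     # label hidden by the view
}
labels = {"a": "A", "b": "A", "c": "B", "d": "B"}

inferred = infer_labels(objects, labels)
print(inferred)
```

In this toy setting, `x` and `y` each share most of their attributes with every labeled member of one class at once, so the group score clears the threshold and a hidden label is guessed, while `z` shares nothing with either class and is left alone. A KNN-style attack would instead compare each unlabeled object with labeled objects one pair at a time.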

Index Terms

1. Mathematics of computing
  1. Information theory

Published In

CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management

October 2005

854 pages

ISBN: 1595931406

DOI: 10.1145/1099554

General Chair: Otthein Herzog (University of Bremen, Germany)

Program Chairs: Hans-Jörg Schek (University for Health Sciences, Medical Informatics and Technology, Austria), Norbert Fuhr (University of Duisburg-Essen, Germany), Abdur Chowdhury (America Online, USA), Wilfried Teiken (IBM T.J. Watson Research Center, USA)

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

CIKM05

CIKM05: Conference on Information and Knowledge Management

October 31 - November 5, 2005

Bremen, Germany

Acceptance Rates

CIKM '05 Paper Acceptance Rate 77 of 425 submissions, 18%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Cited By

Han B, Tsang I, Xiao X, Chen L, Fung S, Yu C (2021) Privacy-Preserving Stochastic Gradual Learning. IEEE Transactions on Knowledge and Data Engineering 33:8 (3129-3140). Online publication date: 1-Aug-2021. https://doi.org/10.1109/TKDE.2020.2963977

Koufakou A (2017) An approximate representation of hypercliques. Journal of Intelligent Information Systems 48:2 (263-285). Online publication date: 1-Apr-2017. https://dl.acm.org/doi/10.1007/s10844-016-0409-4

Koufakou A (2014) Mining non-derivable hypercliques. Knowledge and Information Systems 41:1 (77-99). Online publication date: 1-Oct-2014. https://dl.acm.org/doi/10.1007/s10115-013-0660-8

Jafer Y, Viktor H, Paquet E (2012) Aggregation and privacy in multi-relational databases. Proceedings of the 2012 Tenth Annual International Conference on Privacy, Security and Trust (PST) (67-74). Online publication date: 16-Jul-2012. https://dl.acm.org/doi/10.1109/PST.2012.6297921

Xiong H, Steinbach M, Kumar V (2006) Privacy leakage in multi-relational databases: a semi-supervised learning perspective. The VLDB Journal — The International Journal on Very Large Data Bases 15:4 (388-402). Online publication date: 1-Nov-2006. https://dl.acm.org/doi/10.1007/s00778-006-0011-4
