An Adaptive Kernel Method for Semi-supervised Clustering

Bojun Yan &
Carlotta Domeniconi

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4212)

Included in the following conference series: European Conference on Machine Learning

Abstract

Semi-supervised clustering uses limited background knowledge to aid unsupervised clustering algorithms. A recently introduced kernel method for semi-supervised clustering has been shown to outperform previous semi-supervised clustering approaches. However, the kernel's parameter is left to manual tuning, and the chosen value can strongly affect the quality of the results. The selection of kernel parameters therefore remains a critical and open problem when only limited supervision, provided as pairwise constraints, is available. In this paper, we derive a new optimization criterion that automatically determines the optimal parameter of an RBF (Gaussian) kernel directly from the data and the given constraints. Our approach integrates the constraints into the clustering objective function and optimizes the kernel parameter iteratively during the clustering process. Experimental comparisons on simulated and real data demonstrate the effectiveness and advantages of the proposed algorithm.
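
The abstract does not state the optimization criterion itself, so the following Python sketch is only an illustration of why the width of a Gaussian kernel matters when pairwise constraints are available. It is not the authors' algorithm. The function names, the toy data, and the `constraint_gap` score (mean feature-space distance of cannot-link pairs minus that of must-link pairs) are all assumptions introduced here for illustration.

```python
import numpy as np

def rbf_kernel(X, sigma):
    """Gaussian (RBF) kernel matrix: K[i, j] = exp(-||x_i - x_j||^2 / (2 * sigma^2))."""
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against small negative round-off
    return np.exp(-sq_dists / (2.0 * sigma ** 2))

def feature_space_sq_dist(K, i, j):
    """Squared distance between points i and j in the kernel-induced feature space."""
    return K[i, i] + K[j, j] - 2.0 * K[i, j]

def constraint_gap(X, sigma, must_link, cannot_link):
    """Hypothetical score (not from the paper): mean cannot-link distance
    minus mean must-link distance, both measured in feature space."""
    K = rbf_kernel(X, sigma)
    ml = np.mean([feature_space_sq_dist(K, i, j) for i, j in must_link])
    cl = np.mean([feature_space_sq_dist(K, i, j) for i, j in cannot_link])
    return cl - ml

# Toy data (assumed): two tight groups on a line, far apart from each other.
X = np.array([[0.0, 0.0], [0.1, 0.0], [3.0, 0.0], [3.1, 0.0]])
must_link = [(0, 1), (2, 3)]     # pairs that should share a cluster
cannot_link = [(0, 2), (1, 3)]   # pairs that should be separated

for sigma in (0.01, 1.0, 100.0):
    print(sigma, constraint_gap(X, sigma, must_link, cannot_link))
```

On this toy data a very small width makes every pair look equally distant, and a very large width makes every pair look nearly identical, so an intermediate width separates the constrained pairs best. This is the kind of sensitivity that motivates choosing the parameter from the data and constraints rather than by hand.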

Author information

Authors and Affiliations

Department of Information and Software Engineering, George Mason University, Fairfax, Virginia, 22030, USA
Bojun Yan & Carlotta Domeniconi

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

About this paper

Cite this paper

Yan, B., Domeniconi, C. (2006). An Adaptive Kernel Method for Semi-supervised Clustering. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science, vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_49

DOI: https://doi.org/10.1007/11871842_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer Science, Computer Science (R0)
