[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2396761.2398498acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Relational co-clustering via manifold ensemble learning

Published: 29 October 2012 Publication History

Abstract

Co-clustering targets on grouping the samples and features simultaneously. It takes advantage of the duality between the samples and features. In many real-world applications, the data points or features usually reside on a submanifold of the ambient Euclidean space, but it is nontrivial to estimate the intrinsic manifolds in a principled way. In this study, we focus on improving the co-clustering performance via manifold ensemble learning, which aims to maximally approximate the intrinsic manifolds of both the sample and feature spaces. To achieve this, we develop a novel co-clustering algorithm called Relational Multi-manifold Co-clustering (RMC) based on symmetric nonnegative matrix tri-factorization, which decomposes the relational data matrix into three matrices. This method considers the inter-type relationship revealed by the relational data matrix and the intra-type information reflected by the affinity matrices. Specifically, we assume the intrinsic manifold of the sample or feature space lies in a convex hull of a group of pre-defined candidate manifolds. We hope to learn an appropriate convex combination of them to approach the desired intrinsic manifold. To optimize the objective, the multiplicative rules are utilized to update the factorized matrices and the entropic mirror descent algorithm is exploited to automatically learn the manifold coefficients. Experimental results demonstrate the superiority of the proposed algorithm.

References

[1]
A. Beck and M. Teboulle. Mirror descent and nonlinear projected subgradient methods for convex optimization. Operations Research Letters, 31(3):167--175, 2003.
[2]
M. Belkin, P. Niyogi, and V. Sindhwani. Manifold regularization: a geometric framework for learning from examples. JMLR, 7:2399--2434, 2006.
[3]
D. Cai, X. He, and J. Han. Document clustering using locality preserving indexing. IEEE TKDE, 17(12):1624--1637, 2005.
[4]
D. Cai, X. He, and J. Han. Locally consistent concept factorization for document clustering. IEEE TKDE, 23(6):902--913, 2011.
[5]
D. Cai, X. He, J. Han, and T. Huang. Graph regularized nonnegative matrix factorization for data representation. IEEE TPAMI, 33(8):1548--1560, 2011.
[6]
D. Cai, X. He, X. Wang, H. Bao, and J. Han. Locality preserving nonnegative matrix factorization. In IJCAI, 2009.
[7]
D. Cai, X. He, X. Wu, and J. Han. Non-negative matrix factorization on manifold. In ICDM, 2008.
[8]
D. Cai, X. Wang, and X. He. Probabilistic dyadic data analysis with local and global consistency. In ICML, 2009.
[9]
W. Chen, Y. Song, H. Bai, C. Lin, and E. Chang. Parallel spectral clustering in distributed systems. IEEE TPAMI, 33(3):568--586, 2011.
[10]
I. Dhillon. Co-clustering documents and words using bipartite spectral graph partitioning. In SIGKDD, 2001.
[11]
C. Ding, T. Li, and M. Jordan. Convex and semi-nonnegative matrix factorizations. IEEE TPAMI, 32(1):45--55, 2010.
[12]
C. Ding, T. Li, W. Peng, and H. Park. Orthogonal nonnegative matrix t-factorizations for clustering. In SIGKDD, 2006.
[13]
B. Geng, C. Xu, D. Tao, L. Yang, and X. Hua. Ensemble manifold regularization. In CVPR, 2009.
[14]
Q. Gu and J. Zhou. Co-clustering on manifolds. In SIGKDD, 2009.
[15]
D. Lee, H. Seung, et al. Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755):788--791, 1999.
[16]
W. Liu, K. Yuan, and D. Ye. On α-divergence based nonnegative matrix factorization for clustering cancer gene expression data. Artificial Intelligence in Medicine, 44(1):1--5, 2008.
[17]
B. Long, Z. Zhang, X. Wu, and P. Yu. Spectral clustering for multi-type relational data. In ICML, 2006.
[18]
B. Long, Z. Zhang, and P. Yu. Co-clustering by block value decomposition. In SIGKDD, pages 635--640, 2005.
[19]
M. Rege, M. Dong, and F. Fotouhi. Co-clustering documents and words using bipartite isoperimetric graph partitioning. In ICDM, 2006.
[20]
A. Statnikov, C. Aliferis, I. Tsamardinos, D. Hardin, and S. Levy. A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics, 21(5):631--643, 2005.
[21]
H. Wang, H. Huang, and C. Ding. Simultaneous clustering of multi-type relational data via symmetric nonnegative matrix tri-factorization. In CIKM, 2011.
[22]
H. Wang, F. Nie, H. Huang, and C. Ding. Nonnegative matrix tri-factorization based high-order co-clustering and its fast implementation. In ICDM, 2011.

Cited By

View all

Index Terms

  1. Relational co-clustering via manifold ensemble learning

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
      October 2012
      2840 pages
      ISBN:9781450311564
      DOI:10.1145/2396761
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 29 October 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. co-clustering
      2. entropic mirror descent algorithm
      3. manifold ensemble learning
      4. nonnegative matrix tri-factorization

      Qualifiers

      • Short-paper

      Conference

      CIKM'12
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)7
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 31 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)A Survey of Co-ClusteringACM Transactions on Knowledge Discovery from Data10.1145/368179318:9(1-28)Online publication date: 25-Jul-2024
      • (2023)NMF and Manifold Learning for Multi-aspect DataMulti-aspect Learning10.1007/978-3-031-33560-0_3(51-76)Online publication date: 28-Jul-2023
      • (2023)Non-negative Matrix Factorization-Based Multi-aspect Data ClusteringMulti-aspect Learning10.1007/978-3-031-33560-0_2(27-50)Online publication date: 28-Jul-2023
      • (2022)Dual graph-regularized sparse concept factorization for clusteringInformation Sciences: an International Journal10.1016/j.ins.2022.05.101607:C(1074-1088)Online publication date: 1-Aug-2022
      • (2020)A Novel Approach to Learning Consensus and Complementary Information for Multi-View Data Clustering2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00080(865-876)Online publication date: Apr-2020
      • (2019)Multi-type Relational Data Clustering for Community Detection by Exploiting Content and Structure Information in Social NetworksPRICAI 2019: Trends in Artificial Intelligence10.1007/978-3-030-29911-8_42(541-554)Online publication date: 23-Aug-2019
      • (2018)Learning Association Relationship and Accurate Geometric Structures for Multi-Type Relational Data2018 IEEE 34th International Conference on Data Engineering (ICDE)10.1109/ICDE.2018.00053(509-520)Online publication date: Apr-2018
      • (2018)A Novel Technique of Using Coupled Matrix and Greedy Coordinate Descent for Multi-view Data RepresentationWeb Information Systems Engineering – WISE 201810.1007/978-3-030-02925-8_20(285-300)Online publication date: 21-Oct-2018
      • (2018)Clustering Multi-View Data Using Non-negative Matrix Factorization and Manifold Learning for Effective Understanding: A Survey PaperLinking and Mining Heterogeneous and Multi-view Data10.1007/978-3-030-01872-6_9(201-227)Online publication date: 27-Nov-2018
      • (2017)A Kernel Probabilistic Model for Semi-supervised Co-clustering EnsembleJournal of Intelligent Systems10.1515/jisys-2017-051329:1(143-153)Online publication date: 30-Dec-2017
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media