[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1014052.1014111acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

A generalized maximum entropy approach to bregman co-clustering and matrix approximation

Published: 22 August 2004 Publication History

Abstract

Co-clustering is a powerful data mining technique with varied applications such as text clustering, microarray analysis and recommender systems. Recently, an information-theoretic co-clustering approach applicable to empirical joint probability distributions was proposed. In many situations, co-clustering of more general matrices is desired. In this paper, we present a substantially generalized co-clustering framework wherein any Bregman divergence can be used in the objective function, and various conditional expectation based constraints can be considered based on the statistics that need to be preserved. Analysis of the co-clustering problem leads to the minimum Bregman information principle, which generalizes the maximum entropy principle, and yields an elegant meta algorithm that is guaranteed to achieve local optimality. Our methodology yields new algorithms and also encompasses several previously known clustering and co-clustering algorithms based on alternate minimization.

References

[1]
A. Banerjee, I. Dhillon, J. Ghosh, S. Merugu, and D. Modha. A generalized maximum entropy approach to bregman co-clustering and matrix approximation. Technical Report UTCS TR04-24, UT, Austin, 2004.
[2]
A. Banerjee, S. Merugu, I. Dhillon, and J. Ghosh. Clustering with Bregman divergences. In SDM, 2004.
[3]
Y. Censor and S. Zenios. Parallel Optimization: Theory, Algorithms, and Applications. Oxford University Press, 1998.
[4]
Y. Cheng and G. M. Church. Biclustering of expression data. In ICMB, pages 93--103, 2000.
[5]
H. Cho, I. S. Dhillon, Y. Guan, and S. Sra. Minimum sum squared residue co-clustering of gene expression data. In SDM, 2004.
[6]
T. M. Cover and J. A. Thomas. Elements of Information Theory. Wiley-Interscience, 1991.
[7]
I. Csiszar. Why least squares and maximum entropy? an axiomatic approach to inference for linear inverse problems. Annals of Statistics, 19:2032--2066, 1991.
[8]
I. Dhillon, S. Mallela, and D. Modha. Information-theoretic co-clustering. In KDD, pages 89--98, 2003.
[9]
J. A. Hartigan. Direct clustering of a data matrix. Journal of the American Statistical Association, 67(337):123--129, 1972.
[10]
T. Hofmann and J. Puzicha. Unsupervised learning from dyadic data. Technical Report TR-98-042, ICSI, Berkeley, 1998.
[11]
D. L. Lee and S. Seung. Algorithms for non-negative matrix factorization. In NIPS, 2001. 556--562.

Cited By

View all
  • (2024)Spatio-Temporal Heterogeneous Ensemble Learning Method for Predicting Land SubsidenceApplied Sciences10.3390/app1418833014:18(8330)Online publication date: 16-Sep-2024
  • (2024)A Survey of Co-ClusteringACM Transactions on Knowledge Discovery from Data10.1145/368179318:9(1-28)Online publication date: 25-Jul-2024
  • (2024)T-Distributed Stochastic Neighbor Embedding for Co-Representation LearningACM Transactions on Intelligent Systems and Technology10.1145/362782315:2(1-18)Online publication date: 22-Feb-2024
  • Show More Cited By

Index Terms

  1. A generalized maximum entropy approach to bregman co-clustering and matrix approximation

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2004
    874 pages
    ISBN:1581138881
    DOI:10.1145/1014052
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 August 2004

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Bregman divergences
    2. co-clustering
    3. matrix approximation

    Qualifiers

    • Article

    Conference

    KDD04

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)36
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 10 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Spatio-Temporal Heterogeneous Ensemble Learning Method for Predicting Land SubsidenceApplied Sciences10.3390/app1418833014:18(8330)Online publication date: 16-Sep-2024
    • (2024)A Survey of Co-ClusteringACM Transactions on Knowledge Discovery from Data10.1145/368179318:9(1-28)Online publication date: 25-Jul-2024
    • (2024)T-Distributed Stochastic Neighbor Embedding for Co-Representation LearningACM Transactions on Intelligent Systems and Technology10.1145/362782315:2(1-18)Online publication date: 22-Feb-2024
    • (2024)A comprehensive survey of fast graph clusteringVicinagearth10.1007/s44336-024-00008-31:1Online publication date: 13-Sep-2024
    • (2023)Trustworthy Recommender SystemsACM Transactions on Intelligent Systems and Technology10.1145/3627826Online publication date: 13-Oct-2023
    • (2023)Interpretable Imitation Learning with Symbolic RewardsACM Transactions on Intelligent Systems and Technology10.1145/3627822Online publication date: 13-Oct-2023
    • (2023)Movie Recommender System Using Parameter Tuning of User and Movie Neighbourhood via Co-ClusteringProcedia Computer Science10.1016/j.procs.2023.01.096218:C(1176-1183)Online publication date: 1-Jan-2023
    • (2023)Orthogonal parametric non-negative matrix tri-factorization with α-divergence for co-clusteringExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.120680231:COnline publication date: 30-Nov-2023
    • (2022)Co-clustering of multivariate functional data for the analysis of air pollution in the South of FranceThe Annals of Applied Statistics10.1214/21-AOAS154716:3Online publication date: 1-Sep-2022
    • (2022)Fast Flexible Bipartite Graph Model for Co-ClusteringIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3194275(1-12)Online publication date: 2022
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media