Article

A generalized maximum entropy approach to bregman co-clustering and matrix approximation

Authors:

Dharmendra S. ModhaAuthors Info & Claims

KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 509 - 514

https://doi.org/10.1145/1014052.1014111

Published: 22 August 2004 Publication History

Get Access

Abstract

Co-clustering is a powerful data mining technique with varied applications such as text clustering, microarray analysis and recommender systems. Recently, an information-theoretic co-clustering approach applicable to empirical joint probability distributions was proposed. In many situations, co-clustering of more general matrices is desired. In this paper, we present a substantially generalized co-clustering framework wherein any Bregman divergence can be used in the objective function, and various conditional expectation based constraints can be considered based on the statistics that need to be preserved. Analysis of the co-clustering problem leads to the minimum Bregman information principle, which generalizes the maximum entropy principle, and yields an elegant meta algorithm that is guaranteed to achieve local optimality. Our methodology yields new algorithms and also encompasses several previously known clustering and co-clustering algorithms based on alternate minimization.

References

[1]

A. Banerjee, I. Dhillon, J. Ghosh, S. Merugu, and D. Modha. A generalized maximum entropy approach to bregman co-clustering and matrix approximation. Technical Report UTCS TR04-24, UT, Austin, 2004.

Digital Library

Google Scholar

[2]

A. Banerjee, S. Merugu, I. Dhillon, and J. Ghosh. Clustering with Bregman divergences. In SDM, 2004.

Digital Library

Google Scholar

[3]

Y. Censor and S. Zenios. Parallel Optimization: Theory, Algorithms, and Applications. Oxford University Press, 1998.

Digital Library

Google Scholar

[4]

Y. Cheng and G. M. Church. Biclustering of expression data. In ICMB, pages 93--103, 2000.

Digital Library

Google Scholar

[5]

H. Cho, I. S. Dhillon, Y. Guan, and S. Sra. Minimum sum squared residue co-clustering of gene expression data. In SDM, 2004.

Crossref

Google Scholar

[6]

T. M. Cover and J. A. Thomas. Elements of Information Theory. Wiley-Interscience, 1991.

Digital Library

Google Scholar

[7]

I. Csiszar. Why least squares and maximum entropy? an axiomatic approach to inference for linear inverse problems. Annals of Statistics, 19:2032--2066, 1991.

Crossref

Google Scholar

[8]

I. Dhillon, S. Mallela, and D. Modha. Information-theoretic co-clustering. In KDD, pages 89--98, 2003.

Digital Library

Google Scholar

[9]

J. A. Hartigan. Direct clustering of a data matrix. Journal of the American Statistical Association, 67(337):123--129, 1972.

Crossref

Google Scholar

[10]

T. Hofmann and J. Puzicha. Unsupervised learning from dyadic data. Technical Report TR-98-042, ICSI, Berkeley, 1998.

Google Scholar

[11]

D. L. Lee and S. Seung. Algorithms for non-negative matrix factorization. In NIPS, 2001. 556--562.

Digital Library

Google Scholar

Cited By

View all

Zhao BWu GLi JWu QDeng M(2024)Spatio-Temporal Heterogeneous Ensemble Learning Method for Predicting Land SubsidenceApplied Sciences10.3390/app1418833014:18(8330)Online publication date: 16-Sep-2024
https://doi.org/10.3390/app14188330
Wang HSong YChen WLuo ZLi CLi T(2024)A Survey of Co-ClusteringACM Transactions on Knowledge Discovery from Data10.1145/368179318:9(1-28)Online publication date: 25-Jul-2024
https://dl.acm.org/doi/10.1145/3681793
Chen WWang HZhang YDeng PLuo ZLi T(2024)T-Distributed Stochastic Neighbor Embedding for Co-Representation LearningACM Transactions on Intelligent Systems and Technology10.1145/362782315:2(1-18)Online publication date: 22-Feb-2024
https://dl.acm.org/doi/10.1145/3627823
Show More Cited By

Index Terms

A generalized maximum entropy approach to bregman co-clustering and matrix approximation
1. Computing methodologies
  1. Machine learning

Recommendations

Multitask Bregman clustering

Traditional clustering methods deal with a single clustering task on a single data set. In some newly emerging applications, multiple similar clustering tasks are involved simultaneously. In this case, we not only desire a partition for each task, but ...
A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximation

Co-clustering, or simultaneous clustering of rows and columns of a two-dimensional data matrix, is rapidly becoming a powerful data analysis technique. Co-clustering has enjoyed wide success in varied application domains such as text clustering, gene-...
Clustering based on matrix approximation: a unifying view

Clustering is the problem of identifying the distribution of patterns and intrinsic correlations in large data sets by partitioning the data points into similarity classes. Recently, a number of methods have been proposed and demonstrated good ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining

August 2004

874 pages

ISBN:1581138881

DOI:10.1145/1014052

General Chairs:
Won Kim
Cyber Database Solutions
,
Ronny Kohavi
Amazon.com
,
Program Chairs:
Johannes Gehrke
Cornell University
,
William DuMouchel
AT&T Labs Research

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 August 2004

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

KDD04

Sponsor:

KDD04: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 22 - 25, 2004

WA, Seattle, USA

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

107
Total Citations
View Citations
1,474
Total Downloads

Downloads (Last 12 months)36
Downloads (Last 6 weeks)3

Reflects downloads up to 10 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Zhao BWu GLi JWu QDeng M(2024)Spatio-Temporal Heterogeneous Ensemble Learning Method for Predicting Land SubsidenceApplied Sciences10.3390/app1418833014:18(8330)Online publication date: 16-Sep-2024
https://doi.org/10.3390/app14188330
Wang HSong YChen WLuo ZLi CLi T(2024)A Survey of Co-ClusteringACM Transactions on Knowledge Discovery from Data10.1145/368179318:9(1-28)Online publication date: 25-Jul-2024
https://dl.acm.org/doi/10.1145/3681793
Chen WWang HZhang YDeng PLuo ZLi T(2024)T-Distributed Stochastic Neighbor Embedding for Co-Representation LearningACM Transactions on Intelligent Systems and Technology10.1145/362782315:2(1-18)Online publication date: 22-Feb-2024
https://dl.acm.org/doi/10.1145/3627823
Xue JXing LWang YFan XKong LZhang QNie FLi X(2024)A comprehensive survey of fast graph clusteringVicinagearth10.1007/s44336-024-00008-31:1Online publication date: 13-Sep-2024
https://doi.org/10.1007/s44336-024-00008-3
Wang SZhang XWang YRicci F(2023)Trustworthy Recommender SystemsACM Transactions on Intelligent Systems and Technology10.1145/3627826Online publication date: 13-Oct-2023
https://dl.acm.org/doi/10.1145/3627826
Bougie NOnishi TTsuruoka Y(2023)Interpretable Imitation Learning with Symbolic RewardsACM Transactions on Intelligent Systems and Technology10.1145/3627822Online publication date: 13-Oct-2023
https://dl.acm.org/doi/10.1145/3627822
Airen SAgrawal J(2023)Movie Recommender System Using Parameter Tuning of User and Movie Neighbourhood via Co-ClusteringProcedia Computer Science10.1016/j.procs.2023.01.096218:C(1176-1183)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1016/j.procs.2023.01.096
Hoseinipour SAminghafari MMohammadpour A(2023)Orthogonal parametric non-negative matrix tri-factorization with α-divergence for co-clusteringExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.120680231:COnline publication date: 30-Nov-2023
https://dl.acm.org/doi/10.1016/j.eswa.2023.120680
Bouveyron CJacques JSchmutz ASimões FBottini S(2022)Co-clustering of multivariate functional data for the analysis of air pollution in the South of FranceThe Annals of Applied Statistics10.1214/21-AOAS154716:3Online publication date: 1-Sep-2022
https://doi.org/10.1214/21-AOAS1547
Chen WWang HLong ZLi T(2022)Fast Flexible Bipartite Graph Model for Co-ClusteringIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3194275(1-12)Online publication date: 2022
https://doi.org/10.1109/TKDE.2022.3194275
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Multitask Bregman clustering

A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximation

Clustering based on matrix approximation: a unifying view