[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A Survey of Co-Clustering

Published: 20 November 2024 Publication History

Abstract

Co-clustering is to cluster samples and features simultaneously, which can also reveal the relationship between row clusters and column clusters. Therefore, lots of scientists have drawn much attention to conduct extensive research on it, and co-clustering is widely used in recommendation systems, gene analysis, medical data analysis, natural language processing, image analysis, and social network analysis. In this article, we survey the entire research aspect of co-clustering, especially the latest advances in co-clustering, and discover the current research challenges and future directions. First, due to different views from researchers on the definition of co-clustering, this article summarizes the definition of co-clustering and its extended definitions, as well as related issues, based on the perspectives of various scientists. Second, existing co-clustering techniques are approximately categorized into four classes: information-theory-based, graph-theory-based, matrix-factorization-based, and other theories-based. Third, co-clustering is applied in various aspects such as recommendation systems, medical data analysis, natural language processing, image analysis, and social network analysis. Furthermore, 10 popular co-clustering algorithms are empirically studied on 10 benchmark datasets with 4 metrics—accuracy, purity, block discriminant index, and running time, and their results are objectively reported. Finally, future work is provided to get insights into the research challenges of co-clustering.

References

[1]
Séverine Affeldt, Lazhar Labiod, and Mohamed Nadif. 2020. Ensemble block co-clustering: A unified framework for text data. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, 5–14.
[2]
Séverine Affeldt, Lazhar Labiod, and Mohamed Nadif. 2021. Regularized bi-directional co-clustering. Statistics and Computing 31, 3 (2021), 32.
[3]
Melissa Ailem, François Role, and Mohamed Nadif. 2017. Model-based co-clustering for the effective handling of sparse data. Pattern Recognition 72 (2017), 108–122.
[4]
Rana Ali Amjad, Clemens Blöchl, and Bernhard C. Geiger. 2019. A generalized framework for Kullback–Leibler Markov aggregation. IEEE Trans. Automat. Control 65, 7 (2019), 3068–3075.
[5]
Katy S Azoury and Manfred K. Warmuth. 2001. Relative loss bounds for on-line density estimation with the exponential family of distributions. Machine Learning 43 (2001), 211–246.
[6]
Arindam Banerjee, Inderjit Dhillon, Joydeep Ghosh, Srujana Merugu, and Dharmendra S. Modha. 2004. A generalized maximum entropy approach to bregman co-clustering and matrix approximation. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 509–514.
[7]
Arindam Banerjee, Srujana Merugu, Inderjit S Dhillon, Joydeep Ghosh, and John Lafferty. 2005. Clustering with Bregman divergences. Journal of Machine Learning Research 6, 10 (2005), 1705–1749.
[8]
BingKun Bao, Weiqing Min, Teng Li, and Changsheng Xu. 2015. Joint Local and Global Consistency on Interdocument and Interword Relationships for Co-Clustering. IEEE Transactions on Cybernetics 45 (2015), 15–28.
[9]
Lejla Batina, Benedikt Gierlichs, Emmanuel Prouff, Matthieu Rivain, FrançoisXavier Standaert, and Nicolas VeyratCharvillon. 2011. Mutual information analysis:A comprehensive study. Journal of Cryptology 24, 2 (2011), 269–291.
[10]
Laurent R. Bergń, Charles Bouveyron, Marco Corneli, and Pierre Latouche. 2019. The latent topic block model for the co-clustering of textual interaction data. Computational Statistics & Data Analysis 137 (2019), 247–270.
[11]
Haixia Bi, Jian Sun, and Zongben Xu. 2017. Unsupervised PolSAR image classification using discriminative clustering. IEEE Transactions on Geoscience and Remote Sensing 55, 6 (2017), 3531–3544.
[12]
CharlesEdmond Bichot. 2010. Co-clustering documents and words by minimizing the normalized cut objective function. Journal of Mathematical Modelling and Algorithms 9 (2010), 131–147.
[13]
Steffen Bickel and Tobias Scheffer. 2004. Multi-view clustering. In 4th IEEE International Conference on Data Mining (ICDM ’04), 19–26.
[14]
Clemens Blochl, Rana Ali Amjad, and Bernhard C. Geiger. 2019. Co-Clustering via Information-Theoretic Markov Aggregation. IEEE Transactions on Knowledge and Data Engineering 31 (2019), 720–732.
[15]
R. E. Bonner. 1964. On Some Clustering Techniques. IBM Journal of Research and Development 8, 1 (1964), 22–32. DOI:
[16]
N. Del Buono and G. Pio. 2015. Non-negative matrix tri-factorization for co-clustering: An analysis of the block matrix. Information Sciences 301 (2015), 13–26.
[17]
Collins Census, Hongjun Wang, Ji Zhang, Ping Deng, and Tianrui Li. 2019. Particle subswarms collaborative clustering. IEEE Transactions on Computational Social Systems 6, 6 (2019), 1165–1179.
[18]
Wei Chen, Hongjun Wang, Zhiguo Long, and Tianrui Li. 2023. Fast flexible bipartite graph model for co-clustering. IEEE Transactions on Knowledge and Data Engineering 35, 7 (2023), 6930–6940.
[19]
Xiaojun Chen, Weijun Hong, Feiping Nie, Dan He, Min Yang, and Joshua Zhexue Huang. 2018. Spectral clustering of large-scale data by directly solving normalized cut. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 1206–1215.
[20]
Xiaojun Chen, Joshua Z. Huang, Qingyao Wu, and Min Yang. 2019. Subspace weighting co-clustering of gene expression data. IEEE/ACM Transactions on Computational Biology and Bioinformatics 16 (2019), 352–364.
[21]
Yufu Chen, Zhiqi Lei, Yanghui Rao, Haoran Xie, Fu Lee Wang, Jian Yin, and Qing Li. 2022. Parallel non-negative matrix tri-factorization for text data co-clustering. IEEE Transactions on Knowledge and Data Engineering 35, 5 (2022), 5132–5146.
[22]
Ying Chen, ChunGuang Li, and Chong You. 2020. Stochastic sparse subspace clustering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4155–4164.
[23]
Laizhong Cui, Sudipta Acharya, Sumit Mishra, Yi Pan, and Joshua Zhexue Huang. 2022. MMCo-Clus—An evolutionary co-clustering algorithm for gene selection. IEEE Transactions on Knowledge and Data Engineering 34, 9 (2022), 4371–4384. DOI:
[24]
Wenyuan Dai, Gui-Rong Xue, Qiang Yang, and Yong Yu. 2007. Co-clustering based classification for out-of-domain documents. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 210–219.
[25]
Francisco de A.T. De Carvalho, Antonio Balzanella, Antonio Irpino, and Rosanna Verde. 2021. Co-clustering algorithms for distributional data with automated variable weighting. Information Sciences 549 (2021), 87–115.
[26]
Ping Deng, Tianrui Li, Hongjun Wang, ShiJinn Horng, Zeng Yu, and Xiaomin Wang. 2021. Tri-regularized nonnegative matrix tri-factorization for co-clustering. Knowledge-Based Systems 226 (2021), 107101.
[27]
Meghana Deodhar and Joydeep Ghosh. 2007. A framework for simultaneous co-clustering and learning from complex data. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2007), 250–259.
[28]
Meghana Deodhar and Joydeep Ghosh. 2010. SCOAL: A framework for simultaneous co-clustering and learning from complex data. ACM Transactions on Knowledge Discovery from Data 4, 3, Article 11 (Oct 2010), 31 pages. DOI:
[29]
Inderjit S. Dhillon. 2001. Co-clustering documents and words using bipartite spectral graph partitioning. In Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 269–274.
[30]
Inderjit S. Dhillon, Subramanyam Mallela, and Dharmendra S. Modha. 2003. Information-theoretic co-clustering. Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 89–98.
[31]
Chris Ding, Tao Li, Wei Peng, and Haesun Park. 2006. Orthogonal nonnegative matrix t-factorizations for clustering. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 126–135.
[32]
Chris H. Q. Ding, Xiaofeng He, Hongyuan Zha, Ming Gu, and Horst D. Simon. 2001. A min-max cut algorithm for graph partitioning and data clustering. In Proceedings 2001 IEEE International Conference on Data Mining, 107–114.
[33]
Liang Du and Yi-Dong Shen. 2013. Towards robust co-clustering. In Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, 1317–1322.
[34]
Liang Feng, Qianchuan Zhao, and Cangqi Zhou. 2020. Improving performances of top-N recommendations with co-clustering method. Expert Systems with Applications 143 (2020), 113078.
[35]
Dayne Freitag. 2004. Trained named entity recognition using distributional clusters. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 262–269.
[36]
Bin Gao, TieYan Liu, Xin Zheng, QianSheng Cheng, and WeiYing Ma. 2005. Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering. In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, 41–50.
[37]
Bernhard C. Geiger, Tatjana Petrov, Gernot Kubin, and Heinz Koeppl. 2014. Optimal Kullback–Leibler aggregation via information bottleneck. IEEE Transactions on Automatic Control 60, 4 (2014), 1010–1022.
[38]
Quanquan Gu and Jie Zhou. 2009. Co-clustering on manifolds. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 359–368.
[39]
Francesco Gullo, AKM Khaled Talukder, Sean Luke, Carlotta Domeniconi, and Andrea Tagarelli. 2012. Multiobjective optimization of co-clustering ensembles. In Proceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Computation, 1495–1496.
[40]
Ting Guo, Shirui Pan, Xingquan Zhu, and Chengqi Zhang. 2019. CFOND: Consensus factorization for co-clustering networked data. IEEE Transactions on Knowledge and Data Engineering 31 (2019), 706–719.
[41]
Bara’a A. Attea, Wisam A. Hariz, and Mayyadah F. Abdulhalim 2016. Improving the performance of evolutionary multi-objective co-clustering models for community detection in complex social networks. Swarm and Evolutionary Computation 26 (2016), 137–156.
[42]
John A. Hartigan. 1972. Direct clustering of a data matrix. Journal of the American Statistical Association 67, 337 (1972), 123–129.
[43]
Jian Hou, Aihua Zhang, and Naiming Qi. 2020. Density peak clustering based on relative density relationship. Pattern Recognition 108 (2020), 107554.
[44]
Shizhe Hu, Xiaoqiang Yan, and Yangdong Ye. 2020. Dynamic auto-weighted multi-view co-clustering. Pattern Recognition 99 (2020), 107101.
[45]
Dong Huang, ChangDong Wang, JianSheng Wu, JianHuang Lai, and CheeKeong Kwoh. 2019. Ultra-scalable spectral clustering and ensemble clustering. IEEE Transactions on Knowledge and Data Engineering 32, 6 (2019), 1212–1226.
[46]
JihJeng Huang. 2017. Using topic and subjectivity analysis for overlapped co-clustering documents. In 2017 IEEE 3rd International Conference on Multimedia Big Data (BigMM), 105–108.
[47]
Shudong Huang, Zenglin Xu, Zhao Kang, and Yazhou Ren. 2020a. Regularized nonnegative matrix factorization with adaptive local structure learning. Neurocomputing 382 (2020), 196–209.
[48]
Shudong Huang, Zenglin Xu, Ivor W. Tsang, and Zhao Kang. 2020b. Auto-weighted multi-view co-clustering with bipartite graphs. Information Sciences 512 (2020), 18–30.
[49]
WenLiang Hung and JennHwai Yang. 2015. Automatic clustering algorithm for fuzzy data. Journal of Applied Statistics 42, 7 (2015), 1503–1518.
[50]
Syed Fawad Hussain and Muhammad Haris. 2019. A k-means based co-clustering (kCC) algorithm for sparse, high dimensional data. Expert Systems with Applications 118 (2019), 20–34.
[51]
Syed Fawad Hussain and Shahid Iqbal. 2018. CCGA: Co-similarity based co-clustering using genetic algorithm. Applied Soft Computing 72 (2018), 30–42.
[52]
Syed Fawad Hussain, Khadija Khan, and Rashad Jillani. 2021. Weighted multi-view co-clustering (WMVCC) for sparse data. Applied Intelligence 52 (2021), 398–416.
[53]
Syed Fawad Hussain and Muhammad Ramazan. 2016. Biclustering of human cancer microarray data using co-similarity based co-clustering. Expert Systems with Applications 55 (2016), 520–531.
[54]
Dino Ienco, Cńline Robardet, Ruggero G. Pensa, and Rosa Meo. 2012. Parameter-less co-clustering for star-structured heterogeneous data. Data Mining and Knowledge Discovery 26 (2012), 217–254.
[55]
Julien Jacques and Christophe Biernacki. 2018. Model-based co-clustering for ordinal data. Computational Statistics & Data Analysis 123 (2018), 101–115.
[56]
Andrzej Jaszkiewicz. 2002. Genetic local search for multi-objective combinatorial optimization. European Journal of Operational Research 137, 1 (2002), 50–71.
[57]
Yugang Ji, Chuan Shi, Yuan Fang, Xiangnan Kong, and Mingyang Yin. 2020. Semi-supervised co-clustering on attributed heterogeneous information networks. Information Processing & Management 57 (2020), 102338.
[58]
Yuheng Jia, Sam Kwong, Junhui Hou, and Wenhui Wu. 2019. Semi-supervised non-negative matrix factorization with dissimilarity and similarity regularization. IEEE Transactions on Neural Networks and Learning Systems 31, 7 (2019), 2510–2521.
[59]
Zhao Kang, Zhiping Lin, Xiaofeng Zhu, and Wenbo Xu. 2021. Structured graph learning for scalable subspace clustering: From single view to multiview. IEEE Transactions on Cybernetics 52, 9 (2021), 8976–8986.
[60]
Ramakrishnan Kannan, Grey Ballard, and Haesun Park. 2017. MPI-FAUN: An MPI-based framework for alternating-updating nonnegative matrix factorization. IEEE Transactions on Knowledge and Data Engineering 30, 3 (2017), 544–558.
[61]
Margret Keuper, Siyu Tang, Bjoern Andres, Thomas Brox, and Bernt Schiele. 2020. Motion segmentation & multiple object tracking by correlation co-clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 42 (2020), 140–153.
[62]
Sheheryar Khan, Lijiang Chen, and Hong Yan. 2020. Co-clustering to reveal salient facial features for expression recognition. IEEE Transactions on Affective Computing 11 (2020), 348–360.
[63]
Jungeun Kim, JaeGil Lee, Byung Suk Lee, and Jiajun Liu. 2020. Geosocial co-clustering: A novel framework for geosocial community detection. ACM Transactions on Intelligent Systems and Technology 11, 4 (2020), 1–26.
[64]
Yuval Kluger, Ronen Basri, Joseph T. Chang, and Mark Gerstein. 2003. Spectral biclustering of microarray data: Coclustering genes and conditions. Genome Research 13, 4 (2003), 703–716.
[65]
Krishna Kummamuru, Ajay Dhawale, and Raghu Krishnapuram. 2003. Fuzzy co-clustering of documents and keywords. In 12th IEEE International Conference on Fuzzy Systems. (FUZZ ’03), 772–777.
[66]
Lazhar Labiod and Mohamed Nadif. 2011. Co-clustering for binary and categorical data with maximum modularity. In 2011 IEEE 11th International Conference on Data Mining, 1140–1145.
[67]
Kenneth WaiTing Leung, Dik Lun Lee, and WangChien Lee. 2011. Clr: A collaborative location recommendation framework based on co-clustering. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, 305–314.
[68]
Jingxuan Li and Tao Li. 2010. HCC: A hierarchical co-clustering algorithm. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 861–862.
[69]
Jun Li, Hongfu Liu, Zhiqiang Tao, Handong Zhao, and Yun Fu. 2020. Learnable subspace clustering. IEEE Transactions on Neural Networks and Learning Systems 33, 3 (2020), 1119–1133.
[70]
Jingxuan Li, Bo Shao, Tao Li, and Mitsunori Ogihara. 2011. Hierarchical co-clustering: A new way to organize the music data. IEEE Transactions on Multimedia 14, 2 (2011), 471–481.
[71]
Mingyang Li, Xinhua Bi, Limin Wang, and Xuming Han. 2021a. A method of two-stage clustering learning based on improved DBSCAN and density peak algorithm. Computer Communications 167 (2021), 75–84.
[72]
Man Li, Luosheng Wen, and Feiyu Chen. 2021b. A novel Collaborative filtering recommendation approach based on soft co-clustering. Physica A: Statistical Mechanics and its Applications 561 (2021), 125140.
[73]
Ping Li, Jiajun Bu, Chun Chen, and Zhanying He. 2012. Relational co-clustering via manifold ensemble learning. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 1687–1691.
[74]
Xiangli Li, Xiyan Lu, and Xuezhen Fan. 2022. Semi-supervised sparse neighbor constrained co-clustering with dissimilarity and similarity regularization. Engineering Applications of Artificial Intelligence 114 (2022), 104989.
[75]
Chunfeng Lian, Su Ruan, Thierry Denoeux, Hua Li, and Pierre Vera. 2019. Joint tumor segmentation in PET-CT images using co-clustering and fusion based on belief functions. IEEE Transactions on Image Processing 28 (2019), 755–766.
[76]
Zhiping Lin, Zhao Kang, Lizong Zhang, and Ling Tian. 2021. Multi-view attributed graph clustering. IEEE Transactions on Knowledge and Data Engineering 35, 2 (2021), 1872–1880.
[77]
Hongfu Liu and Yun Fu. 2018. Consensus Guided Multi-View Clustering. ACM Transactions on Knowledge Discovery from Data 12, 4, Article 42 (Apr 2018), 21 pages. DOI:
[78]
Huafeng Liu, Liping Jing, Jingxuan Wen, Pengyu Xu, Jian Yu, and Michael K. Ng. 2021. Bayesian Additive Matrix Approximation for Social Recommendation. ACM Transactions on Knowledge Discovery from Data 16, 1, Article 7 (Jul 2021), 34 pages. DOI:
[79]
Xinwang Liu, Xinzhong Zhu, Miaomiao Li, Lei Wang, Chang Tang, Jianping Yin, Dinggang Shen, Huaimin Wang, and Wen Gao. 2018. Late fusion incomplete multi-view clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 10 (2018), 2410–2423.
[80]
Bo Long, Zhongfei Zhang, and Philip S. Yu. 2005. Co-clustering by block value decomposition. In Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, 635–640.
[81]
Yanbin Lu, Lawrence Carin, Ronald Coifman, William Shain, and Badrinath Roysam. 2014. Quantitative arbor analytics: Unsupervised harmonic co-clustering of populations of brain cell arbors based on L-measure. Neuroinformatics 13 (2014), 47–63.
[82]
Zhoumin Lu, Genggeng Liu, and Shiping Wang. 2020. Sparse neighbor constrained co-clustering via category consistency learning. Knowledge-Based Systems 201–202 (2020), 105987.
[83]
Peng Luo, Jinye Peng, Ziyu Guan, and Jianping Fan. 2018. Dual regularized multi-view non-negative matrix factorization for clustering. Neurocomputing 294 (2018), 1–11.
[84]
Jiaqi Ma, Yipeng Zhang, and Lefei Zhang. 2021. Discriminative subspace matrix factorization for multiview data clustering. Pattern Recognition 111 (2021), 107676.
[85]
Feiping Nie, Shenfei Pei, Rong Wang, and Xuelong Li. 2020a. Fast clustering with co-clustering via discrete non-negative matrix factorization for image identification. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2073–2077.
[86]
Feiping Nie, Shaojun Shi, and Xuelong Li. 2020b. Auto-weighted multi-view co-clustering via fast matrix factorization. Pattern Recognition 102 (2020), 107207.
[87]
Feiping Nie, Xiaoqian Wang, Cheng Deng, and Heng Huang. 2017. Learning a structured optimal bipartite graph for co-clustering. In Advances in Neural Information Processing Systems 30 (2017).
[88]
Krzysztof Nowicki and Tom A. B. Snijders. 2001. Estimation and prediction for stochastic blockstructures. Journal of the American Statistical Association 96, 455 (2001), 1077–1087.
[89]
Chi-Hyon Oh, Katsuhiro Honda, and Hidetomo Ichihashi. 2001. Fuzzy clustering for categorical multivariate data. In Proceedings joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569), Vol. 4. IEEE, 2154–2159.
[90]
Yulong Pei, Nilanjan Chakraborty, and Katia Sycara. 2015. Nonnegative matrix tri-factorization with graph regularization for community detection in social networks. In 24th International Joint Conference on Artificial Intelligence, 2083–2089.
[91]
Wei Peng and Tao Li. 2011. Temporal relation co-clustering on directional social network and author-topic evolution. Knowledge and Information Systems 26 (2011), 467–486.
[92]
Ruggero G. Pensa and JeanFrançois Boulicaut. 2008. Constrained co-clustering of gene expression data. In Proceedings of the 2008 SIAM International Conference on Data Mining, 25–36.
[93]
Ruggero G. Pensa, Dino Ienco, and Rosa Meo. 2012. Hierarchical co-clustering: Off-line and incremental approaches. Data Mining and Knowledge Discovery 28 (2012), 31–64.
[94]
Nha Van Pham, Long The Pham, Witold Pedrycz, and Long Thanh Ngo. 2021. Feature-reduction fuzzy co-clustering approach for hyper-spectral image analysis. Knowledge-Based Systems 216 (2021), 106549.
[95]
Van Nha Pham, Long The Pham, Witold Pedrycz, and Long Thanh Ngo. 2017. Feature-reduction fuzzy co-clustering algorithm for hyperspectral image segmentation. In 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 1–6.
[96]
Gianvito Pio, Francesco Serafino, Donato Malerba, and Michelangelo Ceci. 2018. Multi-type clustering and classification from heterogeneous networks. Information Sciences 425 (2018), 107–126.
[97]
Rodolphe Priam, Mohamed Nadif, and Gérard Govaert. 2013. Gaussian topographic co-clustering model. In 12th International Symposium on Advances in Intelligent Data Analysis XII (IDA ’13), Proceedings, 345–356.
[98]
Subin Qian, Huiyi Liu, Xiaofeng Yuan, Wei Wei, Shuangshuang Chen, and Hong Yan. 2022. Row and Column Structure-Based Biclustering for Gene Expression Data. IEEE/ACM Transactions on Computational Biology and Bioinformatics 19, 2 (2022), 1117–1129. DOI:
[99]
Xueming Qian, Mingdi Li, Yayun Ren, and Shuhui Jiang. 2019. Social media based event summarization by user–text–image co-clustering. Knowledge-Based Systems 164 (2019), 107–121.
[100]
Manjeet Rege, Ming Dong, and Farshad Fotouhi. 2008. Bipartite isoperimetric graph partitioning for data co-clustering. Data Mining and Knowledge Discovery 16 (2008), 276–312.
[101]
Jingru Ren, Zhi Liu, Gongyang Li, Xiaofei Zhou, Cong Bai, and Guangling Sun. 2020. Co-saliency detection using collaborative feature extraction and high-to-low feature integration. 2020 IEEE International Conference on Multimedia and Expo (ICME), 1–6.
[102]
Jiaqi Ren and Youlong Yang. 2018. Multitask possibilistic and fuzzy co-clustering algorithm for clustering data with multisource features. Neural Computing and Applications 32 (2018), 4785–4804.
[103]
Alfréd Rényi et al. 1961. On measures of information and entropy. In Proceedings of the 4th Berkeley Symposium on Mathematics, Statistics and Probability 1, 547 (1961).
[104]
Karl Rohe, Tai Qin, and Bin Yu. 2016. Co-clustering directed graphs to discover asymmetries and directional communities. Proceedings of the National Academy of Sciences 113 (2016), 12679–12684.
[105]
Richard Rohwer and Dayne Freitag. 2004. Towards full automation of lexicon construction. Proceedings of the Computational Lexical Semantics Workshop at HLT-NAACL 2004, 9–16.
[106]
ChunYan Sang and DiHua Sun. 2014a. Co-clustering over multiple dynamic data streams based on non-negative matrix factorization. Applied intelligence 41 (2014), 487–502.
[107]
Margot Selosse, Julien Jacques, and Christophe Biernacki. 2020a. Co-clustering contraint pour le résumé de matrices document-terme. JdS 2020-52èmes Journées de Statistique de la Société Française de Statistique, 1–6.
[108]
Margot Selosse, Julien Jacques, and Christophe Biernacki. 2020b. Model-based co-clustering for mixed type data. Computational Statistics & Data Analysis 144, 106866.
[109]
Margot Selosse, Julien Jacques, and Christophe Biernacki. 2020c. Textual data summarization using the Self-Organized Co-Clustering model. Pattern Recognition 103, 107315.
[110]
Hanhuai Shan and Arindam Banerjee. 2008. Bayesian co-clustering. IEEE International Conference on Data Mining, 530–539.
[111]
Hanhuai Shan and Arindam Banerjee. 2010. Residual Bayesian co-clustering for matrix approximation. In Proceedings of the 2010 SIAM International Conference on Data Mining, 223–234.
[112]
Fanhua Shang, L. C. Jiao, and Fei Wang. 2012. Graph dual regularization non-negative matrix factorization for co-clustering. Pattern Recognition 45 (2012), 2237–2250.
[113]
Xiaoxiao Shi, Wei Fan, and S Yu Philip. 2010. Efficient semi-supervised spectral co-clustering with constraints. 2010 IEEE International Conference on Data Mining, 1043–1048.
[114]
Yosra Ben Slimen, Sylvain Allio, and Julien Jacques. 2018. Model-based co-clustering for functional data. Neurocomputing 291 (2018), 97–108.
[115]
Kun Song, Xiwen Yao, Feiping Nie, Xuelong Li, and Mingliang Xu. 2021. Weighted bilateral K-means algorithm for fast co-clustering and fast spectral clustering. Pattern Recognition 109 (2021), 107560.
[116]
Harald Steck and Tommi Jaakkola. 2002. On the Dirichlet prior and Bayesian regularization. Advances in Neural Information Processing Systems, 15.
[117]
Kai Sugahara and Kazushi Okamoto. 2023. Hierarchical co-clustering with augmented matrices from external domains. Pattern Recognition (2023), 109657.
[118]
Jing Sun, Zhihui Wang, Fuming Sun, and Haojie Li. 2018. Sparse dual graph-regularized NMF for image co-clustering. Neurocomputing 316 (2018), 156–165.
[119]
Qi Tan, Pei Yang, and Jingrui He. 2018. Feature co-shrinking for co-clustering. Pattern Recognition 77 (2018), 12–19.
[120]
Jiayi Tang and Zhong Wan. 2021. Orthogonal dual graph-regularized nonnegative matrix factorization for co-clustering. Journal of Scientific Computing 87, 3 (2021), 66.
[121]
WilliamChandra Tjhi and Lihui Chen. 2007. Possibilistic fuzzy co-clustering of large document collections. Pattern Recognition 40 (2007), 3452–3466.
[122]
Seiki Ubukata, Narihira Nodake, Akira Notsu, and Katsuhiro Honda. 2020. Basic consideration of co-clustering based on rough set theory. In Integrated Uncertainty in Knowledge Modelling and Decision Making: 8th International Symposium (IUKM 2020), Proceedings, 151–161.
[123]
Tim Van Erven and Peter Harremos. 2014. Rényi divergence and Kullback-Leibler divergence. IEEE Transactions on Information Theory 60, 7 (2014), 3797–3820.
[124]
Jun Wang, Xing Wang, Guoxian Yu, Carlotta Domeniconi, Zhiwen Yu, and Zili Zhang. 2021. Discovering multiple co-clusterings with matrix factorization. IEEE Transactions on Cybernetics 51, 7 (2021), 3576–3587.
[125]
Pu Wang, Carlotta Domeniconi, and Kathryn Blackmond Laskey. 2009. Latent dirichlet Bayesian co-clustering. In Machine Learning and Knowledge Discovery in Databases: European Conference (ECML PKDD ’09), Proceedings, Part II 20, 522–537.
[126]
Pu Wang, Carlotta Domeniconi, and Kathryn Blackmond Laskey. 2010. Information bottleneck co-clustering. In Workshop TextMining@ SIAM Data Mining 10, (2010).
[127]
Pu Wang, Kathryn B. Laskey, Carlotta Domeniconi, and Michael I. Jordan. 2011. Nonparametric Bayesian co-clustering ensembles. In Proceedings of the 2011 SIAM International Conference on Data Mining, 331–342.
[128]
Shiping Wang and Wenzhong Guo. 2017. Robust co-clustering via dual local learning and high-order matrix factorization. Knowledge-Based Systems 138 (2017), 176–187.
[129]
Shiping Wang and Aiping Huang. 2017. Penalized nonnegative matrix tri-factorization for co-clustering. Expert Systems with Applications 78 (2017), 64–73.
[130]
Xing Wang, Jun Wang, Guoxain Yu, and Maozu Guo. 2018. Network regularization bi-clustering for cancer subtype categorization. Chinese Journal of Computer 42 (2018).
[131]
Yan Wang and Xiaoke Ma. 2021. Joint nonnegative matrix factorization and network embedding for graph co-clustering. Neurocomputing 462 (2021), 453–465.
[132]
YuXiong Wang and YuJin Zhang. 2012. Nonnegative matrix factorization: A comprehensive review. IEEE Transactions on Knowledge and Data Engineering 25, 6 (2012), 1336–1353.
[133]
Zhenghong Wei, Hongya Zhao, Lan Zhao, and Hong Yan. 2019. Multiscale co-clustering for tensor data based on canonical polyadic decomposition and slice-wise factorization. Information Sciences 503 (2019), 72–91.
[134]
Chao-Yuan Wu, Alex Beutel, Amr Ahmed, and Alexander J. Smola. 2016. Explaining reviews and ratings with paco: Poisson additive co-clustering. In Proceedings of the 25th International Conference Companion on World Wide Web, 127–128.
[135]
Hu Wu, Yongji Wang, Zhe Wang, Xiuli Wang, and Shuanzhu Du. 2010. Two-phase collaborative filtering algorithm based on co-clustering. Journal of Software 21 (2010), 1042–1054.
[136]
Jian-Sheng Wu, Jian-Huang Lai, and Chang-Dong Wang. 2011. A novel co-clustering method with intra-similarities. 2011 IEEE 11th International Conference on Data Mining Workshops, 300–306.
[137]
Meng-Lun Wu, Chia-Hui Chang, and Rui-Zhe Liu. 2012. Co-clustering with augmented matrix. Applied Intelligence 39 (2012), 153–164.
[138]
Juan Xie, Anjun Ma, Yu Zhang, Bingqiang Liu, Sha Cao, Cankun Wang, Jennifer Xu, Chi Zhang, and Qin Ma. 2020. QUBIC2: A novel and robust biclustering algorithm for analyses and interpretation of large-scale RNA-Seq data. Bioinformatics 36, 4 (2020), 1143–1149.
[139]
Bin Xu, Jiajun Bu, Chun Chen, and Deng Cai. 2012. An exploration of improving collaborative recommender systems via user-item subgroups. In Proceedings of the 21st International Conference on World Wide Web, 21–30.
[140]
Dongkuan Xu, Wei Cheng, Bo Zong, Jingchao Ni, Dongjin Song, Wenchao Yu, Yuncong Chen, Haifeng Chen, and Xiang Zhang. 2019a. Deep co-clustering. In Proceedings of the 2019 SIAM International Conference on Data Mining, 414–422.
[141]
Peng Xu, Zhaohong Deng, Kup-Sze Choi, Longbing Cao, and Shitong Wang. 2019b. Multi-view information-theoretic co-clustering for co-occurrence data. Proceedings of the AAAI conference on Artificial Intelligence 33, 01 (2019), 379–386.
[142]
Xiaoqiang Yan, Zhengzheng Lou, Shizhe Hu, and Yangdong Ye. 2020. Multi-task iInformation bottleneck co-clustering for unsupervised cross-view human action categorization. ACM Transactions on Knowledge Discovery from Data 14 (2020), 1–23.
[143]
Yang Yan, Lihui Chen, and WilliamChandra Tjhi. 2013a. Fuzzy semi-supervised co-clustering for text documents. Fuzzy Sets and Systems 215 (2013), 74–89.
[144]
Hui Yang, Han Peng, Jianyong Zhu, and Feiping Nie. 2020. Co-clustering ensemble based on bilateral K-means algorithm. IEEE Access 8 (2020), 51285–51294.
[145]
Miin-Shen Yang and Yessica Nataliani. 2017. A feature-reduction fuzzy clustering algorithm based on feature-weighted entropy. IEEE Transactions on Fuzzy Systems 26, 2 (2017), 817–835.
[146]
Xiwen Yao, Junwei Han, Dingwen Zhang, and Feiping Nie. 2017. Revisiting co-saliency detection: A novel approach based on two-stage multi-view spectral rotation co-clustering. IEEE Transactions on Image Processing 26 (2017), 3196–3209.
[147]
Qiyue Yin, Shu Wu, and Liang Wang. 2018. Multiview clustering via unified and view-specific embeddings learning. IEEE Transactions on Neural Networks and Learning Systems 29, 11 (2018), 5541–5553.
[148]
Donghua Yu, Guojun Liu, Maozu Guo, and Xiaoyan Liu. 2018. An improved K-medoids algorithm based on step increasing and optimizing medoids. Expert Systems with Applications 92 (2018), 464–473.
[149]
Xianxue Yu, Guoxian Yu, Jun Wang, and Carlotta Domeniconi. 2019. Co-clustering ensembles based on multiple relevance measures. IEEE Transactions on Knowledge and Data Engineering 33, 4 (2019), 1389–1400.
[150]
Hongyuan Zha, Xiaofeng He, Chris Ding, Horst Simon, and Ming Gu. 2001. Bipartite graph partitioning and data clustering. In Proceedings of the 10th International Conference on Information and Knowledge Management, 25–32.
[151]
Kun Zhan, Changqing Zhang, Junpeng Guan, and Junsheng Wang. 2017. Graph learning for multiview clustering. IEEE Transactions on Cybernetics 48, 10 (2017), 2887–2895.
[152]
Fei Zhang, Libo Zhang, Tiejian Luo, and Yanjun Wu. 2018. A feature-based on co-clustering model. Journal of Computer Research and Development 55 (2018), 1508–1524.
[153]
Lijun Zhang, Chun Chen, Jiajun Bu, Zhengguang Chen, Deng Cai, and Jiawei Han. 2011. Locally discriminative coclustering. IEEE Transactions on Knowledge and Data Engineering 24, 6 (2011), 1025–1035.
[154]
Liang Zheng, Yuzhong Qu, Xinqi Qian, and Gong Cheng. 2018. A hierarchical co-clustering approach for entity exploration over Linked Data. Knowledge-Based Systems 141 (2018), 200–210.
[155]
Yuxin Zhong, Hongjun Wang, Wenlu Yang, Luqing Wang, and Tianrui Li. 2023. Multi-objective genetic model for co-clustering ensemble. Applied Soft Computing 135 (2023), 110058.
[156]
Yada Zhu and Jingrui He. 2016. Co-clustering structural temporal data with applications to semiconductor manufacturing. ACM Transactions on Knowledge Discovery from Data 10, 4, Article 43 (May 2016), 18 pages. DOI:
[157]
Yu Zhu, Boning Li, and Santiago Segarra. 2021. Co-clustering vertices and hyperedges via spectral hypergraph partitioning. In 2021 29th European Signal Processing Conference (EUSIPCO), 1416–1420.
[158]
Yu Zong, Ping Jin, Hong Chen, Enhong an Li, and Renji Liu. 2012. Fuzzy co-clustering algorithm for weblog. Journal of Electronics & Information Technology 34 (2012), 1009–5896.

Cited By

View all
  • (2024)GTGNN: Global Graph and Taxonomy Tree for Graph Neural Network Session-Based RecommendationWeb Information Systems and Applications10.1007/978-981-97-7707-5_3(29-40)Online publication date: 1-Aug-2024
  • (2024)scCDCG: Efficient Deep Structural Clustering for Single-Cell RNA-Seq via Deep Cut-Informed Graph EmbeddingDatabase Systems for Advanced Applications10.1007/978-981-97-5575-2_11(172-187)Online publication date: 2-Jul-2024

Index Terms

  1. A Survey of Co-Clustering

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Knowledge Discovery from Data
    ACM Transactions on Knowledge Discovery from Data  Volume 18, Issue 9
    November 2024
    730 pages
    EISSN:1556-472X
    DOI:10.1145/3613722
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 November 2024
    Online AM: 25 July 2024
    Accepted: 20 July 2024
    Revised: 19 May 2024
    Received: 16 October 2023
    Published in TKDD Volume 18, Issue 9

    Check for updates

    Author Tags

    1. co-clustering
    2. information theory
    3. graph theory
    4. matrix factorization

    Qualifiers

    • Research-article

    Funding Sources

    • National Natural Science Foundation of China
    • Natural Science Foundation of Sichuan Province
    • China Postdoctoral Science Foundation
    • Fundamental Research Funds for the Central Universities

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)340
    • Downloads (Last 6 weeks)90
    Reflects downloads up to 11 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)GTGNN: Global Graph and Taxonomy Tree for Graph Neural Network Session-Based RecommendationWeb Information Systems and Applications10.1007/978-981-97-7707-5_3(29-40)Online publication date: 1-Aug-2024
    • (2024)scCDCG: Efficient Deep Structural Clustering for Single-Cell RNA-Seq via Deep Cut-Informed Graph EmbeddingDatabase Systems for Advanced Applications10.1007/978-981-97-5575-2_11(172-187)Online publication date: 2-Jul-2024

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media