DOI: 10.1145/3442381.3449956
research-article

Scalable Auto-weighted Discrete Multi-view Clustering

Published: 03 June 2021

Abstract

Multi-view clustering, which exploits complementary information across views to improve clustering performance, has been widely studied in machine learning. However, handling large-scale multi-view data remains challenging because of the high time complexity of traditional approaches. Existing approaches also suffer from the parameter selection problem: because labeled data are lacking, selecting parameters in practical clustering applications is difficult, especially for big data. In this paper, we propose a novel approach for large-scale multi-view clustering that addresses these challenges. Our approach learns a low-dimensional binary embedding of multi-view data, preserves the samples’ local structure during binary embedding, and optimizes the embedding and the clustering in a unified framework. Furthermore, we propose to learn the parameters using a combination of data-driven and heuristic approaches. Experiments on five large-scale multi-view datasets show that the proposed method is superior to the state of the art in terms of clustering quality and running time.
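
The abstract does not give the objective function or the update rules, so the following is only a generic sketch of the two ideas it names: mapping several views to short binary codes and clustering those codes in Hamming space. It is not the paper's algorithm. The random projections, the equal view weights, and the function names `binary_embed` and `hamming_kmeans` are all assumptions made for illustration; the paper learns the embedding, the view weights, and the clustering jointly.

```python
import numpy as np

def binary_embed(views, n_bits, rng):
    """Project each view to n_bits dimensions, sum the views, keep the signs."""
    codes = np.zeros((views[0].shape[0], n_bits))
    for X in views:
        Xc = X - X.mean(axis=0)                        # center each view
        W = rng.standard_normal((X.shape[1], n_bits))  # random projection (assumption)
        codes += Xc @ W / np.sqrt(X.shape[1])          # equal view weights (assumption)
    return np.where(codes >= 0, 1, -1).astype(np.int8)

def hamming_kmeans(B, k, n_iter, rng):
    """k-means-style clustering with binary centroids and Hamming distance."""
    n, n_bits = B.shape
    centers = B[rng.choice(n, size=k, replace=False)].copy()
    labels = np.zeros(n, dtype=int)
    for _ in range(n_iter):
        # For codes in {-1, +1}: Hamming distance = (n_bits - inner product) / 2.
        inner = B.astype(np.int32) @ centers.astype(np.int32).T
        dist = (n_bits - inner) // 2
        labels = dist.argmin(axis=1)
        for c in range(k):
            members = B[labels == c]
            if len(members):
                # A per-bit majority vote gives the binary centroid.
                centers[c] = np.where(members.sum(axis=0) >= 0, 1, -1)
    return labels, centers

# Toy usage on two synthetic views of the same 200 samples.
rng = np.random.default_rng(0)
views = [rng.standard_normal((200, 30)), rng.standard_normal((200, 50))]
B = binary_embed(views, n_bits=16, rng=rng)
labels, centers = hamming_kmeans(B, k=3, n_iter=10, rng=rng)
```

Binary codes make both storage and distance computation cheap, which is the usual motivation for discrete embeddings at scale. The sketch above omits the local-structure preservation and the automatic weighting described in the abstract.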

    Information

    Published In

    WWW '21: Proceedings of the Web Conference 2021
    April 2021
    4054 pages
    ISBN:9781450383127
    DOI:10.1145/3442381
    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. binary coding
    2. graph regularization
    3. multi-view clustering
    4. parameter selection

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    WWW '21
    WWW '21: The Web Conference 2021
    April 19 - 23, 2021
    Ljubljana, Slovenia

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
