[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2487575.2487689acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Redundancy-aware maximal cliques

Published: 11 August 2013 Publication History

Abstract

Recent research efforts have made notable progress in improving the performance of (exhaustive) maximal clique enumeration (MCE). However, existing algorithms still suffer from exploring the huge search space of MCE. Furthermore, their results are often undesirable as many of the returned maximal cliques have large overlapping parts. This redundancy leads to problems in both computational efficiency and usefulness of MCE.
In this paper, we aim at providing a concise and complete summary of the set of maximal cliques, which is useful to many applications. We propose the notion of τ-visible MCE to achieve this goal and design algorithms to realize the notion. Based on the refined output space, we further consider applications including an efficient computation of the top-k results with diversity and an interactive clique exploration process. Our experimental results demonstrate that our approach is capable of producing output of high usability and our algorithms achieve superior efficiency over classic MCE algorithms.

References

[1]
J. Abello, M. G. C. Resende, and S. Sudarsky. Massive quasi-clique detection. In LATIN, pages 598--612, 2002.
[2]
F. Afrati, A. Gionis, and H. Mannila. Approximating a collection of frequent sets. In SIGKDD, 2004.
[3]
R. Agrawal, S. Gollapudi, A. Halverson, and S. Leong. Diversifying search results. In Second ACM Int. Conf. on Web Search and Data Mining (WSDM), 2009.
[4]
N. M. Berry, T. H. Ko, T. Moy, J. Smrcka, J. Turnley, and B. Wu. Emergent clique formation in terrorist recruitment. In The AAAI-04 Workshop on Agent Organizations: Theory and Practice, 2004.
[5]
C. Bron and J. Kerbosch. Algorithm 457: finding all cliques of an undirected graph. Commun. ACM, 16(9):575--577, 1973.
[6]
F. Cazals and C. Karande. A note on the problem of reporting maximal cliques. Theor. Comput. Sci., 407(1-3):564--568, 2008.
[7]
J. Cheng, Y. Ke, S. Chu, and M. T. Özsu. Efficient core decomposition in massive networks. In ICDE, pages 51--62, 2011.
[8]
J. Cheng, Y. Ke, A. W.-C. Fu, J. X. Yu, and L. Zhu. Finding maximal cliques in massive networks by h*-graph. In SIGMOD Conference, pages 447--458, 2010.
[9]
J. Cheng, Y. Ke, A. W.-C. Fu, J. X. Yu, and L. Zhu. Finding maximal cliques in massive networks. ACM Trans. Database Syst., 36(4):21, 2011.
[10]
J. Cheng, L. Zhu, Y. Ke, and S. Chu. Fast algorithms for maximal clique enumeration with limited memory. In KDD, pages 1240--1248, 2012.
[11]
S. Chu and J. Cheng. Triangle listing in massive networks and its applications. In KDD, pages 672--680, 2011.
[12]
S. Chu and J. Cheng. Triangle listing in massive networks. TKDD, 6(4):17, 2012.
[13]
D. Eppstein, M. Löffler, and D. Strash. Listing all maximal cliques in sparse graphs in near-optimal time. In ISAAC (1), pages 403--414, 2010.
[14]
M. Hua, J. Pei, A. Fu, and X. Lin. Top-k typicality queries and efficient query answering methods on large databases. The VLDB Journal, 18:809--835, 2009.
[15]
F. Kose, W. Weckwerth, T. Linke, and O. Fiehn. Visualizing plant metabolomic correlation networks using clique-metabolite matrices. Bioinformatics, 17(12):1198--1208, 2001.
[16]
G. Liu and L. Wong. Effective pruning techniques for mining quasi-cliques. In ECML/PKDD (2), pages 33--49, 2008.
[17]
H. Matsuda, T. Ishihara, and A. Hashimoto. Classifying molecular sequences using a linkage graph with their pairwise similarities. Theor. Comput. Sci., 210(2):305--325, 1999.
[18]
J. Moon and L. Moser. On cliques in graphs. Israel J. Math, 3:23--28, 1965.
[19]
G. Nemhauser, L. Wolsey, and M. Fisher. An analysis of the approximations for maximizing submodular set functions. Math. Programming, 14:265--294, 1978.
[20]
J. Pei, D. Jiang, and A. Zhang. On mining cross-graph quasi-cliques. In SIGKDD, 2005.
[21]
E. Tomita, A. Tanaka, and H. Takahashi. The worst-case time complexity for generating all maximal cliques and computational experiments. Theor. Comput. Sci., 363(1):28--42, 2006.
[22]
E. Vee, U. Srivastava, J. Shanmugasundaram, P. Bhat, and S. Yahia. Efficient computation of diverse query results. In ICDE, 2008.
[23]
J. Wang and J. Cheng. Truss decomposition in massive networks. PVLDB, 5(9):812--823, 2012.
[24]
D. Xin, H. Cheng, X. Yan, and J. Han. Extracting redundancy-aware top-k patterns. In SIGKDD, 2006.
[25]
X. Yan, H. Cheng, J. Han, and D. Xin. Summarizing itemset patterns: A profile-based approach. In SIGKDD, 2005.

Cited By

View all
  • (2024)An Efficient and Exact Algorithm for Locally h-Clique Densest Subgraph DiscoveryProceedings of the ACM on Management of Data10.1145/36988002:6(1-26)Online publication date: 20-Dec-2024
  • (2024)TED$^+$: Towards Discovering Top-k Edge-Diversified Patterns in a Graph DatabaseIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.3312566(1-14)Online publication date: 2024
  • (2024)Evolutionary Multi-objective Diversity OptimizationParallel Problem Solving from Nature – PPSN XVIII10.1007/978-3-031-70085-9_8(117-134)Online publication date: 14-Sep-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
August 2013
1534 pages
ISBN:9781450321747
DOI:10.1145/2487575
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 August 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. clique concise representation
  2. clique summarization
  3. maximal clique enumeration

Qualifiers

  • Research-article

Conference

KDD' 13
Sponsor:

Acceptance Rates

KDD '13 Paper Acceptance Rate 125 of 726 submissions, 17%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)2
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)An Efficient and Exact Algorithm for Locally h-Clique Densest Subgraph DiscoveryProceedings of the ACM on Management of Data10.1145/36988002:6(1-26)Online publication date: 20-Dec-2024
  • (2024)TED$^+$: Towards Discovering Top-k Edge-Diversified Patterns in a Graph DatabaseIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.3312566(1-14)Online publication date: 2024
  • (2024)Evolutionary Multi-objective Diversity OptimizationParallel Problem Solving from Nature – PPSN XVIII10.1007/978-3-031-70085-9_8(117-134)Online publication date: 14-Sep-2024
  • (2023)Diverse approximations for monotone submodular maximization problems with a matroid constraintProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/617(5558-5566)Online publication date: 19-Aug-2023
  • (2023)TED: Towards Discovering Top-k Edge-Diversified Patterns in a Graph DatabaseProceedings of the ACM on Management of Data10.1145/35887361:1(1-26)Online publication date: 30-May-2023
  • (2022)Analysis of Evolutionary Diversity Optimization for Permutation ProblemsACM Transactions on Evolutionary Learning and Optimization10.1145/35619742:3(1-27)Online publication date: 19-Oct-2022
  • (2022)One Set to Cover All Maximal Cliques ApproximatelyProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3517881(2006-2019)Online publication date: 10-Jun-2022
  • (2022)Continuous Monitoring of Maximum Clique Over Dynamic GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.300370134:4(1667-1683)Online publication date: 1-Apr-2022
  • (2022)On Maximising the Vertex Coverage for ${\text{Top}}-k$ t-Bicliques in Bipartite Graphs2022 IEEE 38th International Conference on Data Engineering (ICDE)10.1109/ICDE53745.2022.00221(2346-2358)Online publication date: May-2022
  • (2022)Discovering Overlapping Communities Based on Cohesive Subgraph Models over Graph DataBig Data Analytics and Knowledge Discovery10.1007/978-3-031-12670-3_16(189-201)Online publication date: 26-Jul-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media