[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Neighborhood-Based Hypergraph Core Decomposition

Published: 01 May 2023 Publication History

Abstract

We propose neighborhood-based core decomposition: a novel way of decomposing hypergraphs into hierarchical neighborhood-cohesive subhypergraphs. Alternative approaches to decomposing hypergraphs, e.g., reduction to clique or bipartite graphs, are not meaningful in certain applications, the later also results in inefficient decomposition; while existing degree-based hypergraph decomposition does not distinguish nodes with different neighborhood sizes. Our case studies show that the proposed decomposition is more effective than degree and clique graph-based decompositions in disease intervention and in extracting provably approximate and application-wise meaningful densest subhypergraphs. We propose three algorithms: Peel, its efficient variant E-Peel, and a novel local algorithm: Local-core with parallel implementation. Our most efficient parallel algorithm Local-core(P) decomposes hypergraph with 27M nodes and 17M hyperedges in-memory within 91 seconds by adopting various optimizations. Finally, we develop a new hypergraph-core model, the (neighborhood, degree)-core by considering both neighborhood and degree constraints, design its decomposition algorithm Local-core+Peel, and demonstrate its superiority in spreading diffusion.

References

[1]
J. Ignacio Alvarez-Hamelin, Luca Dall'Asta, Alain Barrat, and Alessandro Vespignani. 2005. Large scale networks fingerprinting and visualization using the k-core decomposition. In Advances in Neural Information Processing Systems (NeurIPS). MIT Press, 41--50.
[2]
Naheed Anjum Arafat, Debabrota Basu, Laurent Decreusefond, and Stéphane Bressan. 2020. Construction and random generation of hypergraphs with prescribed degree and dimension sequences. In Database and Expert Systems Applications (DEXA) (Lecture Notes in Computer Science), Vol. 12392. Springer, 130--145.
[3]
Naheed Anjum Arafat, Arijit Khan, Arpit Kumar Rai, and Bishwamittra Ghosh. 2022. Our code and datasets. https://github.com/toggled/vldbsubmission/.
[4]
Naheed Anjum Arafat, Arijit Khan, Arpit Kumar Rai, and Bishwamittra Ghosh. 2023. Neighborhood-based Hypergraph Core Decomposition. arXiv preprint arXiv:2301.06426 (2023).
[5]
Chen Avin, Zvi Lotker, Yinon Nahum, and David Peleg. 2019. Random preferential attachment hypergraph. In IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). 398--405.
[6]
Mohammad A. Bahmanian and Mateja Sajna. 2015. Connection and separation in hypergraphs. Theory and Applications of Graphs 2, 2 (2015), 5.
[7]
Stephen Bailey. 2017. Nashville meetup network. https://www.kaggle.com/datasets/stkbailey/nashville-meetup.
[8]
Vladimir Batagelj, Andrej Mrvar, and Matjaž Zaveršnik. 1999. Partitioning approach to visualization of large graphs. In Graph Drawing (Lecture Notes in Computer Science), Vol. 1731.
[9]
Vladimir Batagelj and Matjaž Zaveršnik. 2011. Fast algorithms for determining (generalized) core groups in social networks. Advances in Data Analysis and Classification 5, 2 (2011), 129--145.
[10]
Federico Battiston, Giulia Cencetti, Iacopo Iacopini, Vito Latora, Maxime Lucas, Alice Patania, Jean-Gabriel Young, and Giovanni Petri. 2020. Networks beyond pairwise interactions: structure and dynamics. Physics Reports 874 (2020), 1--92.
[11]
Austin R. Benson, Rediet Abebe, Michael T. Schaub, Ali Jadbabaie, and Jon Kleinberg. 2018. Simplicial closure and higher-order link prediction. Proceedings of the National Academy of Sciences 115, 48 (2018), E11221--E11230.
[12]
Claude Berge. 1989. Hypergraphs - combinatorics of finite sets. North-Holland mathematical library, Vol. 45. North-Holland.
[13]
Francesco Bonchi, Arijit Khan, and Lorenzo Severini. 2019. Distance-generalized core decomposition. In The International Conference on Management of Data (SIGMOD). ACM, 1006--1023.
[14]
Moses Charikar. 2000. Greedy approximation algorithms for finding dense components in a graph. In Approximation Algorithms for Combinatorial Optimization, Third International Workshop (Lecture Notes in Computer Science), Vol. 1913. Springer, 84--95.
[15]
James Cheng, Yiping Ke, Shumo Chu, and M. Tamer Özsu. 2011. Efficient core decomposition in massive networks. In IEEE International Conference on Data Engineering (ICDE). 51--62.
[16]
Megan Dewar, Kirill Ternovsky, Benjamin Reiniger, John Proos, Pawel Pralat, Xavier Pérez-Giménez, and John Healy. 2018. Subhypergraphs in non-uniform random hypergraphs. Internet Math. 2018 (2018).
[17]
Yon Dourisboure, Filippo Geraci, and Marco Pellegrini. 2009. Extraction and classification of dense implicit communities in the web graph. ACM Transactions on the Web 3, 2 (2009), 1--36.
[18]
Tina Eliassi-Rad, Vito Latora, Martin Rosvall, and Ingo Scholtes. 2021. Higher-order graph models: from theoretical foundations to machine learning (Dagstuhl Seminar 21352). Dagstuhl Reports 11, 7 (2021), 139--178.
[19]
Yixiang Fang, Kaiqiang Yu, Reynold Cheng, Laks V. S. Lakshmanan, and Xuemin Lin. 2019. Efficient algorithms for densest subgraph discovery. Proc. VLDB Endow. 12, 11 (2019), 1719--1732.
[20]
Pit Fender and Guido Moerkotte. 2013. Counter strike: generic top-down join enumeration for hypergraphs. Proc. VLDB Endow. 6, 14 (2013), 1822--1833.
[21]
Andrew Finlayson. [n.d.]. OpenMP Scheduling. http://www.inf.ufsc.br/~bosco.sobral/ensino/ine5645/OpenMP_Dynamic_Scheduling.pdf.
[22]
Christoph Flamm, Bärbel M.R. Stadler, and Peter F. Stadler. 2015. Generalized topologies: hypergraphs, chemical reactions, and biological evolution. In Advances in Mathematical Chemistry and Applications. Bentham Science, 300--328.
[23]
Kasimir Gabert, Ali Pinar, and Ümit V. Çatalyürek. 2021. Shared-Memory Scalable k-Core Maintenance on Dynamic Graphs and Hypergraphs. In IEEE International Parallel and Distributed Processing Symposium. 998--1007.
[24]
Edoardo Galimberti, Francesco Bonchi, Francesco Gullo, and Tommaso Lanciano. 2020. Core decomposition in multilayer networks: theory, algorithms, and applications. ACM Trans. Knowl. Discov. Data 14, 1 (2020), 11:1--11:40.
[25]
Thomas Gaudelet, Noël Malod-Dognin, and Natasa Przulj. 2018. Higher-order molecular organization as a source of biological function. Bioinformatics 34, 17 (2018), i944--i953.
[26]
Andrew V. Goldberg. 1984. Finding a maximum density subgraph. University of California Berkeley.
[27]
Ronald L. Graham. 1969. Bounds on multiprocessing timing anomalies. SIAM journal on Applied Mathematics 17, 2 (1969), 416--429.
[28]
Ronald L. Graham, Martin Grötschel, and László Lovász (Eds.). 1996. Handbook of Combinatorics (Vol. 2). MIT Press.
[29]
Yi Han, Bin Zhou, Jian Pei, and Yan Jia. 2009. Understanding importance of collaborations in co-authorship networks: a supportiveness analysis approach. In SIAM International Conference on Data Mining (SDM). 1112--1123.
[30]
Jorge E. Hirsch. 2005. An index to quantify an individual's scientific research output. Proceedings of the National academy of Sciences 102, 46 (2005), 16569--16572.
[31]
Shuguang Hu, Xiaowei Wu, and T.-H. Hubert Chan. 2017. Maintaining densest subsets efficiently in evolving hypergraphs. In ACM International Conference on Information and Knowledge Management (CIKM). 929--938.
[32]
Jin Huang, Rui Zhang, and Jeffrey Xu Yu. 2015. Scalable hypergraph learning and processing. In IEEE International Conference on Data Mining (ICDM). 775--780.
[33]
Jiayang Jiang, Michael Mitzenmacher, and Justin Thaler. 2017. Parallel peeling algorithms. ACM Transactions on Parallel Computing 3, 1 (2017), 1--27.
[34]
Igor Kabiljo, Brian Karrer, Mayank Pundir, Sergey Pupyrev, Alon Shalita, Yaroslav Akhremtsev, and Alessandro Presta. 2017. Social hash partitioner: a scalable distributed hypergraph partitioner. Proc. VLDB Endow. 10, 11 (2017), 1418--1429.
[35]
Wissam Khaouid, Marina Barsky, Venkatesh Srinivasan, and Alex Thomo. 2015. K-core decomposition of large networks on a single PC. Proc. VLDB Endow. 9, 1 (2015), 13--23.
[36]
Maksim Kitsak, Lazaros K. Gallos, Shlomo Havlin, Fredrik Liljeros, Lev Muchnik, H. Eugene Stanley, and Hernán A. Makse. 2010. Identification of influential spreaders in complex networks. Nature physics 6, 11 (2010), 888--893.
[37]
Laks V.S. Lakshmanan. 2022. On a quest for combating filter bubbles and misinformation. In The International Conference on Management of Data (SIGMOD). ACM, 2.
[38]
Geon Lee, Jihoon Ko, and Kijung Shin. 2020. Hypergraph motifs: concepts, algorithms, and discoveries. Proc. VLDB Endow. 13, 11 (2020), 2256--2269.
[39]
Geon Lee, Jaemin Yoo, and Kijung Shin. 2022. Mining of Real-World Hypergraphs: Patterns, Tools, and Generators. In Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM). 5144--5147.
[40]
Robert Legenstein, Wolfgang Maass, Christos H. Papadimitriou, and Santosh S. Vempala. 2018. Long term memory and the densest k-subgraph problem. In Innovations in Theoretical Computer Science Conference (ITCS) (Leibniz International Proceedings in Informatics (LIPIcs)), Vol. 94. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 57:1--57:15.
[41]
Qing Liu, Xuliang Zhu, Xin Huang, and Jianliang Xu. 2021. Local algorithms for distance-generalized core decomposition over large dynamic graphs. Proc. VLDB Endow. 14, 9 (2021), 1531--1543.
[42]
Linyuan Lü, Tao Zhou, Qian-Ming Zhang, and H. Eugene Stanley. 2016. The H-index of a network node and its relation to degree and coreness. Nature communications 7, 1 (2016), 1--7.
[43]
Fragkiskos D. Malliaros, Maria-Evgenia G. Rossi, and Michalis Vazirgiannis. 2016. Locating influential nodes in complex networks. Scientific Reports 6, 19307 (2016).
[44]
Irene Malvestio, Alessio Cardillo, and Naoki Masuda. 2020. Interplay between k-core and community structure in complex networks. Scientific Reports 10, 14702 (2020).
[45]
Alberto Montresor, Francesco De Pellegrini, and Daniele Miorandi. 2012. Distributed k-core decomposition. IEEE Transactions on parallel and distributed systems 24, 2 (2012), 288--300.
[46]
Emad Ramadan, Arijit Tarafdar, and Alex Pothen. 2004. A hypergraph model for the yeast protein complex network. In International Parallel and Distributed Processing Symposium.
[47]
Ahmet Erdem Saríyüce, Buğra Gedik, Gabriela Jacques-Silva, Kun-Lung Wu, and Ümit V. Çatalyürek. 2013. Streaming algorithms for k-core decomposition. Proc. VLDB Endow. 6, 6 (2013), 433--444.
[48]
Ahmet Erdem Sariyüce, C. Seshadhri, Ali Pinar, and Ümit V. Çatalyürek. 2015. Finding the hierarchy of dense subgraphs using nucleus decompositions. In The International Conference on World Wide Web (WWW). ACM.
[49]
Julian Shun. 2020. Practical parallel hypergraph algorithms. In The ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. 232--249.
[50]
Bintao Sun, T.-H. Hubert Chan, and Mauro Sozio. 2020. Fully dynamic approximate k-core decomposition in hypergraphs. ACM Trans. Knowl. Discov. Data 14, 4 (2020), 39:1--39:21.
[51]
Justin Sybrandt, Ruslan Shaydulin, and Ilya Safro. 2022. Hypergraph partitioning with embeddings. IEEE Trans. Knowl. Data Eng. 34, 6 (2022), 2771--2782.
[52]
Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. ArnetMiner: extraction and mining of academic social networks. In ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 990--998.
[53]
Leo Torres, Ann Sizemore Blevins, Danielle S. Bassett, and Tina Eliassi-Rad. 2021. The why, how, and when of representations for complex systems. SIAM Rev. 63, 3 (2021), 435--485.
[54]
Jia Wang and James Cheng. 2012. Truss decomposition in massive networks. Proc. VLDB Endow. 5, 9 (2012), 812--823.
[55]
Joyce Jiyoung Whang, Rundong Du, Sangwon Jung, Geon Lee, Barry L. Drake, Qingqing Liu, Seonggoo Kang, and Haesun Park. 2020. MEGA: multi-view semi-supervised clustering of hypergraphs. Proc. VLDB Endow. 13, 5 (2020), 698--711.
[56]
Xin Xia, Hongzhi Yin, Junliang Yu, Qinyong Wang, Lizhen Cui, and Xiangliang Zhang. 2021. Self-supervised hypergraph convolutional networks for session-based recommendation. In AAAI Conference on Artificial Intelligence. 4503--4511.
[57]
Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems 42, 1 (2015), 181--213.

Cited By

View all
  • (2024)Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent QueriesProceedings of the VLDB Endowment10.14778/3681954.368197917:11(3003-3016)Online publication date: 30-Aug-2024
  • (2024)SpeedCore: Space-efficient and Dependency-aware GPU Parallel Framework for Core DecompositionProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673111(555-564)Online publication date: 12-Aug-2024
  • (2024)MCR-Tree: An Efficient Index for Multi-dimensional Core SearchProceedings of the ACM on Management of Data10.1145/36549562:3(1-25)Online publication date: 30-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 16, Issue 9
May 2023
330 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 May 2023
Published in PVLDB Volume 16, Issue 9

Check for updates

Badges

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)36
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent QueriesProceedings of the VLDB Endowment10.14778/3681954.368197917:11(3003-3016)Online publication date: 30-Aug-2024
  • (2024)SpeedCore: Space-efficient and Dependency-aware GPU Parallel Framework for Core DecompositionProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673111(555-564)Online publication date: 12-Aug-2024
  • (2024)MCR-Tree: An Efficient Index for Multi-dimensional Core SearchProceedings of the ACM on Management of Data10.1145/36549562:3(1-25)Online publication date: 30-May-2024
  • (2024)Hierarchical Structure Construction on HypergraphsProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679765(1597-1606)Online publication date: 21-Oct-2024
  • (2024)Estimate and Reduce Uncertainty in Uncertain Graphs2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA61799.2024.10722821(1-12)Online publication date: 6-Oct-2024
  • (2023)Hypercore decomposition for non-fragile hyperedges: concepts, algorithms, observations, and applicationsData Mining and Knowledge Discovery10.1007/s10618-023-00956-237:6(2389-2437)Online publication date: 8-Aug-2023

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media