[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3632410.3632459acmotherconferencesArticle/Chapter ViewAbstractPublication PagescomadConference Proceedingsconference-collections
research-article
Open access

VeNoM: Approximate Subgraph Matching with Enhanced Neighbourhood Structural Information

Published: 04 January 2024 Publication History

Abstract

Subgraph matching is an important research problem in the area of graph mining. Over the years, researchers have made significant headway in this direction with many state-of-the-art algorithms aimed at providing an efficient solution for approximate subgraph matching (ASM). Although many studies have been conducted to compare these and highlight their respective advantages and differences, little analysis has been done on how varying the different aspects of an ASM approach including the depth and breadth of neighborhood affect the performance. In this paper, we propose VeNoM, a variant of a state-of-the-art ASM algorithm VELSET, and present different extensions of it by parameterizing the breadth and depth of the neighborhood considered. We discuss the effects of these neighborhood parameters and other graph parameters on performance through empirical results over diverse datasets. We also compare the VeNoM instances against VerSaChI, a two-hop neighborhood similarity based ASM approach. The empirical results suggest that increasing the depth of a neighborhood can increase the accuracy of ASM significantly, although it requires a much longer running time. The breadth of the neighborhood for a vertex does not matter much as long as it is more than a single edge.

References

[1]
Shubhangi Agarwal, Sourav Dutta, and Arnab Bhattacharya. 2021. VerSaChI: Finding Statistically Significant Subgraph Matches using Chebyshev’s Inequality. In CIKM. ACM, 2812–2816.
[2]
Réka Albert and Albert-László Barabási. 2002. Statistical mechanics of complex networks. Rev. Mod. Phys. 74 (Jan 2002), 47–97. Issue 1.
[3]
F. Bi, L. Chang, X. Lin, L. Qin, and W. Zhang. 2016. Efficient Subgraph Matching by Postponing Cartesian Products. In SIGMOD. 1199–1214.
[4]
Vincenzo Bonnici and Rosalba Giugno. 2016. On the variable ordering in Subgraph Isomorphism Algorithms. IEEE/ACM transactions on computational biology and bioinformatics 14, 1 (2016), 193–203.
[5]
Chen Chen, Xifeng Yan, S Yu Philip, Jiawei Han, Dong Qing Zhang, and Xiaohui Gu. 2007. Towards Graph Containment Search and Indexing. In PVLDB. 926–937.
[6]
Wei Chen, Jia Liu, Ziyang Chen, Xian Tang, and Kaiyu Li. 2018. PBSM: An efficient Top-K subgraph matching algorithm. International Journal of Pattern Recognition and Artificial Intelligence 32, 06 (2018), 1850020.
[7]
James Cheng, Yiping Ke, Wilfred Ng, and An Lu. 2007. FG-Index: Towards Verification-free Query Processing on Graph Databases. In SIGMOD. 857–872.
[8]
Stephen A. Cook. 1971. The Complexity of Theorem-Proving Procedures. In Proceedings of the Third Annual ACM Symposium on Theory of Computing (Shaker Heights, Ohio, USA) (STOC ’71). 8 pages.
[9]
Luigi P Cordella, Pasquale Foggia, Carlo Sansone, and Mario Vento. 2004. A (sub) graph isomorphism algorithm for matching large graphs. IEEE transactions on pattern analysis and machine intelligence 26, 10 (2004), 1367–1372.
[10]
Sourav Dutta, Pratik Nayek, and Arnab Bhattacharya. 2017. Neighbor-aware search for approximate labeled graph matching using the chi-square statistics. In WWW’17. 1281–1290.
[11]
Myoungji Han, Hyunjoon Kim, Geonmo Gu, Kunsoo Park, and Wook-Shin Han. 2019. Efficient Subgraph Matching: Harmonizing dynamic programming, adaptive matching order, and failing set together. In Proceedings of the 2019 International Conference on Management of Data. 1429–1446.
[12]
Wook-Shin Han, Jinsoo Lee, and Jeong-Hoon Lee. 2013. Turboiso: towards ultrafast and robust subgraph isomorphism search in large graph databases. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. 337–348.
[13]
Wook Shin Han, Jinsoo Lee, Minh Duc Pham, and Jeffrey Xu Yu. 2010. iGraph: A Framework for Comparisons of Disk-based Graph Indexing Techniques. PVLDB 3, 1-2 (2010), 449–459.
[14]
Sen Hu, Lei Zou, Jeffrey Xu Yu, Haixun Wang, and Dongyan Zhao. 2018. Answering Natural Language Questions by Subgraph Matching over Knowledge Graphs. IEEE TKDE 30, 5 (2018), 824–837.
[15]
Yi Jia, Jintao Zhang, and Jun Huan. 2011. An Efficient Graph-mining Method for Complicated and Noisy Data with Real-world Applications. Knowledge and Information Systems 28, 2 (2011), 423–447.
[16]
Foteini Katsarou, Nikos Ntarmos, and Peter Triantafillou. 2015. Performance and Scalability of Indexed Subgraph Query Processing Methods. Proc. VLDB Endow. 8, 12 (Aug 2015), 12 pages.
[17]
Arijit Khan, Yinghui Wu, Charu C. Aggarwal, and Xifeng Yan. 2013. NeMa: Fast Graph Search with Label Similarity. Proc. VLDB Endow. 6, 3 (Jan. 2013), 12 pages.
[18]
Segla Kpodjedo, Philippe Galinier, and Giulio Antoniol. 2014. Using local similarity measures to efficiently address approximate graph matching. Discrete Applied Mathematics 164 (2014), 161–177.
[19]
Jinsoo Lee, Wook-Shin Han, Romans Kasperovics, and Jeong-Hoon Lee. 2012. An In-Depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases. Proc. VLDB Endow. 6, 2 (Dec 2012), 12 pages.
[20]
Lihui Liu, Boxin Du, Hanghang Tong, 2019. G-Finder: Approximate Attributed Subgraph Matching. In 2019 IEEE International Conference on Big Data (Big Data). IEEE, 513–522.
[21]
Tinghuai Ma, Siyang Yu, Jie Cao, Yuan Tian, Abdullah Al-Dhelaan, and Mznah Al-Rodhaan. 2018. A comparative study of subgraph matching isomorphic methods in social networks. IEEE Access 6 (2018).
[22]
Misael Mongiovi, Raffaele Di Natale, Rosalba Guigno, Alfredo Pulvirenti, Alfredo Ferro, and Roded Sharan. 2010. SIGMA: A Set-Cover-Based Inexact Graph Matching Algo. Journal of Bioinformatics and Computational Biology 8, 2 (2010), 199–218.
[23]
Xuguang Ren and Junhu Wang. 2015. Exploiting vertex relationships in speeding up subgraph isomorphism over large graphs. Proceedings of the VLDB Endowment 8, 5 (2015), 617–628.
[24]
Kaspar Riesen, Xiaoyi Jiang, and Horst Bunke. 2010. Exact and Inexact Graph Matching: Methodology and Applications. Managing and Mining Graph Data (2010), 217–247.
[25]
Carlos R Rivero and Hasan M Jamil. 2017. Efficient and scalable labeled subgraph matching using SGMatch. Knowledge and Information Systems 51, 1 (2017), 61–87.
[26]
R. A. Rossi and N. K. Ahmed. 2015. The Network Data Repository with Interactive Graph Analytics and Visualization. In AAAI. 4292–4293.
[27]
Theyvaa Sangkaran, Azween Abdullah, and NZ Jhanjhi. 2020. Community Detection Based on Isomorphic Subgraph Analytics in Criminal Network. IJCSNS 20, 5 (2020), 94.
[28]
Theyvaa Sangkaran, Azween Abdullah, NZ JhanJhi, and Mahadevan Supramaniam. 2019. Survey on isomorphic graph algorithms for graph analytics. IJCSNS 19, 1 (2019), 85–92.
[29]
Ms Rachna Somkunwar and Vinod Moreshwar Vaze. 2017. Subgraph Isomorphism Algorithms for Matching Graphs: A Survey. IJETT 1, 1 (2017).
[30]
Junshuai Song, Xiaoru Qu, Zehong Hu, Zhao Li, Jun Gao, and Ji Zhang. 2021. A subgraph-based knowledge reasoning method for collective fraud detection in E-commerce. Neurocomputing 461 (2021), 587–597.
[31]
Shixuan Sun and Qiong Luo. 2019. Scaling Up Subgraph Query Processing with Efficient Subgraph Matching. In ICDE. 220–231.
[32]
Shixuan Sun and Qiong Luo. 2020. In-Memory Subgraph Matching: An In-Depth Study. In ICMD (Portland, OR, USA) (SIGMOD). 16 pages.
[33]
Yunhao Sun, Guanyu Li, Jingjing Du, Bo Ning, and Heng Chen. 2022. A subgraph matching algorithm based on subgraph index for knowledge graph. Frontiers of Computer Science 16 (2022), 1–18.
[34]
Yuanyuan Tian and Jignesh M Patel. 2008. Tale: A tool for approximate large graph matching. In 2008 IEEE 24th International Conference on Data Engineering. IEEE, 963–972.
[35]
Junchi Yan, Xu-Cheng Yin, Weiyao Lin, Cheng Deng, Hongyuan Zha, and Xiaokang Yang. 2016. A Short Survey of Recent Advances in Graph Matching. In ICMR (New York, New York, USA). 167–174.
[36]
Xifeng Yan, Philip S. Yu, and Jiawei Han. 2005. Graph Indexing Based on Discriminative Frequent Structure Analysis. Transactions on Database Systems 30, 4 (2005), 960–993.
[37]
Xifeng Yan, Philip S. Yu, and Jiawei Han. 2005. Substructure Similarity Search in Graph Databases. In SIGMOD. 766–777.
[38]
Shuo Zhang, Jianzhong Li, Hong Gao, and Zhaonian Zou. 2009. A Novel Approach for Efficient Supergraph Query Processing on Graph Databases. In International Conference on Extending Database Technology (EDBT). 204–215.
[39]
Shijie Zhang, Jiong Yang, and Wei Jin. 2010. SAPPER: Subgraph Indexing and Approximate Matching in Large Graphs. PVLDB 3, 1-2 (2010), 1185–1194.
[40]
Peixiang Zhao and Jiawei Han. 2010. On graph query optimization in large networks. Proceedings of the VLDB Endowment 3, 1-2 (2010), 340–351.
[41]
Lei Zou, Lei Chen, Jeffrey Xu Yu, and Yansheng Lu. 2008. A Novel Spectral Coding in a Large Graph Database. In International Conference on Extending Database Technology (EDBT). 181–192.

Index Terms

  1. VeNoM: Approximate Subgraph Matching with Enhanced Neighbourhood Structural Information

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CODS-COMAD '24: Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD)
    January 2024
    627 pages
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 January 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Approximate Matching
    2. Chi-Square
    3. Labeled Graph
    4. Statistical Significance
    5. Subgraph Similarity

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CODS-COMAD 2024

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 229
      Total Downloads
    • Downloads (Last 12 months)229
    • Downloads (Last 6 weeks)52
    Reflects downloads up to 13 Dec 2024

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media