[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Semantic SPARQL similarity search over RDF knowledge graphs

Published: 01 July 2016 Publication History

Abstract

RDF knowledge graphs have attracted increasing attentions these years. However, due to the schema-free nature of RDF data, it is very difficult for users to have full knowledge of the underlying schema. Furthermore, the same kind of information can be represented in diverse graph fragments. Hence, it is a huge challenge to formulate complex SPARQL expressions by taking the union of all possible structures.
In this paper, we propose an effective framework to access the RDF repository even if users have no full knowledge of the underlying schema. Specifically, given a SPARQL query, the system could return as more answers that match the query based on the semantic similarity as possible. Interestingly, we propose a systematic method to mine diverse semantically equivalent structure patterns. More importantly, incorporating both structural and semantic similarities we are the first to propose a novel similarity measure, semantic graph edit distance. In order to improve the efficiency performance, we apply the semantic summary graph to summarize the knowledge graph, which supports both high-level pruning and drill-down pruning. We also devise an effective lower bound based on the TA-style access to each of the candidate sets. Extensive experiments over real datasets confirm the effectiveness and efficiency of our approach.

References

[1]
R. Angles and C. Gutiérrez. Querying RDF data from a graph database perspective. In ESWC, 2005.
[2]
J. Cheng, X. Zeng, and J. X. Yu. Top-k graph pattern matching over large graphs. In ICDE, 2013.
[3]
A. Fader, S. Soderland, and O. Etzioni. Identifying relations for open information extraction. In EMNLP, 2011.
[4]
H. He, H. Wang, J. Yang, and P. S. Yu. BLINKS: ranked keyword searches on graphs. In SIGMOD, 2007.
[5]
J. Hoffart, F. M. Suchanek, K. Berberich, and G. Weikum. YAGO2: A spatially and temporally enhanced knowledge base from wikipedia. Artif. Intell., 194, 2013.
[6]
H. W. Kuhn. The hungarian method for the assignment problem. In Naval Research Logistics, 1955.
[7]
A. Khan, Y. Wu, C. C. Aggarwal, and X. Yan. Nema: Fast graph search with label similarity. PVLDB, 6(3), 2013.
[8]
A. Khan, X. Yan, and K.-L. Wu. Towards proximity pattern mining in large graphs. In SIGMOD Conference, 2010.
[9]
W. Le, F. Li, A. Kementsietsidis, and S. Duan. Scalable keyword search on large RDF data. IEEE TKDE, 26(11), 2014.
[10]
N. Nakashole, T. Tylenda, and G. Weikum. Fine-grained semantic typing of emerging entities. In ACL, 2013.
[11]
N. Nakashole, G. Weikum, and F. M. Suchanek. PATTY: A taxonomy of relational patterns with semantic types. In EMNLP-CoNLL, 2012.
[12]
T. Neumann and G. Weikum. RDF-3X: a risc-style engine for RDF. PVLDB, 1(1), 2008.
[13]
E. Rahm and P. A. Bernstein. A survey of approaches to automatic schema matching. VLDB J., 10(4), 2001.
[14]
K. Riesen, S. Fankhauser, and H. Bunke. Speeding up graph edit distance computation with a bipartite heuristic. In MLG, 2007.
[15]
P. Shvaiko and J. Euzenat. A survey of schema-based matching approaches. 2005.
[16]
P. Shvaiko and J. Euzenat. Ontology matching: State of the art and future challenges. IEEE Trans. Knowl. Data Eng., 25(1), 2013.
[17]
R. Singh, J. Xu, and B. Berger. Global alignment of multiple protein interaction networks. In Biocomputing, 2008.
[18]
Y. Tian, R. C. McEachin, C. Santos, D. J. States, and J. M. Patel. Saga: a subgraph matching tool for biological graphs. Bioinformatics, 23(2), 2007.
[19]
Y. Tian and J. M. Patel. Tale: A tool for approximate large graph matching. In ICDE, 2008.
[20]
T. Tran, H. Wang, S. Rudolph, and P. Cimiano. Top-k exploration of query candidates for efficient keyword search on graph-shaped (RDF) data. In ICDE, 2009.
[21]
W. Wu, H. Li, H. Wang, and K. Q. Zhu. Probase: a probabilistic taxonomy for text understanding. In SIGMOD, 2012.
[22]
Y. Wu, S. Yang, and X. Yan. Ontology-based subgraph querying. In ICDE, 2013.
[23]
S. Yang, Y. Wu, H. Sun, and X. Yan. Schemaless and structureless graph querying. PVLDB, 7(7), 2014.
[24]
S. Zhang, J. Yang, and W. Jin. Sapper: Subgraph indexing and approximate matching in large graphs. PVLDB, 3(1), 2010.
[25]
X. Zhao, C. Xiao, X. Lin, Q. Liu, and W. Zhang. A partition-based approach to structure similarity search. PVLDB, 7(3), 2013.
[26]
W. Zheng, L. Zou, X. Lian, D. Wang, and D. Zhao. Efficient graph similarity search over large graph databases. TKDE, 27(4), 2015.
[27]
W. Zheng, L. Zou, X. Lian, and D. Zhao. Graph similarity search with edit distance constraint in large graph databases. In CIKM, 2013.
[28]
L. Zou, J. Mo, L. Chen, M. T. Özsu, and D. Zhao. gstore: Answering SPARQL queries via subgraph matching. PVLDB, 4(8), 2011.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 9, Issue 11
July 2016
60 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 July 2016
Published in PVLDB Volume 9, Issue 11

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)101
  • Downloads (Last 6 weeks)13
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Querying knowledge graphs through positive and negative examples and feedbackJournal of Intelligent Information Systems10.1007/s10844-024-00846-z62:5(1165-1186)Online publication date: 1-Oct-2024
  • (2024)Interactive SPARQL query formulation using provenanceKnowledge and Information Systems10.1007/s10115-023-01939-x66:3(2165-2191)Online publication date: 1-Mar-2024
  • (2023)Non-monotonic Generation of Knowledge Paths for Context UnderstandingACM Transactions on Management Information Systems10.1145/362799415:1(1-28)Online publication date: 20-Oct-2023
  • (2023)Knowledge Graphs QueryingACM SIGMOD Record10.1145/3615952.361595652:2(18-29)Online publication date: 11-Aug-2023
  • (2023)A transformer framework for generating context-aware knowledge graph pathsApplied Intelligence10.1007/s10489-023-04588-353:20(23740-23767)Online publication date: 14-Jul-2023
  • (2022)Learning to transfer knowledge from RDF Graphs with gated recurrent unitsIntelligent Data Analysis10.3233/IDA-21591926:3(679-694)Online publication date: 1-Jan-2022
  • (2022)Optimisation Techniques for Flexible SPARQL QueriesACM Transactions on the Web10.1145/353285516:4(1-44)Online publication date: 16-Nov-2022
  • (2022)Approximate and Interactive Processing of Aggregate Queries on Knowledge Graphs: A DemonstrationProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557158(5034-5038)Online publication date: 17-Oct-2022
  • (2022)A subgraph matching algorithm based on subgraph index for knowledge graphFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-020-0360-y16:3Online publication date: 1-Jun-2022
  • (2021)RDF knowledge graph keyword type search using frequent patternsJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-21095041:1(2239-2253)Online publication date: 1-Jan-2021
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media