DOI: 10.1145/1247480.1247516

BLINKS: ranked keyword searches on graphs

Published: 11 June 2007

Abstract

Query processing over graph-structured data is enjoying a growing number of applications. A top-k keyword search query on a graph finds the top k answers according to some ranking criteria, where each answer is a substructure of the graph containing all query keywords. Current techniques for supporting such queries on general graphs suffer from several drawbacks, e.g., poor worst-case performance, not taking full advantage of indexes, and high memory requirements. To address these problems, we propose BLINKS, a bi-level indexing and query processing scheme for top-k keyword search on graphs. BLINKS follows a search strategy with provable performance bounds, while additionally exploiting a bi-level index for pruning and accelerating the search. To reduce the index space, BLINKS partitions a data graph into blocks: The bi-level index stores summary information at the block level to initiate and guide search among blocks, and more detailed information for each block to accelerate search within blocks. Our experiments show that BLINKS offers orders-of-magnitude performance improvement over existing approaches.
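To make the query model concrete, here is a minimal sketch of a top-k keyword search on a small directed graph. It is a naive baseline, not BLINKS itself: it uses no index and no block partitioning. The scoring rule (sum of shortest-path distances from a candidate root to the nearest node containing each keyword), the function name `top_k_roots`, and the example data are all assumptions made for illustration; the abstract only says that answers are ranked by "some ranking criteria".

```python
import heapq
from collections import defaultdict


def top_k_roots(edges, keywords_of, query, k):
    """Return up to k (score, root) pairs, lowest score first.

    Assumed scoring (not taken from the abstract): a root's score is the
    sum, over query keywords, of the shortest directed path length from
    the root to any node containing that keyword.
    """
    # Reverse adjacency lets each keyword expand backward to the nodes
    # that can reach it.
    reverse = defaultdict(list)
    nodes = set(keywords_of)
    for src, dst, weight in edges:
        reverse[dst].append((src, weight))
        nodes.update((src, dst))

    def distances_to(keyword):
        # Multi-source Dijkstra starting from every node with the keyword.
        dist = {n: 0 for n in nodes if keyword in keywords_of.get(n, ())}
        heap = [(0, n) for n in dist]
        heapq.heapify(heap)
        while heap:
            d, node = heapq.heappop(heap)
            if d > dist[node]:
                continue  # stale heap entry
            for prev, weight in reverse[node]:
                nd = d + weight
                if nd < dist.get(prev, float("inf")):
                    dist[prev] = nd
                    heapq.heappush(heap, (nd, prev))
        return dist

    per_keyword = [distances_to(kw) for kw in query]
    # A node qualifies as a root only if it reaches every query keyword.
    scored = [
        (sum(dist[n] for dist in per_keyword), n)
        for n in nodes
        if all(n in dist for dist in per_keyword)
    ]
    return heapq.nsmallest(k, scored)


# Hypothetical example graph: (source, target, edge weight).
EDGES = [("a", "b", 1), ("a", "c", 2), ("d", "b", 1), ("d", "c", 1), ("b", "c", 5)]
KEYWORDS_OF = {"b": {"graph"}, "c": {"index"}}

print(top_k_roots(EDGES, KEYWORDS_OF, ["graph", "index"], 2))
# prints [(2, 'd'), (3, 'a')]
```

This baseline runs one full backward expansion per keyword over the whole graph before ranking anything. The abstract's stated goal is to avoid that kind of exhaustive work by using block-level summaries to prune and guide the search.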


Published In

SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
June 2007
1210 pages
ISBN: 9781595936868
DOI: 10.1145/1247480
  • General Chairs: Lizhu Zhou, Tok Wang Ling
  • Program Chair: Beng Chin Ooi

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. graphs
  2. indexing
  3. keyword search
  4. ranking

Qualifiers

  • Article

Conference

SIGMOD/PODS07

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Article Metrics

  • Downloads (last 12 months): 37
  • Downloads (last 6 weeks): 5
Reflects downloads up to 01 Jan 2025

Cited By

  • (2024) Contigra: Graph Mining with Containment Constraints. Proceedings of the Nineteenth European Conference on Computer Systems, pp. 50-65. DOI: 10.1145/3627703.3629589. Online publication date: 22-Apr-2024
  • (2024) A Fast Hop-Biased Approximation Algorithm for the Quadratic Group Steiner Tree Problem. Proceedings of the ACM Web Conference 2024, pp. 312-321. DOI: 10.1145/3589334.3645325. Online publication date: 13-May-2024
  • (2024) KS-GNN: Keyword Search via Graph Neural Network for Web API Recommendation. IEEE Transactions on Network and Service Management 21:5, pp. 5464-5474. DOI: 10.1109/TNSM.2024.3420072. Online publication date: Oct-2024
  • (2024) Efficient Skyline Keyword-Based Tree Retrieval on Attributed Graphs. IEEE Transactions on Knowledge and Data Engineering 36:11, pp. 6056-6070. DOI: 10.1109/TKDE.2024.3388988. Online publication date: 1-Nov-2024
  • (2024) Enriching Simple Keyword Queries for Domain-Aware Narrative Retrieval. Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries, pp. 143-154. DOI: 10.1109/JCDL57899.2023.00029. Online publication date: 26-Jun-2024
  • (2024) DKWS: A Distributed System for Keyword Search on Massive Graphs (Extended Abstract). 2024 IEEE 40th International Conference on Data Engineering (ICDE), pp. 5689-5690. DOI: 10.1109/ICDE60146.2024.00476. Online publication date: 13-May-2024
  • (2024) Authenticated Keyword Search on Large-Scale Graphs in Hybrid-Storage Blockchains. 2024 IEEE 40th International Conference on Data Engineering (ICDE), pp. 1958-1971. DOI: 10.1109/ICDE60146.2024.00155. Online publication date: 13-May-2024
  • (2024) GDRMA: Graph Neural Networks for Document Retrievals with Mean Aggregation. IEEE Access, pp. 1-1. DOI: 10.1109/ACCESS.2024.3512597. Online publication date: 2024
  • (2024) Efficient Top-k Keyword Search in Relational Databases Considering Integrated Candidate Network. IEEE Access 12, pp. 173775-173791. DOI: 10.1109/ACCESS.2024.3433466. Online publication date: 2024
  • (2024) An effective keyword search co-occurrence multi-layer graph mining approach. Machine Learning 113:8, pp. 5773-5806. DOI: 10.1007/s10994-024-06528-9. Online publication date: 2-Apr-2024
