Abstract
Keyword search is an effective paradigm for information discovery and has been introduced recently to query XML documents. Scoring of XML search results is an important issue in XML keyword search. Traditional “bag-of-words” model cannot differentiate the roles of keywords as well as the relationship between keywords, thus is not proper for XML keyword queries. In this paper, we present a new scoring method based on a novel query model, called keyword query with structure (QWS), which is specially designed for XML keyword query. The method is based on a totally new view taken by the QWS model on a keyword query that, a keyword query is a composition of several query units, each representing a query condition. We believe that this method captures the semantic relevance of the search results. The paper first introduces an algorithm reformulating a keyword query to a QWS. Then, a scoring method is presented which measures the relevance of search results according to how many and how well the query conditions are matched. The scoring method is also extended to clusters of search results. Experimental results verify the effectiveness of our methods.
Similar content being viewed by others
References
Chen Y, Wang W, Liu Z et al (2009) Keyword search on structured and semi-structured data. In: Proceedings of SIGMOD, pp 1005–1010
Liu Z, Chen Y (2011) Processing keyword search on XML: a survey. World Wide Web 14(5–6):671–707
Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge
Liu Z, Chen Y (2007) Identifying meaningful return information for XML keyword search. In: Proceedings of SIGMOD, pp 329–340
Liu Z, Chen Y (2010) Return specification inference and result clustering for keyword search on XML. ACM Trans Database Syst 35(2):1–47
Bao Z, Ling TW, Chen B et al (2009) Effective XML keyword search with relevance oriented ranking. In: Proceedings of ICDE, pp 517–528
Bao Z, Lu J, Ling TW et al (2010) Towards an effective XML keyword search. IEEE Trans Knowl Data Eng 22(8):1077–1092
Paparizos S (2013) Entwining structure into web search. In: Proceedings of DBRank, pp 1–4
Patil R, Chen Z (2012) STRUCT: incorporating contextual information for english query search on relational databases. In: Proceedings of KEYS, pp 11–22
Zeng Z, Bao Z, Le TN et al (2014) ExpressQ: identifying keyword context and search target in relational keyword queries. In: Proceedings of CIKM, pp 31–40
Guo L, Shao F, Botev C et al (2003) XRANK: ranked keyword search over XML documents. In: Proceedings of SIGMOD, pp 16–27
Cohen S, Mamou J, Kanza Y et al (2003) XSEarch: a semantic search engine for XML. In: Proceedings of VLDB, pp 45–56
Li G, Ooi BC, Feng J et al (2008) EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: Proceedings of SIGMOD, pp 903–914
Li J, Liu C, Zhou R et al (2014) XML keyword search with promising result type recommendations. World Wide Web 17(1):127–159
Carpineto C, Osiński S, Romano G et al (2009) A survey of web clustering engines. ACM Comput Surv 1(3):17
Moreno JG, Dias G, Cleuziou G (2014) Query log driven web search results clustering. In: Proceedings of SIGIR, pp 777–786
Liu X, Wan C, Chen L (2011) Returning Clustered results for keyword search on XML documents. IEEE Trans Knowl Data Eng 23(12):1811–1825
Wu Y, Yang S, Srivatsa M et al (2013) Summarizing answer graphs induced by keyword queries. Proc VLDB Endow 6(14):1774–1785
Liu X, Chen L, Wan C et al (2013) Exploiting structures in keyword queries for effective XML search. Inf Sci 240:56–71
Goldman R, Widom J (1997) DataGuides: enabling query formulation and optimization in semistructured databases. In: Proceedings of VLDB, pp 436–445
Scaiella U, Ferragina P, Marino A et al (2012) Topical clustering of search results. In: Proceedings of WSDM, pp 223–232
(2013) The lemur toolkit for language modelling and information retrieval. www.lemurproject.org/
Ley M (2013) DBLP bibliography. www.informatik.uni-trier.de/~ley/db/
Mikla G (2002) The Mondial dataset. http://www.cs.washington.edu/research/xmldatasets/
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, X., Wan, C. & Liu, D. Keyword query with structure: towards semantic scoring of XML search results. Inf Technol Manag 17, 151–163 (2016). https://doi.org/10.1007/s10799-015-0247-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10799-015-0247-z