Abstract
This paper presents a Query Auto-Completion (QAC) framework that aims at assisting users in a digital library to specify their search intent with reduced effort. The proposed system suggests metadata-based facets to users as they specify their queries into the system. In this work, we model the facet-based QAC problem as frequent pattern mining problem where the system aims at leveraging association among different facet combinations. Among several frequent pattern mining algorithms, the present work make use of FP-Growth to discover facet patterns at large-scale. These facet patterns represented in form of association rules are used for online query auto-completion or suggestion. A prototype QAC augmented digital library search system is implemented by considering a limited bibliographic dataset (35K resources) of the National Digital Library of India (NDLI: https://ndl.iitkgp.ac.in) portal. We perform extensive experiments to measure the quality of query suggestions and QAC augmented retrieval performance. Significant improvement over baseline search system is observed in both the aspects mentioned above.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Niu, X., Hemminger, B.: Analyzing the interaction patterns in a faceted search interface. J. Assoc. Inf. Sci. Technol. 66(5), 1030–1047 (2015)
Huston, S., Croft, W.B.: Evaluating verbose query processing techniques. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, pp. 291–298. ACM, New York (2010)
Sadhu, S., Bhowmick, P.K.: Automatic segmentation and semantic annotation of verbose queries in digital library. In: Méndez, E., Crestani, F., Ribeiro, C., David, G., Lopes, J.C. (eds.) TPDL 2018. LNCS, vol. 11057, pp. 270–276. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00066-0_23
Ma, H., Yang, H., King, I., Lyu, M.R.: Learning latent semantic relations from clickthrough data for query suggestion. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp. 709–718. ACM (2008)
Beeferman, D., Berger, A.: Agglomerative clustering of a search engine query log. In: KDD, vol. 2000, pp. 407–416 (2000)
Wen, J.-R., Nie, J.-Y., Zhang, H.-J.: Clustering user queries of a search engine. In: Proceedings of the 10th International Conference on World Wide Web, pp. 162–168. Citeseer (2001)
Baeza-Yates, R., Hurtado, C., Mendoza, M.: Query recommendation using query logs in search engines. In: Lindner, W., Mesiti, M., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds.) EDBT 2004. LNCS, vol. 3268, pp. 588–596. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30192-9_58
Cao, H., et al.: Context-aware query suggestion by mining click-through and session data. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 875–883. ACM (2008)
Shokouhi, M., Radinsky, K.: Time-sensitive query auto-completion. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 601–610. ACM (2012)
Shokouhi, M.: Learning to personalize query auto-completion. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 103–112. ACM (2013)
Bar-Yossef, Z., Kraus, N.: Context-sensitive query auto-completion. In: Proceedings of the 20th International Conference on World Wide Web, pp. 107–116. ACM (2011)
Cai, F., Liang, S., De Rijke, M.: Time-sensitive personalized query auto-completion. In: Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, pp. 1599–1608. ACM (2014)
Zhang, A., et al.: adaQAC: adaptive query auto-completion via implicit negative feedback. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 143–152. ACM (2015)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. ACM SIGMOD Rec. 29, 1–12 (2000)
Agrawal, R., Srikant, R., et al.: Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases, VLDB, vol. 1215, pp. 487–499 (1994)
Jiang, J.-Y., Ke, Y.-Y., Chien, P.-Y., Cheng, P.-J.: Learning user reformulation behavior for query auto-completion. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 445–454. ACM (2014)
Whiting, S., Jose, J.M.: Recent and robust query auto-completion. In: Proceedings of the 23rd International Conference on World Wide Web, pp. 971–982. ACM (2014)
Cai, F., Liang, S., de Rijke, M.: Prefix-adaptive and time-sensitive personalized query auto completion. IEEE Trans. Knowl. Data Eng. 28(9), 2452–2466 (2016)
Cai, F., de Rijke, M.: Selectively personalizing query auto-completion. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 993–996. ACM (2016)
Duan, H., Hsu, B.-J.P.: Online spelling correction for query completion. In: Proceedings of the 20th International Conference on World Wide Web, pp. 117–126. ACM (2011)
Acknowledgement
This work is supported by IBM Research through Shared University Research grant and Ministry of Human Resource Development, Government of India the National Digital Library of India project.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Appendix: List of Tasks for Assessment
Appendix: List of Tasks for Assessment
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Sadhu, S., Bhowmick, P.K. (2019). Metadata-Based Automatic Query Suggestion in Digital Library Using Pattern Mining. In: Jatowt, A., Maeda, A., Syn, S. (eds) Digital Libraries at the Crossroads of Digital Information for the Future. ICADL 2019. Lecture Notes in Computer Science(), vol 11853. Springer, Cham. https://doi.org/10.1007/978-3-030-34058-2_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-34058-2_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34057-5
Online ISBN: 978-3-030-34058-2
eBook Packages: Computer ScienceComputer Science (R0)