Abstract
In this paper, we describe a terms propagation method for focussed XML component retrieval. Focussed XML component retrieval is one of the most important challenges in the XML IR field. The aim of the focussed retrieval approach is to find the most exhaustive and specific elements that focus on the user's need. These needs can be expressed through content queries composed of simple keywords. Our method provides a natural representation of a document, its elements and its content, and allows the automatic selection of a combination of elements that better answers the user's query. We show the efficiency of the terms propagation method using a term weighting formula that takes into account the size of the nodes and the size of the document. Our method has been evaluated on the «Focused» task of INEX 2006 and compared to the XFIRM model, which is based on a relevance propagation method. The evaluations show a significant improvement in retrieval effectiveness.
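The abstract does not give the weighting formula, so the following is only a minimal sketch of the general idea, under stated assumptions: terms found in the text of each element are propagated upwards so that every element is represented by the terms of its whole subtree, and each term is then weighted using the size of the node and the size of the document. The function names (`propagate_terms`, `weight`, `score`), the sample document and, above all, the weighting expression are hypothetical placeholders, not the formula evaluated in the paper.

```python
import math
import xml.etree.ElementTree as ET
from collections import Counter


def propagate_terms(node, index):
    """Collect term counts bottom-up: each element receives its own terms
    plus those propagated from all of its descendants."""
    counts = Counter((node.text or "").lower().split())
    for child in node:
        counts += propagate_terms(child, index)
        # Text that follows a child element still belongs to the parent.
        counts.update((child.tail or "").lower().split())
    index[node] = counts
    return counts


def weight(term, counts, doc_size):
    """Placeholder weight (an assumption, not the paper's formula):
    term frequency normalised by node size, scaled by how small the
    node is relative to the whole document."""
    node_size = sum(counts.values())
    if node_size == 0 or counts[term] == 0:
        return 0.0
    return (counts[term] / node_size) * math.log(1 + doc_size / node_size)


def score(query_terms, counts, doc_size):
    """Score an element as the sum of its query-term weights."""
    return sum(weight(t, counts, doc_size) for t in query_terms)


# Hypothetical sample document and keyword query.
root = ET.fromstring(
    "<article><title>XML retrieval</title>"
    "<sec><p>focused XML retrieval ranks elements</p>"
    "<p>unrelated text here</p></sec></article>"
)
index = {}
propagate_terms(root, index)
doc_size = sum(index[root].values())

query = ["xml", "retrieval"]
best = max(index, key=lambda n: score(query, index[n], doc_size))
print(best.tag, round(score(query, index[best], doc_size), 3))
```

With this placeholder weight, small elements that consist mostly of query terms outrank their larger ancestors, which is one simple way to favour specific elements over the whole document. A different size normalisation would change the ranking.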
© 2014 Springer International Publishing Switzerland
Cite this paper
Berchiche-Fellag, S., Mezghiche, M. (2014). Searching XML Element Using Terms Propagation Method. In: Andreasen, T., Christiansen, H., Cubero, JC., Raś, Z.W. (eds) Foundations of Intelligent Systems. ISMIS 2014. Lecture Notes in Computer Science, vol 8502. Springer, Cham. https://doi.org/10.1007/978-3-319-08326-1_40
DOI: https://doi.org/10.1007/978-3-319-08326-1_40
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08325-4
Online ISBN: 978-3-319-08326-1
eBook Packages: Computer Science (R0)