Abstract
To date, attempts for applying syntactic information in the document-based retrieval model dominant have led to little practical improvement, mainly due to the problems associated with the integration of this kind of information into the model. In this article we propose the use of a locality-based retrieval model for reranking, which deals with syntactic linguistic variation through similarity measures based on the distance between words. We study two approaches whose effectiveness has been evaluated on the CLEF corpus of Spanish documents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alonso, M.A., Vilares, J., Darriba, V.M.: On the usefulness of extracting syntactic dependencies for text indexing. In: O’Neill, M., Sutcliffe, R.F.E., Ryan, C., Eaton, M., Griffith, N.J.L. (eds.) AICS 2002. LNCS (LNAI), vol. 2464, pp. 3–11. Springer, Heidelberg (2002)
Attar, R., Fraenkel, A.S.: Local feedback in full-text retrieval systems. Journal of the ACM 24(3), 397–417 (1977)
Buckley, C.: Implementation of the SMART information retrieval system. Technical report, Department of Computer Science, Cornell University (1985), Sources available in ftp://ftp.cs.cornell.edu/pub/smart
Croft, W.B.: Combining approaches to information retrieval. In: Croft, W.B. (ed.) Advances in Information Retrieval. Recent Research from the Center for Intelligent Information Retrieval. The Kluwer International Series on Information Retrieval, vol. 7, ch. 1, pp. 1–36. Kluwer Academic Publishers, Dordrecht (2000)
de Kretser, O., Moffat, A.: Effective document presentation with a localitybased similarity heuristic. In: Proc. of the 22nd annual international ACM SIGIR conference on Research and Development in Information Retrieval, Berkeley, California, USA, pp. 113–120. ACM Press, New York (1999)
de Kretser, O., Moffat, A.: Locality-based information retrieval. In: Roddick, J.F. (ed.) Proc. of 10th Australasian Database Conference (ADC 1999), Auckland, New Zealand, January 18- 21. Australian Computer Science Communications, vol. 21, pp. 177–188, Singapore. Springer, Heidelberg (1999)
Fox, E., Shaw, J.: Combination of multiple searches. In: Harman, D.K. (ed.) NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC-2), Gaithersburg, MD, USA, pp. 243–252 (1994), Department of Commerce, National Institute of Standards and Technology
Hull, D.A., Grefenstette, G., Schulze, B.M., Gaussier, E., Schütze, H., Pedersen, J.O.: Xerox TREC-5 site report: Routing, filtering, NLP, and Spanish tracks. In: Voorhees, E.M., Harman, D.K. (eds.) NIST Special Publication 500-238: The Fifth Text REtrieval Conference (TREC-5), Gaithersburg, MD, USA, pp. 167–180 (1997), Department of Commerce, National Institute of Standards and Technology
Jacquemin, C., Tzoukermann, E.: NLP for term variant extraction: synergy between morphology, lexicon and syntax. In: Natural Language Information Retrieval. Text, Speech and Language Technology, vol. 7, pp. 25–74. Kluwer Academic Publishers, Dordrecht (1999)
Kaszkiel, M., Zobel, J.: Effective ranking with arbitrary passages. Journal of the American Society of Information Science 52(4), 344–364 (2001)
Koster, C.H.: Head/modifier frames for information retrieval. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 420–432. Springer, Heidelberg (2004)
Lee, J.: Analyses of multiple evidence combination. In: Proc. of SIGIR 1997, Philadelphia, PA, USA, July 27-31, pp. 267–276. ACM Press, New York (1997)
Mitra, M., Buckley, C., Singhal, A., Cardie, C.: An analysis of statistical and syntactic phrases. In: Devroye, L., Chrisment, C. (eds.) Proc. of Computer-Aided Information Searching on the Internet (RIAO 1997), Montreal, Canada, pp. 200–214 (1997)
Perez-Carballo, J., Strzalkowski, T.: Natural language information retrieval: progress report. Information Processing and Management 36(1), 155–178 (2000)
Peters, C. (ed.): Results of the CLEF 2002 Cross-Language System Evaluation Campaign, Working Notes for the CLEF 2002 Workshop, Rome, Italy (Sept. 2002), CLEF official site http://www.clef-campaign.org
Rocchio, J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System - Experiments in Automatic Document Processing, pp. 313–323. Prentice-Hall, Englewood Cliffs (1971)
Saracevic, T., Kantor, P.: A study of information seeking and retrieving. III. Searchers, searches, overlap. Journal of the American Society for Information Science 39(3), 197–216 (1988)
Vilares, J., Alonso, M.A.: A grammatical approach to the extraction of index terms. In: Angelova, G., Bontcheva, K., Mitkov, R., Nicolov, N., Nikolov, N. (eds.) International Conference on Recent Advances in Natural, Borovets, Bulgaria, September. 2003, pp. 500–504 (2003)
Vilares, J., Alonso, M.A., Ribadas, F.J.: COLE experiments at CLEF 2003 Spanish monolingual track. In: To be published in Lecture Notes in Computer Science, Springer, Heidelberg (2004)
Vilares, J., Alonso, M.A., Ribadas, F.J., Vilares, M.: COLE experiments at CLEF 2002 Spanish monolingual track. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785, pp. 265–278. Springer, Heidelberg (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vilares, J., Alonso, M.A. (2004). Dealing with Syntactic Variation Through a Locality-Based Approach. In: Apostolico, A., Melucci, M. (eds) String Processing and Information Retrieval. SPIRE 2004. Lecture Notes in Computer Science, vol 3246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30213-1_36
Download citation
DOI: https://doi.org/10.1007/978-3-540-30213-1_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23210-0
Online ISBN: 978-3-540-30213-1
eBook Packages: Springer Book Archive