Abstract
This paper reports the work done by Linguateca to add Portuguese to two CLEF tracks: ad hoc information retrieval (IR) and question answering (QA). We begin with a brief description of Linguateca’s aims and of how we see CLEF from the standpoint of Portuguese language processing. We then comment on several interesting problems that emerged during our work, offer some suggestions for improvement, and finally raise some possibly controversial points for discussion.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Santos, D., Rocha, P. (2005). The Key to the First CLEF with Portuguese: Topics, Questions and Answers in CHAVE. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_80
DOI: https://doi.org/10.1007/11519645_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer Science