Cross-Lingual Question Answering Using Off-the-Shelf Machine Translation

Kisuh Ahn,
Beatrice Alex,
Johan Bos,
Tiphaine Dalmas,
Jochen L. Leidner &
…
Matthew B. Smillie

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3491)

Included in the following conference series:

Workshop of the Cross-Language Evaluation Forum for European Languages

Abstract

We show how to adapt an existing monolingual open-domain QA system to perform in a cross-lingual environment, using off-the-shelf machine translation software. In our experiments we use French and German as source languages and English as the target language. For answering factoid questions, our system achieves an accuracy of 16% (German to English) and 20% (French to English). The loss of correctly answered questions caused by the MT component is estimated at 10% for French and 15% for German. The accuracy of our system on correctly translated questions is 28% for German and 29% for French.
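
The setup described in the abstract can be pictured as a translation step placed in front of an unchanged English QA system, with accuracy reported both over all questions and over the subset whose translation was judged correct. The following is a minimal sketch under stated assumptions, not the authors' implementation: the names `answer_cross_lingual`, `Outcome`, `accuracy`, and `accuracy_on_correct_translations` are hypothetical, the translator and QA system are stand-in lookup tables, and the judged outcomes are toy values rather than results from the paper.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass
class Outcome:
    """Judged result for one source-language question (hypothetical record)."""
    answer_correct: bool       # was the returned answer judged correct?
    translation_correct: bool  # was the MT output judged a correct translation?


def answer_cross_lingual(
    question: str,
    translate: Callable[[str], str],
    answer_english: Callable[[str], Optional[str]],
) -> Optional[str]:
    """Translate the question into English, then reuse the monolingual QA system."""
    return answer_english(translate(question))


def accuracy(outcomes: Iterable[Outcome]) -> float:
    """Fraction of questions answered correctly (0.0 for an empty set)."""
    items = list(outcomes)
    if not items:
        return 0.0
    return sum(o.answer_correct for o in items) / len(items)


def accuracy_on_correct_translations(outcomes: Iterable[Outcome]) -> float:
    """Accuracy restricted to questions whose translation was judged correct."""
    return accuracy(o for o in outcomes if o.translation_correct)


# Stand-ins for the off-the-shelf MT software and the English QA system
# (assumptions for illustration only).
toy_translations = {"Wer schrieb Faust?": "Who wrote Faust?"}
toy_answers = {"Who wrote Faust?": "Goethe"}


def toy_translate(question: str) -> str:
    # Unknown questions pass through untranslated.
    return toy_translations.get(question, question)


def toy_answer(question: str) -> Optional[str]:
    # Unknown questions get no answer.
    return toy_answers.get(question)


# Toy judgements, not data from the paper.
toy_outcomes = [
    Outcome(answer_correct=True, translation_correct=True),
    Outcome(answer_correct=False, translation_correct=True),
    Outcome(answer_correct=False, translation_correct=False),
    Outcome(answer_correct=False, translation_correct=False),
]

print(answer_cross_lingual("Wer schrieb Faust?", toy_translate, toy_answer))
print(accuracy(toy_outcomes))
print(accuracy_on_correct_translations(toy_outcomes))
```

Keeping the two accuracy functions separate mirrors the two kinds of figures in the abstract: the first measures the whole pipeline, while the second factors out translation errors and so says more about the QA component on its own.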

Author information

Authors and Affiliations

University of Edinburgh, Scotland, UK
Kisuh Ahn, Beatrice Alex, Johan Bos, Tiphaine Dalmas, Jochen L. Leidner & Matthew B. Smillie

Editor information

Editors and Affiliations

ISTI-CNR, Area di Ricerca, Pisa, Italy
Carol Peters
Sheffield University, Sheffield, United Kingdom
Paul Clough
No Affiliations
Julio Gonzalo
Centre for Digital Video Processing & School of Computing, Dublin City University, Dublin 9, Ireland
Gareth J. F. Jones
German Institute for International and Security Affairs, Stiftung Wissenschaft und Politik (SWP), Ludwigkirchplatz 3-4, P.O. Box, 10719, Berlin, Germany
Michael Kluck
ITC-IRST, Trento, Italy
Bernardo Magnini

About this paper

Cite this paper

Ahn, K., Alex, B., Bos, J., Dalmas, T., Leidner, J.L., Smillie, M.B. (2005). Cross-Lingual Question Answering Using Off-the-Shelf Machine Translation. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_44

DOI: https://doi.org/10.1007/11519645_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer Science, Computer Science (R0)