Abstract
A minimal approach to Chinese factoid QA is described. It employs entity extraction software, template matching, and statistical ranking of candidate answers using five types of evidence; it does not use explicit word segmentation or Chinese syntactic analysis. This simple approach is more portable to other Asian languages, and may serve as a base on which more precise techniques can be built to improve results. Applied to the NTCIR-5 monolingual environment, it delivers moderate top-1 accuracy and MRR of .295 and .3381 (supported answers) and .41 and .4998 (including unsupported answers), respectively. Applied to English-Chinese cross-language QA (CLQA) with three different forms of English-Chinese question translation, it attains top-1 accuracy and MRR of .155 and .2094 (supported) and .215 and .2932 (including unsupported), about 52% to 62% of monolingual effectiveness. CLQA improvements from successively different forms of question translation are also demonstrated.
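The two measures quoted above, and the stated ratio of cross-lingual to monolingual effectiveness, can be illustrated with a minimal sketch. It assumes the standard definitions of top-1 accuracy and mean reciprocal rank (MRR); the abstract does not state a rank cutoff, so none is applied. The function names and the toy rank list are hypothetical, while the `reported` figures are copied from the abstract.

```python
def top1_accuracy(ranks):
    """Fraction of questions whose first correct answer is at rank 1.

    `ranks` holds, per question, the rank of the first correct answer,
    or None when no correct answer was returned.
    """
    return sum(1 for r in ranks if r == 1) / len(ranks)


def mean_reciprocal_rank(ranks):
    """Average of 1/rank over all questions; unanswered questions score 0."""
    return sum(1.0 / r for r in ranks if r is not None) / len(ranks)


# Hypothetical toy data: four questions, one of them unanswered.
toy_ranks = [1, 2, None, 1]

# (top-1 accuracy, MRR) pairs as reported in the abstract.
reported = {
    "supported": {"mono": (0.295, 0.3381), "clqa": (0.155, 0.2094)},
    "unsupported": {"mono": (0.41, 0.4998), "clqa": (0.215, 0.2932)},
}


def clqa_ratios(scores):
    """Cross-lingual score divided by monolingual score, per setting."""
    out = {}
    for setting, s in scores.items():
        (m_top1, m_mrr), (c_top1, c_mrr) = s["mono"], s["clqa"]
        out[setting] = (c_top1 / m_top1, c_mrr / m_mrr)
    return out


if __name__ == "__main__":
    print(top1_accuracy(toy_ranks), mean_reciprocal_rank(toy_ranks))
    for setting, (r_top1, r_mrr) in clqa_ratios(reported).items():
        print(f"{setting}: top-1 ratio {r_top1:.1%}, MRR ratio {r_mrr:.1%}")
```

The four ratios computed from the reported figures all fall between roughly 52% and 62%, matching the range given in the abstract.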
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kwok, KL., Deng, P. (2006). Chinese Question-Answering: Comparing Monolingual with English-Chinese Cross-Lingual Results. In: Ng, H.T., Leong, MK., Kan, MY., Ji, D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11880592_19
DOI: https://doi.org/10.1007/11880592_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45780-0
Online ISBN: 978-3-540-46237-8
eBook Packages: Computer Science (R0)