[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Technical issues of cross-language information retrieval: a review

Published: 01 May 2005 Publication History

Abstract

This paper reviews state-of-the-art techniques and methods for enhancing effectiveness of cross-language information retrieval (CLIR). The following research issues are covered: (1) matching strategies and translation techniques, (2) methods for solving the problem of translation ambiguity, (3) formal models for CLIR such as application of the language model, (4) the pivot language approach, (5) methods for searching multilingual document collection, (6) techniques for combining multiple language resources, etc.

References

[1]
Adriani, M. (2001). Ambiguity problem in multilingual information retrieval. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp. 156-165). Berlin: Springer-Verlag.]]
[2]
Adriani, M. (2002). English-Dutch CLIR using query translation techniques. In C. Peters et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 219-225). Berlin: Springer.]]
[3]
Ballesteros, L. A. (2000). Cross-language retrieval via transitive translation. In W. B. Croft et al. (Eds.), Advances in information retrieval: recent research from the center for intelligent information retrieval (pp. 203-234). Boston, MA: Kluwer.]]
[4]
Ballesteros, L., & Croft, W. B. (1997). Phrasal translation and query expansion techniques for cross-language information retrieval. In Proceedings of the 20th ACM SIGIR conference on research and development in information retrieval (pp. 84-91).]]
[5]
Ballesteros, L., & Croft, W. B. (1998). Resolving ambiguity for cross-language retrieval. In Proceedings of the 21st ACM SIGIR conference on research and development in information retrieval (pp. 64-71).]]
[6]
Belkin, N. J., Kantor, P., Fox, E. A., & Shaw, J. A. (1995). Combining the evidence of multiple query representations for information retrieval, Information Processing & Management, 31, 431-448.]]
[7]
Berry, M. W., & Young, P. G. (1995). Using latent semantic indexing for multilanguage information retrieval. Computers and the Humanities, 29(6), 413-429.]]
[8]
Bian, G. W., & Lin, C.C. (2001). Trans-EZ at NTCIR-2: synset co-occurrence method for English-Chinese cross-lingual information retrieval. In Proceedings of the second NTCIR workshop on research in Chinese & Japanese text retrieval and text summarization. Tokyo: National Institute of Informatics. Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[9]
Boughanem, M., & Nassr, N. (2001). Mercure at CLEF-1. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp. 202-209). Berlin: Springer-Verlag.]]
[10]
Boughanem, M., Chrisment, C., & Nassr, N. (2002). Investigation on disambiguation in CLIR: aligned corpus and bi-directional translation-based strategies. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 158-168). Berlin: Springer-Verlag.]]
[11]
Braschler, M., & Schääuble, P. (2000). Using corpus-based approaches in a system for multilingual information retrieval. Information Retrieval, 3, 273-284.]]
[12]
Braschler, M., & Schääuble, P. (2001). Experiments with the Eurospider retrieval system for CLEF 2000. In C. Peters (Ed.), Crosslanguage information retrieval evaluation. LNCS (2069, pp. 140-148). Berlin: Springer-Verlag.]]
[13]
Brown, P. F., Della Pietra, S. A., Della Pietra, V. J., & Mercer, R. L. (1993). The mathematics of statistical machine translation: parameter estimation. Computational Linguistics, 19(2), 263-311.]]
[14]
Buckley, C., Allan, J., & Salton, G. (1994). Automatic routing and ad-hoc retrieval using SMART: TREC2. In Proceedings of TREC-2. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[15]
Buckley, C., Walz, J., Mitra, M., & Cardie, C. (1998). Using clustering and superconcepts within SMART: TREC-6. In Proceedings of TREC-6. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[16]
Callan, J. P., Croft, W. B., & Broglio, J. (1995). TREC and TIPSTER experiments with INQUERY. Information Processing & Management, 31(3), 327-343.]]
[17]
Capstick, J., Diagne, A. K., Erbach, G., Uszkoreit, H., Leisenberg, A., & Leisenberg, M. (2000). A system for supporting cross-lingual information retrieval, Information Processing & Management, 36, 275-289.]]
[18]
Chen, A. (2002). Multilingual information retrieval using English and Chinese queries. In C. Peters et al. (Eds.), Evaluation of crosslanguage information retrieval systems LNCS (2406, pp. 44-58). Berlin: Springer-Verlag.]]
[19]
Chen, A., & Gey, F. C. (2003). Experiments on cross-language and patent retrieval at NTCIR-3 workshop. In Proceedings of the third NTCIR workshop on research in information retrieval, automatic text summarization and question answering. Tokyo: National Institute of Informatics, Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[20]
Chen, A., Gey, F. C., Kishida, K., Jiang, H., & Liang, Q. (1999). Comparing multiple methods for Japanese and Japanese-English text retrieval. In Proceedings of the first NTCIR workshop on research in Japanese text retrieval and term recognition. Tokyo: National Institute of Informatics. Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[21]
Cooper, W. S., Chen, A., & Gey, F. C. (1994). Full text retrieval based on probabilistic equations with coefficients fitted by logistic regression. In Proceedings of TREC-2. Gaithersburg, MD: National Institute of Standards and Technology. Available: http:// trec.nist.gov/pubs/.]]
[22]
Darwish, K., & Oard, D. W. (2003). CLIR experiments at Maryland for TREC-2002: evidence combination for Arabic-English retrieval. In Proceedings of TREC-2002. MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[23]
Davis, M. (1997). New experiments in cross-language text retrieval at NMSU's Computing Research Lab. In Proceedings of TREC-5. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[24]
Davis, M. W. (1998). On the effective use of large parallel corpora in cross-language text retrieval. In G. Grefenstette (Ed.), Cross-language information retrieval (pp. 12-22). Boston, MA: Kluwer.]]
[25]
Davis, M., & Dunning, T. (1995). A TREC evaluation of query translation methods for multi-lingual text retrieval. In Proceedings of TREC-4. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[26]
Davis, M. W., & Ogden, W. C. (1997). QUILT: implementing a Large-scale cross-language text retrieval system. In Proceedings of the 20th ACM SIGIR conference on research and development in information retrieval (pp. 92-98).]]
[27]
Davis, M. W., & Ogden, W. (2000). Towards universal text retrieval: Tipster text retrieval research at New Mexico State University. Information Retrieval, 3, 339-356.]]
[28]
Diekema, A., Oroumchian, F., Sheridan, P., & Liddy, E. D. (1999). TREC-7 evaluation of conceptual interlingua document retrieval (CINDOR) in English and French. In Proceedings of TREG-7. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[29]
Dumais, S. T., Landauer, T. K., & Littman, M. L. (1996). Automatic cross-linguistic information retrieval using latent semantic indexing. In Proceedings of the 19th ACM SIGIR conference on research and development in information retrieval (pp. 16-23).]]
[30]
Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1), 61-74.]]
[31]
Federico, M., & Bertoldi, N. (2002). Statistical cross-language information retrieval using n-best query translations. In Proceedings of the 25th ACM SIGIR conference on research and development in information retrieval (pp. 167-174).]]
[32]
Fox, C. (1990). A stop list for general text. ACM-SIGIR Forum, 24, 19-35.]]
[33]
Frakes, W. B. (1992). Stemming algorithms In W. B. Frakes & R. Baeza-Yates (Eds.), Information retrieval: data structures & algorithms (pp. 131-160). Englewood Cliffs, NJ: PTR Prentice-Hall.]]
[34]
Franz, M., & Scott McCarley, J. (2003). Arabic information retrieval at IBM. In Proceedings of TREC-2002. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[35]
Franz, M., Scott McCarley, J., & Roukos, S. (1999). Ad hoc and multilingual information retrieval at IBM. In Proceedings of TREC-7. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[36]
Franz, M., Scott McCarley, J., & Todd Ward, R. (2000). Ad hoc, cross-language and spoken document information retrieval at IBM. In Proceedings of TREC-8. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/ pubs/.]]
[37]
Fraser, A., Xu, J., & Weischedel, R. (2003). TREC 2002 cross-lingual retrieval at BBN. In Proceedings of TREC-2002. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[38]
Fujii, A., & Ishikawa, T. (2001). Japanese/English cross-language information retrieval: exploration of query translation and transliteration. Computers and the Humanities, 35, 389-420.]]
[39]
Gao, J., Nie, J. Y., Zhang, J., Xun, E., Su, Y., Zhou, M., & Huang, C. (2001a). TREC-9 CLIR experiments at MSRCN. In Proceedings of TREC-9. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[40]
Gao, J., Nie, J. Y., Xun, E. X., Zhang, J., Zhou, M., & Huang, C. (2001b). Improving query translation for cross-language information retrieval using statistical models. In Proceedings of 24th ACM SIGIR conference on research and development in information retrieval (pp. 96-104).]]
[41]
Gao, J., Nie, J. Y., He, H., Chen, W., & Zhou, M. (2002). Resolving query translation ambiguity using a decaying co-occurrence model and syntactic dependence relations. In Proceedings of 25th ACM SIGIR conference on research and development information retrieval (pp. 183-190).]]
[42]
Gey, F. C. (2001). Research to improve cross-language retrieval: position paper for CLEF. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp. 83-88). Berlin: Springer-Verlag.]]
[43]
Gey, F. C., Jiang, H., Chen, A., & Larson, R. R. (1999). Manual queries and machine translation in cross-language retrieval and interactive retrieval with Cheshire II at TREC-7. In Proceedings of TREC-7. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[44]
Gey, F., Kando, N., & Peters, C. (2002). Cross-language information retrieval: a research roadmap: In Summary of a workshop at SIGIR-2002. Available: http://ucdata.berkeley.edu/sigir-2002/.]]
[45]
Gollins, T., & Sanderson, M. (2001). Improving cross language information retrieval with triangulated translation. In Proceedings of the 24th ACM SIGIR conference on research and development in information retrieval (pp. 90-95).]]
[46]
Hedlund, T., Keskustalo, H., Pirkola, A., Sepponen, M., & Jäärvelin, K. (2001a). Bilingual tests with Swedish, Finish, and German queries: dealing with morphology, compound words, and query structure. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp. 210-223). Berlin: Springer-Verlag.]]
[47]
Hedlund, T., Pirkola, A., & Jäärvelin, K. (2001b). Aspects of Swedish morphology and semantics from the perspective of mono- and cross-language information retrieval, Information & Processing Management, 37, 147-161.]]
[48]
Hedlund, T., Keskustalo, H., Pirkola, A., Airio, E., & Jäärvelin, K. (2002). Utaclir@CLEF2001: effects of compound splitting and n- gram techniques. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 118-136). Berlin: Springer-Verlag.]]
[49]
Hiemstra, D. (1998). A linguistically motivated probabilistic model of information retrieval. In C. Nikolaou & C. Stephanidis (Eds.), Research and advanced technology for digital libraries. LNCS (1513, pp. 569-584). Berlin: Springer-Verlag.]]
[50]
Hiemstra, D., & Kraaij, W. (1999). Twenty-one at TREC-7: ad-hoc and cross-language track. In Proceedings of TREC-7. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[51]
Hiemstra, D., Kraaij, W., Pohlmann, R., & Westerveld, T. (2001). Translation resources, merging strategies, and relevance feedback for cross-language information retrieval. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp. 102-115). Berlin: Springer-Verlag.]]
[52]
Hull, D. (1998). A weighted Boolean model for cross-language text retrieval. In G. Grefenstette (Ed.), Cross-language information retrieval (pp. 119-136). Boston, MA: Kluwer.]]
[53]
Hull, D. A., & Grefenstette, G. (1996). Querying across languages: a dictionary-based approach to multilingual information retrieval. In Proceedings of the 19th ACM SIGIR conference on research and development in information retrieval (pp. 49-57).]]
[54]
Ide, N., & Vééronis, J. (1998). Introduction to the special issue on word sense disambiguation: the state of the art. Computational Linguistics, 24(1). 1-40.]]
[55]
Jones, G. J. F., & Lam-Adesina, A. M. (2002). Experiments with machine translation for bilingual retrieval. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 59-77). Berlin: Springer-Verlag.]]
[56]
Kim, T., Sim, C. M., Yuh, S., Jung, H., Kim, Y. K., Choi, S. K., Park, D. I., & Choi, K. S. (1999). FromTo-CLIR: web-based natural language interface for cross-language information retrieval. Information Processing & Management, 35, 559-586.]]
[57]
Knight, K., & Graehl, J. (1998). Machine transliteration. Computational Linguistics, 24(4), 599-612.]]
[58]
Kraaij, W., Pohlmann, R., & Hiemstra, D. (2000). Twenty-one at TREC-8: using language technology for information retrieval. In Proceedings of TREC-8. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[59]
Kraaij, W. (2002). TNO at CLEF-2001: comparing translation resources. In C. Peters et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 78-93). Berlin: Springer-Verlag.]]
[60]
Kwok, K. L. (1996). A network approach to probabilistic information retrieval. ACM Transaction on Information systems, 12, 325-353.]]
[61]
Kwok, K. L., Grunfeld, N., Dinstl, N., & Chan, M. (2001). TREC-9 cross language, web and question-answering track experiments using PIRCS. In Proceedings of TREC-9. Gaithersburg, MD: National Institute of Standards and Technology. Available: http:// trec.nist.gov/pubs/.]]
[62]
Landauer, T., & Littman, M. L. (1990). A statistical method for language-independent representation of the topical content of text segments. In Proceedings of the 6th annual conference of the UW centre for the new oxford English dictionary and text research. Available: http://www.cs.duke.edu/~mlittman/docs/x-lang.ps.]]
[63]
Lavrenko, V., Choquette, M., & Croft, W. B. (2002). Cross-lingual relevance models. In Proceedings of the 25th ACM SIGIR conference on research and development in information retrieval (pp. 175-182).]]
[64]
Lin. C. J., Lin, W. C., Bian, G. W., & Chen, H. H (1999). Description of the NTU Japanese-English cross-lingual information retrieval system used for NTCIR workshop. In Proceedings of the first NTCIR workshop on research in Japanese text retrieval and term recognition. Tokyo: National Institute of Informatics. Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[65]
Lin, W. C., & Chen, H. H. (2003). Description of NTU approach to NTCIR3 multilingual information retrieval, In Proceedings of the third NTCIR workshop on research in information retrieval automatic text summarization and question answering. Tokyo: National Institute of Informatics. Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[66]
Littman, M. L., Dumais, S. T., & Landauer, T. K. (1998). Automatic cross-language information retrieval using latent semantic indexing. In G. Grefenstette (Ed.), Cross-language information retrieval (pp. 51-62). Boston, MA: Kluwer.]]
[67]
Maeda, A., Sadat, F., Yoshikawa, M., & Uemura, S. (2000). Query term disambiguation for Web cross-language information retrieval using a search engine. In Proceeings of the 5th international workshop information retrieval with Asian languages (pp. 25-32).]]
[68]
Manning, C. D., & Shüütze, H. (1999). Foundations of Statistical natural language processing. Cambridge, MA: MIT Press.]]
[69]
McNamee, P., & Mayfield, J. (2002a). Comparing cross-language query expansion techniques by degrading translation resources. In Proceedings of the 25th ACM SIGIR conference on research and development in information retrieval(pp.159-166).]]
[70]
McNamee, P., & Mayfield, J. (2002b). JHU/APL experiments at CLEF: translation resources and score normalization. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 193-208). Berlin: Springer-Verlag.]]
[71]
McNamee, P., Mayfield, J., & Piatko, C. (2001). A language-independent approach to European text retrieval. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp 129-139). Berlin: Springer-verlang.]]
[72]
Miller, D. R. H., Leek, T., & Schwartz, R. M. (1999a). BBN at TREC7: using hidden Markov models for information retrieval. In Proceedings of TREC-7. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[73]
Miller, D. R. H., Leek, T., & Schwartz, R. M. (1999b). A hidden Markov model information retrieval system. In Proceedings of the 22nd ACM SIGIR conference on research and development in information retrieval (pp. 214-221).]]
[74]
Molina-Salgado, H., Moulinier, I., Knudson, M., Lund, E., & Sekhon, K. (2002). Thomson legal and regulatory at CLEF 2001: monolingual and bilingual experiments. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 226-234). Berlin: Springer-Verlag.]]
[75]
Mori, T., Kokubu, T., & Tanaka, T. (2001). Cross-lingual information retrieval based on LSI with multiple word spaces. In Proceeding of 2nd NTCIR workshop meeting on evaluation of Chinese & Japanese text retrieval and text summarization. Tokyo: National Institute of Informatics. Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[76]
Moulinier, I., & Molina-Salgado, H. (2002). Thomson legal and regulatory experiments for CLEF2002. In C. Peters (Ed.), Working notes for the CLEF-2002 Workshop (pp. 91-96).]]
[77]
Nie, J. Y. (1999). TREC-7 CLIR using a probabilistic translation model. In Proceedings of TREC-7. Gaithersburg, MD: National Institute of Standards and Technology. Available: http:/trec.nist.gov/pubs/.]]
[78]
Nie, J. Y. (2000). CLIR using a probabilistic translation model based on Web documents. In Proceedings of TREC-8. Gaithersburg, MD: National Institute of Standards and Technology. Available: http:trec.nist.gov/pubs/.]]
[79]
Nie, J. Y., & Jin, F. (2003). A multilingual approach to multilingual retrieval. In C, Peters, et al. (Eds.). Advances in cross-language information retrieval. LNCS (2785. pp, 101-110). Berlin: Springer-Verlag.]]
[80]
Nie, J. Y., & Simard, M. (2002). Using statistical translation model for bilingual IR. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 137-150). Berlin: Springer-Verlag.]]
[81]
Nie, J. Y., Simard, M., Isabelle, P., & Durand, R. (1999). Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web. In Proceedings of the 22nd ACM SIGIR conference on research and development in information retrieval (pp. 74-81).]]
[82]
Nie, J. Y., Simard, M., & Foster, G. (2001). Multilingual information retrieval based on parallel texts from the Web. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp. 188-201). Berlin: Springer-Verlag.]]
[83]
Oard, D. W. (2001). Evaluating interactive cross-language information retrieval: document selection. In C. Peters (Ed.), Cross-language information retrieval evaluation. LNCS (2069, pp. 57-71). Berlin: Springer-Verlag.]]
[84]
Oard, D. W., & Diekema, A. R. (1998). Cross-language information retrieval. Annual Review of Information Science and Technology. 33, 223-256.]]
[85]
Oard, D. W., & Dorr, B. J. (1996). A survey of multilingual text retrieval. College Park, MD: University of Maryland, Institute for Advanced Computer Studies. Available: http://www cs.umd.edu/TRs/authors/Douglas_W_Oard.html.]]
[86]
Oard, D. W., & Gonzalo, J. (2002). The CLEF 2001 interactive track. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 308-319). Berlin: Springer-Verlag.]]
[87]
Oard, D. W., & Hackett, P. (1998). Document translation for cross-language text retrieval at the University of Maryland. In Proceedings of TREC-6. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[88]
Oard, D. W., Levow, G. A., & Cabezas, C. I. (2001). CLEF experiments at Maryland: statistical stemming and backoff translation. In C. Peters (Ed.). Cross-language information retrieval evaluation. LNCS (2069, pp. 176-187). Berlin: Springer-Verlag.]]
[89]
Peters, C., & Sheridan, P. (2001). Multilingual information access. In M. Agosti, F. Crestani, & G. Pasi (Eds.), Lectures on information retrieval. LCSN (1980, pp. 51-80). Berlin: Springer-Verlag.]]
[90]
Pirkola, A. (1998). The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval. In Proceedings of the 21st ACM SIGIR conference on research and development in information retrieval (pp. 55-63).]]
[91]
Pirkola, A., Hedlund, T., Keskustalo, H., & Jäärvelin, K. (2001). Dictionary-based cross-language information retrieval: problems, methods. and research findings, Information Retrieval. 4, 209-230]]
[92]
Ponte, J. M., & Croft, W. B. (1998). A language modeling approach to information retrieval. In Proceedings of the 21st ACM SIGIR conference on research and development in information retrieval (pp. 275-281).]]
[93]
Powell, A. L., French, J. C., Callan, J., Connell, M., & Viles, C. L. (2000). The impact of database selection on distributed searching. In Proceeding, of the 23rd ACM SIGIR conference research and development in information retrieval (pp. 232-239).]]
[94]
Porter, M. F. (1980). An algorithm for suffix stripping. Program, 14(3), 130-137.]]
[95]
Porter, M. F. (2001). Snowball: a language for stemming algorithms. Available: http://snowball.tartarus.org/texts/introduction.html. 2004/Mar./10.]]
[96]
Qu, Y., Grefenstette, G., & Evans, D. A. (2003). Resolving translation ambiguity using monolingual corpora. In C. Peters, et al. (Eds.). Advances in cross-language information retrieval. LNCS (2785, pp. 223-241). Berlin: Springer-Verlag.]]
[97]
Rehder, B., Littman, M. L., Dumais, S., & Landauer, T. K. (1998). Automatic 3-language cross-language information retrieval with latent semantic indexing. In Proceedings of the TREC-6. Gaithersburg, MD: National Institute of Standards and Technology, Available: http://trec.nist.gov/pubs/.]]
[98]
Resnik, P. (1999). Mining the web for bilingual text. In 37th annual meeting of the association for computational linguistics (ACL'99), College Park, MD.]]
[99]
Robertson, S. E, & Sparek Jones, K. (1976). On relevance probabilistic indexing and information retrieval. Journal of the American Society for Information Science. 27(3), 129-146.]]
[100]
Robertson, S. E., Walker, S., Jones, S., Hancock-Beaulieu, M. M., & Gatford, M. (1995). Okapi at TREC-3. In Overview of TREC-3. Gaithersburg, MD: National Institute of Standards and Technology. Available: http://trec.nist.gov/pubs/.]]
[101]
Sadat, F., Maeda, A., Yoshikawa, M., & Uemura, S. (2002). Query expansion techniques for the CLEF Bilingual track. In C. Peters, et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 177-184). Berlin: Springer-Verlag.]]
[102]
Sakai, T., Koyama, M., Suzuki, M., & Manabe, T. (2003). Toshiba KIDS at NTCIR-3: Japanese and English-Japanese IR. In Proceedings of the third NTCIR workshop on researeh in information retrieval, automatic text summarization and question answering. Tokyo: National Institute of Informatics. Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[103]
Salton, G. (1970). Automatic processing of foreign language documents. Journal of the American Society for Information Science, 21(3), 187-194.]]
[104]
Savoy, J. (1999). A stemming procedure and stopword list for general French corpora. Journal of the American Society for information Science, 5(10), 944-952.]]
[105]
Savoy, J. (2002). Report on CLEF-2001 experiments: effective combined query-translation approach. In C. Peters et al. (Eds.), Evaluation of cross-language information retrieval systems. LNCS (2406, pp. 27-43). Berlin: Springer-Verlag.]]
[106]
Savoy, J. (2003a). Report on CLEF 2002 experiments: combining multiple sources of evidence. In C. Peters et al. (Eds.), Advances in cross-language information retrieval. LNCS (2785, pp. 66-90). Berlin: Springer-Verlag.]]
[107]
Savoy, J. (2003b). Cross-language information retrieval: experiments based on CLEF 2000 corpora, Information Processing & Management, 39, 75-115.]]
[108]
Seo, H. C., Kim, S. B., Kim, B. I., Rim, H. C., & Lee, S. Z. (2003). KUNLP system for NTCIR-3 English-Korean cross-language information retrieval. In Proceedings of the third NTCIR workshop on research in information retrieval, automatic text summarization and question answering. Tokyo: National Institute of Informatics. Available: http://research.nii.ac.jp/ntcir/workshop/.]]
[109]
Sheridan, P., & Ballerini, J. P. (1996). Experiments in multilingual information retrieval using the SPIDER system. In Proceedings of the 19th ACM SIGIR conference on research and development in information retrieval (pp. 58-64).]]
[110]
Sperer, R., & Oard, D. W. (2000). Structured translation for cross-language information retrieval. In Proceedings of the 23rd ACM SIGIR conferenee on research and development in information retrieval (pp. 120-127).]]
[111]
Stalls, B., & Knight, K. (1998). Translating names and technical terms in Arabic text. In Proceedings of the COLING/ACL workshop on computational approaches to semantic languages. Available: http://www.isi.edu/natural-language/people/knight.html.]]
[112]
Turtle, H. R., & Croft, W. B. (1991). Evaluation of an inference network-based retrieval model. ACM Transactions on Information Systems, 9(3), 187-222.]]
[113]
Xu, J., Weischedel, R., & Nguyen, C. (2001). Evaluating a probabilistic model for cross-lingual information retrieval. In Proceedings of the 24th ACM SIGIR conference on research and development in information retrieval (pp. 105-110).]]
[114]
Yamabana, K., Muraki, K., Doi, S., & Kamei, S. (1998). A language conversion front-end for cross-language information retrieval. In G. Grefenstette (Ed.), Cross-language information retrieval (pp. 93-104). Boston, MA: Kluwer.]]
[115]
Yang, Y., Carbonell, J., Brown, R. D., & Frederking, R. E. (1998). Translingual information retrieval: learning from bilingual corpora. Artificial Intelligence, 103, 323-345.]]

Cited By

View all
  • (2023)Soft Prompt Decoding for Multilingual Dense RetrievalProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591769(1208-1218)Online publication date: 19-Jul-2023
  • (2019)Design and Development of a Large Cross-Lingual Plagiarism Corpus for Urdu-English Language PairScientific Programming10.1155/2019/29620402019Online publication date: 17-Mar-2019
  • (2016)Arabic Cross-Language Information RetrievalACM Transactions on Asian and Low-Resource Language Information Processing10.1145/278921015:3(1-44)Online publication date: 28-Jan-2016
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Information Processing and Management: an International Journal
Information Processing and Management: an International Journal  Volume 41, Issue 3
Special issue: Cross-language information retrieval
May 2005
303 pages

Publisher

Pergamon Press, Inc.

United States

Publication History

Published: 01 May 2005

Author Tags

  1. cross-language information retrieval
  2. language model
  3. machine translation
  4. word sense disambiguation

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Soft Prompt Decoding for Multilingual Dense RetrievalProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591769(1208-1218)Online publication date: 19-Jul-2023
  • (2019)Design and Development of a Large Cross-Lingual Plagiarism Corpus for Urdu-English Language PairScientific Programming10.1155/2019/29620402019Online publication date: 17-Mar-2019
  • (2016)Arabic Cross-Language Information RetrievalACM Transactions on Asian and Low-Resource Language Information Processing10.1145/278921015:3(1-44)Online publication date: 28-Jan-2016
  • (2015)Multilingual information retrieval in the language modeling frameworkInformation Retrieval10.1007/s10791-015-9255-118:3(246-281)Online publication date: 1-Jun-2015
  • (2015)Cross-language person-entity linking from 20 languagesJournal of the Association for Information Science and Technology10.1002/asi.2325466:6(1106-1123)Online publication date: 1-Jun-2015
  • (2013)Finding synonyms and other semantically-similar terms from coselection dataProceedings of the First Australasian Web Conference - Volume 14410.5555/2527208.2527213(35-42)Online publication date: 29-Jan-2013
  • (2013)Developing a Multilingual Application Using Linked DataProceedings of the 5th International Conference on Computational Collective Intelligence. Technologies and Applications - Volume 808310.1007/978-3-642-40495-5_13(120-129)Online publication date: 11-Sep-2013
  • (2013)Cross-lingual random indexing for information retrievalProceedings of the First international conference on Statistical Language and Speech Processing10.1007/978-3-642-39593-2_15(164-175)Online publication date: 29-Jul-2013
  • (2011)Indexing and weighting of multilingual and mixed documentsProceedings of the South African Institute of Computer Scientists and Information Technologists Conference on Knowledge, Innovation and Leadership in a Diverse, Multidisciplinary Environment10.1145/2072221.2072240(161-170)Online publication date: 3-Oct-2011
  • (2010)A non-linear semantic mapping technique for cross-language sentence matchingProceedings of the 7th international conference on Advances in natural language processing10.5555/1884371.1884381(57-66)Online publication date: 16-Aug-2010
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media