[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1244002.1244184acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

Translation disambiguation in web-based translation extraction for English-Chinese CLIR

Published: 11 March 2007 Publication History

Abstract

Dictionary based translation is a traditional approach in use by cross-language information retrieval systems. However, significant performance degradation is often observed when queries contain words that do not appear in the dictionary. This is called the Out Of Vocabulary (OOV) problem. In recent years, web-based translation extraction was shown to be one of the more effective approaches to the solution of this problem. Previous work focussed on selecting the correct translation from a set of web extracted terms. The common methods for translation selection for web-based translation always rely on word frequency calculation but the results are not always satisfactory. In this paper we present our approach to the selection of terms in a more accurate manner. Our experiments show improvement in translation accuracy over other commonly used approaches.

References

[1]
Chen A. and Gey F., Experiments on Cross-language and Patent retrieval at NTCIR3 Workshop. In Proceedings of the 3rd NTCIR Workshop, Japan.
[2]
Cheng, P.-J., J.-W. Teng, et al. (2004). Cross-language information retrieval: Translating unknown queries with web corpora for cross-language information retrieval. Proceedings of the 27th annual international conference on Research and development in information retrieval.
[3]
Gao, J., J.-Y. Nie, et al. (2001). Improving query translation for cross-language information retrieval using statistical models. SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, New Orleans, Louisiana, United States, ACM Press.
[4]
Jang, M.-G., Myaeng, S. H., and Park, S. Y. (1999). Using mutual information to resolve query translation ambiguities and query term weighting. Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics. College Park, Maryland, Association for Computational Linguistics.
[5]
Kishida K., Chen K., and Lee S., et al. (2005) Overview of CLIR Task at the Fifth NTCIR Workshop. Proceedings of the Fifth NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access
[6]
Maeda, A., Sadat, F. and Yoshikawa, M. et al. (2000). Query term disambiguation for Web cross-language information retrieval using a search engine. Proceedings of the fifth international workshop on Information retrieval with Asian languages. Hong Kong, China, ACM Press.
[7]
Pirkola A., Hedlund T., Keskustalo H. and Järvelin K. (2001). Dictionary-Based Cross-Language Information Retrieval: Problems, Methods, and Research Findings. Information retrieval 4:209--230
[8]
Silva, JF and Lopes, JGP (1999), A Local Maxima Method and a Fair Dispersion Normalization for Extracting multiword units, In Proceedings of the 6th meeting on the mathematics of language.
[9]
Wikipedia. (2006). "Mutual information." from http://en.wikipedia.org/wiki/Mutual_information.
[10]
Zhang, Y., and Vines, P (2004). Using the web for automated translation extraction in cross-language information retrieval. Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. Sheffield, United Kingdom, ACM Press.

Cited By

View all
  • (2019)The Use of Ontology in Retrieval: A Study on Textual, Multilingual and Multimedia RetrievalIEEE Access10.1109/ACCESS.2019.2897849(1-1)Online publication date: 2019
  • (2012)Mining a multilingual association dictionary from Wikipedia for cross-language information retrievalJournal of the American Society for Information Science and Technology10.1002/asi.2269663:12(2474-2487)Online publication date: 13-Nov-2012
  • (2011)Automatic English-Chinese name translation by using Web-Mining and phonetic similarity2011 IEEE International Conference on Information Reuse & Integration10.1109/IRI.2011.6009560(283-287)Online publication date: Aug-2011
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '07: Proceedings of the 2007 ACM symposium on Applied computing
March 2007
1688 pages
ISBN:1595934804
DOI:10.1145/1244002
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 March 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CLIR
  2. OOV problem
  3. cross-language information retrieval
  4. query translation
  5. translation disambiguation

Qualifiers

  • Article

Conference

SAC07
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Upcoming Conference

SAC '25
The 40th ACM/SIGAPP Symposium on Applied Computing
March 31 - April 4, 2025
Catania , Italy

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)1
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)The Use of Ontology in Retrieval: A Study on Textual, Multilingual and Multimedia RetrievalIEEE Access10.1109/ACCESS.2019.2897849(1-1)Online publication date: 2019
  • (2012)Mining a multilingual association dictionary from Wikipedia for cross-language information retrievalJournal of the American Society for Information Science and Technology10.1002/asi.2269663:12(2474-2487)Online publication date: 13-Nov-2012
  • (2011)Automatic English-Chinese name translation by using Web-Mining and phonetic similarity2011 IEEE International Conference on Information Reuse & Integration10.1109/IRI.2011.6009560(283-287)Online publication date: Aug-2011
  • (2010)A review on the cross-lingual information retrieval2010 International Conference on Information Retrieval & Knowledge Management (CAMP)10.1109/INFRKM.2010.5466886(353-357)Online publication date: Mar-2010
  • (2010)A Hybrid Chinese Information Retrieval ModelActive Media Technology10.1007/978-3-642-15470-6_28(267-276)Online publication date: 2010
  • (2008)A Hybrid Technique for English-Chinese Cross Language Information RetrievalACM Transactions on Asian Language Information Processing (TALIP)10.1145/1362782.13627847:2(1-35)Online publication date: 1-Apr-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media