[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1073012.1073032dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

A machine learning approach to the automatic evaluation of machine translation

Published: 06 July 2001 Publication History

Abstract

We present a machine learning approach to evaluating the well-formedness of output of a machine translation system, using classifiers that learn to distinguish human reference translations from machine translations. This approach can be used to evaluate an MT system, tracking improvements over time; to aid in the kind of failure analysis that can help guide system development; and to select among alternative output strings. The method presented is fully automated and independent of source language, target language and domain.

References

[1]
Alshawi, H., S. Bangalore, and S. Douglas. 1998. Automatic acquisition of hierarchical transduction models for machine translation. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics, Montreal Canada, Vol. 1:41--47.
[2]
Bangalore, S., O. Rambow, and S. Whittaker. 2000. Evaluation Metrics for Generation. In Proceedings of the International Conference on Natural Language Generation (INLG 2000), Mitzpe Ramon, Israel. 1-13.
[3]
Chickering, D. M., D. Heckerman, and C. Meek. 1997. A Bayesian approach to learning Bayesian networks with local structure. In Geiger, D. and P. Punadlik Shenoy (Eds.), Uncertainty in Artificial Intelligence: Proceedings of the Thirteenth Conference. 80--89.
[4]
Clarkson, P. and R. Rosenfeld. 1997. Statistical Language Modeling Using the CMU-Cambridge Toolkit. Proceedings of Eurospeech97. 2707--2710.
[5]
Heckerman, D., D. M. Chickering, C. Meek, R. Rounthwaite, and C. Kadie. 2000. Dependency networks for inference, collaborative filtering and data visualization. Journal of Machine Learning Research 1:49--75.
[6]
Heidorn, G. E., 2000. Intelligent writing assistance. In R. Dale, H. Moisl and H. Somers (Eds.). Handbook of Natural Language Processing. New York, NY. Marcel Dekker. 181--207.
[7]
Langkilde, I., and K. Knight. 1998. Generation that exploits corpus-based statistical knowledge. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics, and 17th International Conference on Computational Linguistics, Montreal, Canada. 704--710.
[8]
Nyberg, E. H., T. Mitamura, and J. G. Carbonnell. 1994. Evaluation Metrics for Knowledge-Based Machine Translation. In Proceedings of the 15th International Conference on Computational Linguistics, Kyoto, Japan (Coling 94). 95--99.
[9]
Platt, J., N. Cristianini, J. Shawe-Taylor. 2000. Large margin DAGs for multiclass classification. In Advances in Neural Information Processing Systems 12, MIT Press. 547--553.
[10]
Richardson, S., B. Dolan, A. Menezes, and J. Pinkham. 2001. Achieving commercial-quality translation with example-based methods. Submitted for review.
[11]
Ringger, E., M. Corston-Oliver, and R. Moore. 2001. Using Word-Perplexity for Automatic Evaluation of Machine Translation. Manuscript.
[12]
Su, K., M. Wu, and J. Chang. 1992. A new quantitative quality measure for machine translation systems. In Proceedings of COLING-92, Nantes, France. 433--439.
[13]
Vapnik, V. 1998. Statistical Learning Theory, Wiley-Interscience, New York.

Cited By

View all
  • (2020)Challenges in Building Intelligent Open-domain Dialog SystemsACM Transactions on Information Systems10.1145/338312338:3(1-32)Online publication date: 9-Apr-2020
  • (2019)LCEval: Learned Composite Metric for Caption EvaluationInternational Journal of Computer Vision10.1007/s11263-019-01206-z127:10(1586-1610)Online publication date: 1-Oct-2019
  • (2018)Improvement of machine translation evaluation by simple linguistically motivated featuresJournal of Computer Science and Technology10.5555/1991836.199184326:1(57-67)Online publication date: 21-Dec-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
July 2001
562 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 06 July 2001

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)60
  • Downloads (Last 6 weeks)5
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Challenges in Building Intelligent Open-domain Dialog SystemsACM Transactions on Information Systems10.1145/338312338:3(1-32)Online publication date: 9-Apr-2020
  • (2019)LCEval: Learned Composite Metric for Caption EvaluationInternational Journal of Computer Vision10.1007/s11263-019-01206-z127:10(1586-1610)Online publication date: 1-Oct-2019
  • (2018)Improvement of machine translation evaluation by simple linguistically motivated featuresJournal of Computer Science and Technology10.5555/1991836.199184326:1(57-67)Online publication date: 21-Dec-2018
  • (2018)Optimizing Automatic Evaluation of Machine Translation with the ListMLE ApproachACM Transactions on Asian and Low-Resource Language Information Processing10.1145/322604518:1(1-18)Online publication date: 12-Nov-2018
  • (2017)A machine learning approach to evaluating translation qualityProceedings of the 17th ACM/IEEE Joint Conference on Digital Libraries10.5555/3200334.3200373(281-282)Online publication date: 19-Jun-2017
  • (2012)DCU-symantec submission for the WMT 2012 quality estimation taskProceedings of the Seventh Workshop on Statistical Machine Translation10.5555/2393015.2393034(138-144)Online publication date: 7-Jun-2012
  • (2011)Corroborating text evaluation results with heterogeneous measuresProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145485(455-466)Online publication date: 27-Jul-2011
  • (2011)Regression and ranking based optimisation for sentence level machine translation evaluationProceedings of the Sixth Workshop on Statistical Machine Translation10.5555/2132960.2132975(123-129)Online publication date: 30-Jul-2011
  • (2011)e-rating machine translationProceedings of the Sixth Workshop on Statistical Machine Translation10.5555/2132960.2132973(108-115)Online publication date: 30-Jul-2011
  • (2010)Structural features for predicting the linguistic quality of textEmpirical methods in natural language generation10.5555/1880370.1880386(222-241)Online publication date: 1-Jan-2010
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media