[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1699510.1699537dlproceedingsArticle/Chapter ViewAbstractPublication PagesemnlpConference Proceedingsconference-collections
research-article
Free access

Non-projective parsing for statistical machine translation

Published: 06 August 2009 Publication History

Abstract

We describe a novel approach for syntax-based statistical MT, which builds on a variant of tree adjoining grammar (TAG). Inspired by work in discriminative dependency parsing, the key idea in our approach is to allow highly flexible reordering operations during parsing, in combination with a discriminative model that can condition on rich features of the source-language string. Experiments on translation from German to English show improvements over phrase-based systems, both in terms of BLEU scores and in human evaluations.

References

[1]
H. Alshawi. 1996. Head automata and bilingual tiling: Translation with minimal representations. In Proceedings of ACL, pages 167--176.
[2]
X. Carreras, M. Collins, and T. Koo. 2008. TAG, dynamic programming and the perceptron for efficient, feature-rich parsing. In Proc. of CoNLL.
[3]
E. Charniak, K. Knight, and K. Yamada. 2003. Syntax-based language models for machine translation. In Proceedings of MT Summit IX.
[4]
E. Charniak. 2001. Immediate-head parsing for language models. In Proceedings of ACL 2001.
[5]
C. Cherry. 2008. Cohesive phrase-based decoding for statistical machine translation. In Proceedings of ACL-08: HLT, pages 72--80, Columbus, Ohio, June. Association for Computational Linguistics.
[6]
D. Chiang. 2005. A hierarchical phrase-based model for statistical machine translation. In Proceedings of ACL.
[7]
M. Collins, P. Koehn, and I. Kucerova. 2005. Clause restructuring for statistical machine translation. In Proceedings of ACL.
[8]
M. Collins. 1997. Three generative, lexicalised models for statistical parsing. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, pages 16--23, Madrid, Spain, July. Association for Computational Linguistics.
[9]
J. Eisner. 2000. Bilexical grammars and their cubic-time parsing algorithms. In H. C. Bunt and A. Nijholt, editors, New Developments in Natural Language Parsing, pages 29--62. Kluwer Academic Publishers.
[10]
J. Eisner. 2003. Learning non-isomorphic tree mappings for machine translation. In Proceedings of ACL.
[11]
A. K. Joshi and Y. Schabes. 1997. Tree-adjoining grammars. In G. Rozenberg and K. Salomaa, editors, Handbook of Formal Languages, volume 3, pages 169--124. Springer.
[12]
P. Koehn, F. J. Och, and D. Marcu. 2003. Statistical phrase-based translation. In Proceedings of HLT/NAACL.
[13]
P. Koehn. 2004. Statistical significance tests for machine translation evaluation. In Dekang Lin and Dekai Wu, editors, Proceedings of EMNLP 2004, pages 388--395, Barcelona, Spain, July. Association for Computational Linguistics.
[14]
P. Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. In Proceedings of MT Summit.
[15]
D. Marcu, W. Wang, A. Echihabi, and K. Knight. 2006. Spmt: Statistical machine translation with syntactified target language phrases. In Proceedings of EMNLP.
[16]
R. McDonald, K. Crammer, and F. Pereira. 2005. Online large-margin training of dependency parsers. In Proceedings of ACL.
[17]
D. Melamed. 2004. Statistical machine translation by parsing. In Proceedings of ACL.
[18]
H. Mi, L. Huang, and Q. Liu. 2008. Forest-based translation. In Proceedings of ACL-08: HLT, pages 192--199. Association for Computational Linguistics.
[19]
R. Nesson, S. M. Shieber, and A. Rush. 2006. Induction of probabilistic synchronous tree-insertion grammars for machine translation. In Proceedings of the 7th AMTA.
[20]
F. J. Och. 2003. Minimum error rate training for statistical machine translation. In Proceedings of ACL.
[21]
K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of ACL, pages 311--318. Association for Computational Linguistics.
[22]
C. Quirk, A. Menezes, and Colin Cherry. 2005. Dependency tree translation: Syntactically informed phrasal smt. In Proceedings of ACL.
[23]
O. Rambow, K. Vijay-Shanker, and D. Weir. 1995. D-tree grammars. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pages 151--158, Cambridge, Massachusetts, USA, June. Association for Computational Linguistics.
[24]
L. Shen, J. Xu, and R. Weischedel. 2008. A new string-to-dependency machine translation algorithm with a target dependency language model. In Proceedings of ACL.
[25]
D. Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3):377--404.
[26]
K. Yamada and K. Knight. 2001. A syntax-based statistical translation model. In Proceedings of ACL.
[27]
H. Zhang and D. Gildea. 2005. Stochastic lexicalized inversion transduction grammar for alignment. In Proceedings of ACL, pages 473--482.
[28]
A. Zollmann and A. Venugopal. 2006. Syntax augmented machine translation via chart parsing. In Proceedings of NAACL 2006 Workshop on Statistical Machine Translation.

Cited By

View all
  • (2011)Quasi-synchronous phrase dependency grammars for machine translationProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145488(474-485)Online publication date: 27-Jul-2011
  • (2010)Constituent reordering and syntax models for English-to-Japanese statistical machine translationProceedings of the 23rd International Conference on Computational Linguistics10.5555/1873781.1873852(626-634)Online publication date: 23-Aug-2010
  • (2010)Statistical machine translation with a factorized grammarProceedings of the 2010 Conference on Empirical Methods in Natural Language Processing10.5555/1870658.1870718(616-625)Online publication date: 9-Oct-2010
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
August 2009
505 pages
ISBN:9781932432596

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 06 August 2009

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 73 of 234 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)2
Reflects downloads up to 05 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2011)Quasi-synchronous phrase dependency grammars for machine translationProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145488(474-485)Online publication date: 27-Jul-2011
  • (2010)Constituent reordering and syntax models for English-to-Japanese statistical machine translationProceedings of the 23rd International Conference on Computational Linguistics10.5555/1873781.1873852(626-634)Online publication date: 23-Aug-2010
  • (2010)Statistical machine translation with a factorized grammarProceedings of the 2010 Conference on Empirical Methods in Natural Language Processing10.5555/1870658.1870718(616-625)Online publication date: 9-Oct-2010
  • (2010)Non-isomorphic forest pair translationProceedings of the 2010 Conference on Empirical Methods in Natural Language Processing10.5555/1870658.1870701(440-450)Online publication date: 9-Oct-2010
  • (2010)Corpus creation for new genresProceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk10.5555/1866696.1866698(13-20)Online publication date: 6-Jun-2010
  • (2010)String-to-dependency statistical machine translationComputational Linguistics10.1162/coli_a_0001536:4(649-671)Online publication date: 1-Dec-2010

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media