DOI: 10.3115/1218955.1219038

Statistical machine translation by parsing

Published: 21 July 2004

Abstract

In an ordinary syntactic parser, the input is a string, and the grammar ranges over strings. This paper explores generalizations of ordinary parsing algorithms that allow the input to consist of string tuples and/or the grammar to range over string tuples. Such algorithms can infer the synchronous structures hidden in parallel texts. It turns out that these generalized parsers can do most of the work required to train and apply a syntax-aware statistical machine translation system.
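
To make the idea of parsing a string tuple concrete, here is a minimal sketch of a recognizer whose input is a pair of token sequences rather than a single string. It is not the algorithm of this paper: it assumes the much simpler bracketing inversion transduction grammar in the style of Wu (1997), with one nonterminal, binary rules that keep or invert the order of their two children, and no null alignments. The function name `synchronous_recognize` and the toy `lexicon` are hypothetical and chosen only for illustration.

```python
from functools import lru_cache


def synchronous_recognize(src, tgt, lexicon):
    """Return True if the pair (src, tgt) has a synchronous derivation.

    Assumed grammar (illustrative only): a single nonterminal X with
      X -> [X X]   children in the same order on both sides
      X -> <X X>   children in inverted order on the target side
      X -> s / t   for every word pair (s, t) in `lexicon`
    """
    src, tgt = tuple(src), tuple(tgt)

    @lru_cache(maxsize=None)
    def derives(i, j, k, l):
        # Can one X cover src[i:j] and tgt[k:l] simultaneously?
        if j - i == 1 and l - k == 1:
            return (src[i], tgt[k]) in lexicon
        for s in range(i + 1, j):          # split point in the source span
            for t in range(k + 1, l):      # split point in the target span
                # Straight rule: left source part pairs with left target part.
                if derives(i, s, k, t) and derives(s, j, t, l):
                    return True
                # Inverted rule: left source part pairs with right target part.
                if derives(i, s, t, l) and derives(s, j, k, t):
                    return True
        return False

    if not src or not tgt:
        return False
    return derives(0, len(src), 0, len(tgt))


if __name__ == "__main__":
    toy_lexicon = {("the", "la"), ("white", "blanche"), ("house", "maison")}
    print(synchronous_recognize(
        "the white house".split(), "la maison blanche".split(), toy_lexicon))
```

Each chart item is indexed by a span in every component of the input tuple, which is the essential difference from an ordinary single-string chart parser. With two components and binary rules, this sketch fills O(n^4) items and tries O(n^2) split points for each.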

References

[1]
A. Aho & J. Ullman (1969) "Syntax Directed Translations and the Pushdown Assembler," Journal of Computer and System Sciences 3, 37--56.
[2]
H. Alshawi, S. Bangalore, & S. Douglas (2000) "Learning Dependency Translation Models as Collections of Finite State Head Transducers," Computational Linguistics 26(1):45--60.
[3]
P. F. Brown, S. A. Della Pietra, V. J. Della Pietra, & R. L. Mercer (1993) "The Mathematics of Statistical Machine Translation: Parameter Estimation," Computational Linguistics 19(2):263--312.
[4]
J. Goodman (1999) "Semiring Parsing," Computational Linguistics 25(4):573--605.
[5]
R. Hwa, P. Resnik, A. Weinberg, & O. Kolak (2002) "Evaluating Translational Correspondence using Annotation Projection," Proceedings of the ACL.
[6]
J. Eisner (2002) "Parameter Estimation for Probabilistic Finite-State Transducers," Proceedings of the ACL.
[7]
K. Lari & S. Young (1990) "The Estimation of Stochastic Context-Free Grammars using the Inside-Outside Algorithm," Computer Speech and Language Processing 4:35--56.
[8]
Y. Lü, S. Li, T. Zhao, & M. Yang (2002) "Learning Chinese Bracketing Knowledge Based on a Bilingual Language Model," Proceedings of COLING.
[9]
G. S. Mann & D. Yarowsky (2001) "Multipath Translation Lexicon Induction via Bridge Languages," Proceedings of HLT/NAACL.
[10]
Y. Matsumoto (1993) "Structural Matching of Parallel Texts," Proceedings of the ACL.
[11]
I. D. Melamed (2003) "Multitext Grammars and Synchronous Parsers," Proceedings of HLT/NAACL.
[12]
I. D. Melamed, G. Satta, & B. Wellington (2004) "Generalized Multitext Grammars," Proceedings of the ACL (this volume).
[13]
A. Meyers, R. Yangarber, & R. Grishman (1996) "Alignment of Shared Forests for Bilingual Corpora," Proceedings of COLING.
[14]
F. Och & H. Ney (2001) "Statistical Multi-Source Translation," Proceedings of MT Summit VIII.
[15]
F. Och & H. Ney (2003) "A Systematic Comparison of Various Statistical Alignment Models," Computational Linguistics 29(1): 19--51.
[16]
K. Sima'an (1996) "Computational Complexity of Probabilistic Disambiguation by means of Tree-Grammars," Proceedings of COLING.
[17]
D. Wu (1996) "A polynomial-time algorithm for statistical machine translation," Proceedings of the ACL.
[18]
D. Wu (1997) "Stochastic inversion transduction grammars and bilingual parsing of parallel corpora," Computational Linguistics 23(3):377--404.
[19]
D. Wu & H. Wong (1998) "Machine translation with a stochastic grammatical channel," Proceedings of the ACL.
[20]
K. Yamada & K. Knight (2002) "A Decoder for Syntax-based Statistical MT," Proceedings of the ACL.
[21]
D. Yarowsky & G. Ngai (2001) "Inducing Multilingual POS Taggers and NP Bracketers via Robust Projection Across Aligned Corpora," Proceedings of the NAACL.

Information

Published In

ACL '04: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
July 2004
729 pages

Publisher

Association for Computational Linguistics

United States

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Bibliometrics

Article Metrics

  • Downloads (last 12 months): 41
  • Downloads (last 6 weeks): 6

Download counts reflect activity up to 04 Jan 2025.

Cited By

  • (2011) Learning sentential paraphrases from bilingual parallel corpora for text-to-text generation. Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1168-1179. DOI: 10.5555/2145432.2145556. Online publication date: 27-Jul-2011
  • (2011) A joint sequence translation model with integrated reordering. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, pp. 1045-1054. DOI: 10.5555/2002472.2002604. Online publication date: 19-Jun-2011
  • (2011) Incremental syntactic language models for phrase-based translation. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, pp. 620-631. DOI: 10.5555/2002472.2002552. Online publication date: 19-Jun-2011
  • (2010) Joint parsing and translation. Proceedings of the 23rd International Conference on Computational Linguistics, pp. 707-715. DOI: 10.5555/1873781.1873861. Online publication date: 23-Aug-2010
  • (2009) Non-projective parsing for statistical machine translation. Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1, pp. 200-209. DOI: 10.5555/1699510.1699537. Online publication date: 6-Aug-2009
  • (2009) Case markers and morphology. Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2, pp. 800-808. DOI: 10.5555/1690219.1690258. Online publication date: 2-Aug-2009
  • (2009) Paraphrase identification as probabilistic quasi-synchronous recognition. Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1, pp. 468-476. DOI: 10.5555/1687878.1687944. Online publication date: 2-Aug-2009
  • (2009) Reordering model using syntactic information of a source tree for statistical machine translation. Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation, pp. 69-77. DOI: 10.5555/1626344.1626353. Online publication date: 5-Jun-2009
  • (2009) Translation as weighted deduction. Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, pp. 532-540. DOI: 10.5555/1609067.1609126. Online publication date: 30-Mar-2009
  • (2009) N-gram-based statistical machine translation versus syntax augmented machine translation. Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, pp. 424-432. DOI: 10.5555/1609067.1609114. Online publication date: 30-Mar-2009