[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1870658.1870701dlproceedingsArticle/Chapter ViewAbstractPublication PagesemnlpConference Proceedingsconference-collections
research-article
Free access

Non-isomorphic forest pair translation

Published: 09 October 2010 Publication History

Abstract

This paper studies two issues, non-isomorphic structure translation and target syntactic structure usage, for statistical machine translation in the context of forest-based tree to tree sequence translation. For the first issue, we propose a novel non-isomorphic translation framework to capture more non-isomorphic structure mappings than traditional tree-based and tree-sequence-based translation methods. For the second issue, we propose a parallel space searching method to generate hypothesis using tree-to-string model and evaluate its syntactic goodness using tree-to-tree/tree sequence model. This not only reduces the search complexity by merging spurious-ambiguity translation paths and solves the data sparseness issue in training, but also serves as a syntax-based target language model for better grammatical generation. Experiment results on the benchmark data show our proposed two solutions are very effective, achieving significant performance improvement over baselines when applying to different translation models.

References

[1]
Eugene Charniak. 2000. A maximum-entropy inspired parser. NAACL-00.
[2]
Eugene Charniak, Kevin Knight, and Kenji Yamada. 2003. Syntax-based language models for statistical machine translation. MT Summit IX. 40--46.
[3]
David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33(2).
[4]
Steve DeNeefe, Kevin Knight. 2009. Synchronous Tree Adjoining Machine Translation. EMNLP-2009. 727--736.
[5]
Jason Eisner. 2003. Learning non-isomorphic tree mappings for MT. ACL-03 (companion volume).
[6]
Michel Galley, Mark Hopkins, Kevin Knight and Daniel Marcu. 2004. What's in a translation rule? HLT-NAACL-04. 273--280.
[7]
Liang Huang. 2008. Forest Reranking: Discriminative Parsing with Non-Local Features. ACL-HLT-08. 586--594
[8]
Liang Huang and David Chiang. 2005. Better k-best Parsing. IWPT-05. 53--64
[9]
Liang Huang and David Chiang. 2007. Forest rescoring: Faster decoding with integrated language models. ACL-07. 144--151
[10]
Dan Klein and Christopher D. Manning. 2001. Parsing and Hypergraphs. IWPT-2001.
[11]
Reinhard Kneser and Hermann Ney. 1995. Improved backing-off for M-gram language modeling. ICASSP-95, 181--184
[12]
Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin and Evan Herbst. 2007. Moses: Open Source Toolkit for Statistical Machine Translation. ACL-07. 177--180. (poster)
[13]
Yang Liu, Qun Liu and Shouxun Lin. 2006. Tree-to-String Alignment Template for Statistical Machine Translation. COLING-ACL-06. 609--616.
[14]
Yang Liu, Yun Huang, Qun Liu and Shouxun Lin. 2007. Forest-to-String Statistical Translation Rules. ACL-07. 704--711.
[15]
Yang Liu, Yajuan Lü, Qun Liu. 2009. Improving Tree-to-Tree Translation with Packed Forests. ACL-09. 558--566
[16]
Haitao Mi, Liang Huang, and Qun Liu. 2008. Forest-based translation. ACL-HLT-08. 192--199.
[17]
Haitao Mi and Liang Huang. 2008. Forest-based Translation Rule Extraction. EMNLP-08. 206--214.
[18]
Franz J. Och. 2003. Minimum error rate training in statistical machine translation. ACL-03. 160--167.
[19]
Franz Josef Och and Hermann Ney. 2003. A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics. 29(1) 19--51.
[20]
Kishore Papineni, Salim Roukos, Todd Ward and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. ACL-02. 311--318.
[21]
Andreas Stolcke. 2002. SRILM - an extensible language modeling toolkit. ICSLP-02. 901--904.
[22]
Masaru Tomita. 1987. An Efficient Augmented-Context-Free Parsing Algorithm. Computational Linguistics 13(1--2): 31--46
[23]
Xavier Carreras and Michael Collins. 2009. Non-projective Parsing for Statistical Machine Translation. EMNLP-2009. 200--209.
[24]
K. Yamada and K. Knight. 2001. A Syntax-Based Statistical Translation Model. ACL-01. 523--530.
[25]
Hui Zhang, Min Zhang, Haizhou Li, Aiti Aw and Chew Lim Tan. 2009a. Forest-based Tree Sequence to String Translation Model. ACL-IJCNLP-09. 172--180.
[26]
Hui Zhang, Min Zhang, Haizhou Li, and Chew Lim Tan. 2009b. Fast Translation Rule Matching for Syntax-based Statistical Machine Translation. EMNLP-09. 1037--1045.
[27]
Min Zhang, Hongfei Jiang, Ai Ti Aw, Jun Sun, Chew Lim Tan and Sheng Li. 2007. A Tree-to-Tree Alignment-based model for statistical Machine translation. MT-Summit-07. 535--542
[28]
Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, Sheng Li. 2008. A Tree Sequence Alignment-based Tree-to-Tree Translation Model. ACL-HLT-08. 559--567.
[29]
Ying Zhang, Stephan Vogel, Alex Waibel. 2004. Interpreting BLEU/NIST scores: How much improvement do we need to have a better system? LREC-04. 2051--2054.

Cited By

View all
  • (2011)Augmenting string-to-tree translation models with fuzzy use of source-side syntaxProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145456(204-215)Online publication date: 27-Jul-2011

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
EMNLP '10: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
October 2010
1332 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 09 October 2010

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 73 of 234 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)43
  • Downloads (Last 6 weeks)6
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2011)Augmenting string-to-tree translation models with fuzzy use of source-side syntaxProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145456(204-215)Online publication date: 27-Jul-2011

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media