[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.3115/1219840.1219907dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Machine translation using probabilistic synchronous dependency insertion grammars

Published: 25 June 2005 Publication History

Abstract

Syntax-based statistical machine translation (MT) aims at applying statistical models to structured data. In this paper, we present a syntax-based statistical machine translation system based on a probabilistic synchronous dependency insertion grammar. Synchronous dependency insertion grammars are a version of synchronous grammars defined on dependency trees. We first introduce our approach to inducing such a grammar from parallel corpora. Second, we describe the graphical model for the machine translation task, which can also be viewed as a stochastic tree-to-tree transducer. We introduce a polynomial time decoding algorithm for the model. We evaluate the outputs of our MT system using the NIST and Bleu automatic MT evaluation software. The result shows that our system outperforms the baseline system based on the IBM models in both translation speed and quality.

References

[1]
Y. Al-Onaizan, J. Curin, M. Jahr, K. Knight, J. Lafferty, I. D. Melamed, F. Och, D. Purdy, N. A. Smith, and D. Yarowsky. 1999. Statistical machine translation. Technical report, CLSP, Johns Hopkins University.
[2]
H. Alshawi, S. Bangalore, S. Douglas. 2000. Learning dependency translation models as collections of finite state head transducers. Comp. Linguistics, 26(1):45--60.
[3]
Daniel M. Bikel. 2002. Design of a multi-lingual, parallel-processing statistical parsing engine. In HLT 2002.
[4]
Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, and Robert Mercer. 1993. The mathematics of statistical machine translation: parameter estimation. Computational Linguistics, 19(2): 263--311.
[5]
Michael John Collins. 1999. Head-driven Statistical Models for Natural Language Parsing. Ph.D. thesis, University of Pennsylvania, Philadelphia.
[6]
Ding and Palmer. 2004a. Automatic Learning of Parallel Dependency Treelet Pairs. In First International Joint Conference on NLP (IJCNLP-04).
[7]
Ding and Palmer. 2004b. Synchronous Dependency Insertion Grammars: A Grammar Formalism for Syntax Based Statistical MT. Workshop on Recent Advances in Dependency Grammars, COLING-04.
[8]
Bonnie J. Dorr. 1994. Machine translation divergences: A formal description and proposed solution. Computational Linguistics, 20(4): 597--633.
[9]
Jason Eisner. 2003. Learning non-isomorphic tree mappings for machine translation. In ACL-03. (companion volume), Sapporo, July.
[10]
Heidi J. Fox. 2002. Phrasal cohesion and statistical machine translation. In Proceedings of EMNLP-02.
[11]
Ulrich Germann, Michael Jahr, Kevin Knight, Daniel Marcu, and Kenji Yamada. 2001. Fast Decoding and Optimal Decoding for Machine Translation. ACL-01.
[12]
Daniel Gildea. 2003. Loosely tree based alignment for machine translation. ACL-03, Japan.
[13]
Jonathan Graehl and Kevin Knight. 2004. Training Tree Transducers. In NAACL/HLT-2004
[14]
Jan Hajic, et al. 2002. Natural language generation in the context of machine translation. Summer workshop final report, Center for Language and Speech Processing, Johns Hopkins University, Baltimore.
[15]
Rebecca Hwa, Philip S. Resnik, Amy Weinberg, and Okan Kolak. 2002. Evaluating translational correspondence using annotation projection. ACL-02
[16]
Ali Ibrahim, Boris Katz, and Jimmy Lin. 2003. Extracting Structural Paraphrases from Aligned Monolingual Corpora. In Proceedings of the Second International Workshop on Paraphrasing (IWP 2003)
[17]
Dan Melamed. 2004. Statistical Machine Translation by Parsing. In ACL-04, Barcelona, Spain.
[18]
Dan Melamed. 2003. Multitext Grammars and Synchronous Parsers, In NAACL/HLT-2003.
[19]
K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. ACL-02, Philadelphia, USA.
[20]
Ryan McDonald, Koby Crammer and Fernando Pereira. 2005. Online Large-Margin Training of Dependency Parsers. ACL-05.
[21]
Franz Josef Och and Hermann Ney. 2003. A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics, 29(1):19--51.
[22]
S. M. Shieber and Y. Schabes. 1990. Synchronous Tree-Adjoining Grammars, Proceedings of the 13th COLING, pp. 253--258, August 1990.
[23]
Dekai Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3):3--403.
[24]
Fei Xia. 2001. Automatic grammar generation from two different perspectives. PhD thesis, U. of Pennsylvania.
[25]
Kenji Yamada and Kevin Knight. 2001. A syntax based statistical translation model. ACL-01, France.
[26]
Kenji Yamada and Kevin Knight. 2002. A decoder for syntax-based statistical MT. ACL-02, Philadelphia.

Cited By

View all
  • (2023)Named entity recognition and dependency parsing for better concept extraction in summary obfuscation detectionExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.119579217:COnline publication date: 1-May-2023
  • (2021)Automated Text SimplificationACM Computing Surveys10.1145/344269554:2(1-36)Online publication date: 5-Mar-2021
  • (2019)Word Reordering for Translation into Korean Sign Language Using Syntactically-guided ClassificationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/335761219:2(1-20)Online publication date: 23-Nov-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
June 2005
657 pages
  • General Chair:
  • Kevin Knight

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 25 June 2005

Qualifiers

  • Article

Acceptance Rates

ACL '05 Paper Acceptance Rate 77 of 423 submissions, 18%;
Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)73
  • Downloads (Last 6 weeks)10
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Named entity recognition and dependency parsing for better concept extraction in summary obfuscation detectionExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.119579217:COnline publication date: 1-May-2023
  • (2021)Automated Text SimplificationACM Computing Surveys10.1145/344269554:2(1-36)Online publication date: 5-Mar-2021
  • (2019)Word Reordering for Translation into Korean Sign Language Using Syntactically-guided ClassificationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/335761219:2(1-20)Online publication date: 23-Nov-2019
  • (2019)Multitask Pointer Network for Korean Dependency ParsingACM Transactions on Asian and Low-Resource Language Information Processing10.1145/328244218:3(1-10)Online publication date: 8-Feb-2019
  • (2017)Leveraging event-based semantics for automated text simplificationExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.04.00582:C(383-395)Online publication date: 1-Oct-2017
  • (2016)Modeling Monolingual Character Alignment for Automatic Evaluation of Chinese TranslationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/281561915:3(1-18)Online publication date: 28-Jan-2016
  • (2014)Statistical machine translation enhancements through linguistic levelsACM Computing Surveys10.1145/251813046:3(1-28)Online publication date: 1-Jan-2014
  • (2013)Post-Ordering by Parsing with ITG for Japanese-English Statistical Machine TranslationACM Transactions on Asian Language Information Processing10.1145/251810012:4(1-22)Online publication date: 1-Oct-2013
  • (2013)Bagging and Boosting statistical machine translation systemsArtificial Intelligence10.1016/j.artint.2012.11.005195(496-527)Online publication date: 1-Feb-2013
  • (2012)Who did what to whom?Proceedings of the Sixth Linguistic Annotation Workshop10.5555/2392747.2392751(2-11)Online publication date: 12-Jul-2012
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media