[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1620853.1620915dlproceedingsArticle/Chapter ViewAbstractPublication PagesnaaclConference Proceedingsconference-collections
research-article
Free access

Tree linearization in English: improving language model based approaches

Published: 31 May 2009 Publication History

Abstract

We compare two approaches to dependency tree linearization, a task which arises in many NLP applications. The first one is the widely used 'overgenerate and rank' approach which relies exclusively on a trigram language model (LM); the second one combines language modeling with a maximum entropy classifier trained on a range of linguistic features. The results provide strong support for the combined method and show that trigram LMs are appropriate for phrase linearization while on the clause level a richer representation is necessary to achieve comparable performance.

References

[1]
Clarkson, P.&R. Rosenfeld (1997). Statistical language modeling using the CMU-Cambridge toolkit. In Proc. of EUROSPEECH-97, pp. 2707--2710.
[2]
Filippova, K.&M. Strube (2007). Generating constituent order in German clauses. In Proc. of ACL-07, pp. 320--327.
[3]
Goodman, J. T. (2001). A bit of progress in language modeling. Computer Speech and Language, pp. 403--434.
[4]
Jurafsky, D.&J. H. Martin (2008). Speech and Language Processing. Upper Saddle River, N.J.: Prentice Hall.
[5]
Kendall, M. G. (1938). A new measure of rank correlation. Biometrika, 30:81--93.
[6]
Klein, D.&C. D. Manning (2003). Accurate unlexicalized parsing. In Proc. of ACL-03, pp. 423--430.
[7]
Langkilde, I.&K. Knight (1998). Generation that exploits corpus-based statistical knowledge. In Proc. of COLING-ACL-98, pp. 704--710.
[8]
Lapata, M. (2006). Automatic evaluation of information ordering: Kendall's tau. Computational Linguistics, 32(4):471--484.
[9]
Marsi, E.&E. Krahmer (2005). Explorations in sentence fusion. In Proc. of ENLG-05, pp. 109--117.
[10]
Ringger, E., M. Gamon, R. C. Moore, D. Rojas, M. Smets&S. Corston-Oliver (2004). Linguistically informed statistical models of constituent structure for ordering in sentence realization. In Proc. of COLING-04, pp. 673--679.
[11]
Uchimoto, K., M. Murata, Q. Ma, S. Sekine&H. Isahara (2000). Word order acquisition from corpora. In Proc. of COLING-00, pp. 871--877.

Cited By

View all
  • (2015)Generating Abstractive Summaries from Meeting TranscriptsProceedings of the 2015 ACM Symposium on Document Engineering10.1145/2682571.2797061(51-60)Online publication date: 8-Sep-2015
  • (2012)Generating non-projective word order in statistical linearizationProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2391049(928-939)Online publication date: 12-Jul-2012
  • (2012)Minimal dependency length in realization rankingProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2390979(244-255)Online publication date: 12-Jul-2012
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
NAACL-Short '09: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
May 2009
317 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 31 May 2009

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 21 of 29 submissions, 72%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)54
  • Downloads (Last 6 weeks)8
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2015)Generating Abstractive Summaries from Meeting TranscriptsProceedings of the 2015 ACM Symposium on Document Engineering10.1145/2682571.2797061(51-60)Online publication date: 8-Sep-2015
  • (2012)Generating non-projective word order in statistical linearizationProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2391049(928-939)Online publication date: 12-Jul-2012
  • (2012)Minimal dependency length in realization rankingProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2390979(244-255)Online publication date: 12-Jul-2012
  • (2012)To what extent does sentence-internal realisation reflect discourse context?Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics10.5555/2380816.2380910(767-776)Online publication date: 23-Apr-2012
  • (2012)Syntax-based word ordering incorporating a large-scale language modelProceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics10.5555/2380816.2380906(736-746)Online publication date: 23-Apr-2012
  • (2011)Learning to fuse disparate sentencesProceedings of the Workshop on Monolingual Text-To-Text Generation10.5555/2107679.2107686(54-63)Online publication date: 24-Jun-2011
  • (2011)Towards strict sentence intersectionProceedings of the Workshop on Monolingual Text-To-Text Generation10.5555/2107679.2107685(43-53)Online publication date: 24-Jun-2011
  • (2011)Underspecifying and predicting voice for surface realisation rankingProceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 110.5555/2002472.2002599(1007-1017)Online publication date: 19-Jun-2011
  • (2010)On the limits of sentence compression by deletionEmpirical methods in natural language generation10.5555/1880370.1880374(45-66)Online publication date: 1-Jan-2010
  • (2010)Broad coverage multilingual deep sentence generation with a stochastic multi-level realizerProceedings of the 23rd International Conference on Computational Linguistics10.5555/1873781.1873793(98-106)Online publication date: 23-Aug-2010
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media