[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/2206329.2206346dlproceedingsArticle/Chapter ViewAbstractPublication PagesiwptConference Proceedingsconference-collections
research-article
Free access

Active learning for dependency parsing using partially annotated sentences

Published: 05 October 2011 Publication History

Abstract

Current successful probabilistic parsers require large treebanks which are difficult, time consuming, and expensive to produce. Some parts of these data do not contain any useful information for training a parser. Active learning strategies allow to select the most informative samples for annotation. Most existing active learning strategies for parsing rely on selecting uncertain sentences for annotation. We show in this paper that selecting full sentences is not an optimal solution and propose a way to select only subparts of sentences.

References

[1]
Anne Abeillé, Lionel Clément, and Toussenel François, 2003. Treebanks, chapter Building a treebank for French. Kluwer, Dordrecht.
[2]
Bernd Bohnet. 2010. Top Accuracy and Fast Dependency Parsing is not a Contradiction. In Proceedings of COLING.
[3]
Marie Candito, Benoît Crabbé, and Pascal Denis. 2010. Statistical French Dependency Parsing: Treebank Conversion and First Results. In Proceedings of LREC2010.
[4]
X. Carreras. 2007. Experiments with a higher-order projective dependency parser. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL, volume 7, pages 957--961.
[5]
Y. J. Chu and T. H. Liu. 1965. On the shortest arborescence of a directed graph. Science Sinica, 14(1396--1400):270.
[6]
K. Church and R. Patil. 1982. Coping with syntactic ambiguity or how to put the block in the box on the table. Computational Linguistics, 8(3--4):139--149.
[7]
Koby Crammer, Ofer Dekel, Joseph Keshet, Shai ShalevShwartz, and Yoram Singer. 2006. Online Passive-Aggressive Algorithm. Journal of Machine Learning Research.
[8]
J. Edmonds, J. Edmonds, and J. Edmonds. 1968. Optimum branchings. National Bureau of standards.
[9]
K. Hall. 2007. K-best spanning tree parsing. In Proceedings of the 45th Annual Meeting of the ACL, page 392.
[10]
L. Huang and D. Chiang. 2005. Better k-best parsing. In Proceedings of the 9th International Workshop on Parsing Technology, pages 53--64.
[11]
R. Hwa. 2004. Sample selection for statistical parsing. Computational Linguistics, 30(3):253--276.
[12]
T. Koo and M. Collins. 2010. Efficient third-order dependency parsers. In Proceedings of the 48th Annual Meeting of the ACL, pages 1--11.
[13]
S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency parsing. Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers.
[14]
D. D. Lewis and W. A. Gale. 1994. A sequential algorithm for training text classifiers. In Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, pages 3--12.
[15]
R. McDonald and G. Satta. 2007. On the complexity of non-projective data-driven dependency parsing. In Proceedings of IWPT, pages 121--132.
[16]
R. McDonald, K. Crammer, and F. Pereira. 2005.
[17]
Online large-margin training of dependency parsers. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, pages 91--98.
[18]
F. Olsson. 2009. A literature survey of active machine learning in the context of natural language processing. Technical report, Swedish Institute of Computer Science.
[19]
R. Sánchez-Sáez, J. A. Sánchez, and J. M. Benedí. 2009. Statistical confidence measures for probabilistic parsing. In Proceedings of RANLP, pages 388--392.
[20]
Manabu Sassano and Sadao Kurohashi. 2010. Using smaller constituents rather than sentences in active learning for japanese dependency parsing. In Proceedings of the 48th Annual Meeting of the ACL, pages 356--365.
[21]
Burr Settles. 2010. Active Learning Literature Survey. Technical Report Technical Report 1648, University of Wisconsin-Madison.
[22]
M. Tang, X. Luo, and S. Roukos. 2002. Active learning for statistical natural language parsing. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 120--127.

Cited By

View all
  • (2019)A novel fusion mixture of active experts algorithm for traffic signs recognitionMultimedia Tools and Applications10.1007/s11042-019-7391-078:14(20217-20237)Online publication date: 2-Aug-2019
  • (2012)Semi-supervised dependency parsing using lexical affinitiesProceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 110.5555/2390524.2390634(777-785)Online publication date: 8-Jul-2012

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
IWPT '11: Proceedings of the 12th International Conference on Parsing Technologies
October 2011
264 pages
ISBN:9781932432046

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 05 October 2011

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)60
  • Downloads (Last 6 weeks)7
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2019)A novel fusion mixture of active experts algorithm for traffic signs recognitionMultimedia Tools and Applications10.1007/s11042-019-7391-078:14(20217-20237)Online publication date: 2-Aug-2019
  • (2012)Semi-supervised dependency parsing using lexical affinitiesProceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 110.5555/2390524.2390634(777-785)Online publication date: 8-Jul-2012

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media