[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1273073.1273133dlproceedingsArticle/Chapter ViewAbstractPublication PagescolingConference Proceedingsconference-collections
Article
Free access

Minority vote: at-least-N voting improves recall for extracting relations

Published: 17 July 2006 Publication History

Abstract

Several NLP tasks are characterized by asymmetric data where one class label NONE, signifying the absence of any structure (named entity, coreference, relation, etc.) dominates all other classes. Classifiers built on such data typically have a higher precision and a lower recall and tend to overproduce the NONE class. We present a novel scheme for voting among a committee of classifiers that can significantly boost the recall in such situations. We demonstrate results showing up to a 16% relative improvement in ACE value for the 2004 ACE relation extraction task for English, Arabic and Chinese.

References

[1]
D. M. Bikel, S. Miller, R. Schwartz, and R. Weischedel. 1997. Nymble: a high-performance learning name-finder. In Proceedings of ANLP-97, pages 194--201.
[2]
A. Borthwick. 1999. A Maximum Entropy Approach to Named Entity Recognition. Ph.D. thesis, New York University.
[3]
L. Breiman. 1996. Bagging predictors. In Machine Learning, volume 24, page 123.
[4]
E. Brill and J. Wu. 1998. Classifier combination for improved lexical disambiguation. Proceedings of COLING-ACL'98, pages 191--195, August.
[5]
Radu Florian and David Yarowsky. 2002. Modeling consensus: Classifier combination for word sense disambiguation. In Proceedings of EMNLP'02, pages 25--32.
[6]
R. Florian, A. Ittycheriah, H. Jing, and T. Zhang. 2003. Named entity recognition through classifier combination. In Proceedings of CoNNL'03, pages 168--171.
[7]
R. Florian, H. Hassan, A. Ittycheriah, H. Jing, N. Kambhatla, X. Luo, N Nicolov, and S Roukos. 2004. A statistical model for multilingual entity detection and tracking. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004, pages 1--8.
[8]
J. Henderson and E. Brill. 1999. Exploiting diversity in natural language processing: Combining parsers. In Proceedings on EMNLP99, pages 187--194.
[9]
T. K. Ho, J. J. Hull, and S. N. Srihari. 1994. Decision combination in multiple classifier systems. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(1):66--75, January.
[10]
Nanda Kambhatla. 2004. Combining lexical, syntactic, and semantic features with maximum entropy models for information extraction. In The Proceedings of 42st Annual Meeting of the Association for Computational Linguistics, pages 178--181, Barcelona, Spain, July. Association for Computational Linguistics.
[11]
D. Magerman. 1993. Parsing as statistical pattern recognition.
[12]
NIST. 2004. The ACE evaluation plan. www.nist.gov/speech/tests/ace/index.htm.
[13]
Adwait Ratnaparkhi. 1999. Learning to parse natural language with maximum entropy models. Machine Learning, 34:151--178.
[14]
W. M. Soon, H. T. Ng, and C. Y. Lim. 2001. A machine learning approach to coreference resolution of noun phrases. Computational Linguistics, 27(4):521--544.
[15]
E. F. Tjong Kim Sang, W. Daelemans, H. Dejean, R. Koeling, Y. Krymolowsky, V. Punyakanok, and D. Roth. 2000. Applying system combination to base noun phrase identification. In Proceedings of COLING 2000, pages 857--863.
[16]
H. van Halteren, J. Zavrel, and W. Daelemans. 1998. Improving data driven wordclass tagging by system combination. In Proceedings of COLING-ACL'98, pages 491--497.
[17]
L. Xu, A. Krzyzak, and C. Suen. 1992. Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Trans. on Systems, Man. Cybernet, 22(3):418--435.
[18]
T. Zhang, F. Damerau, and D. E. Johnson. 2002. Text chunking based on a generalization of Winnow. Journal of Machine Learning Research, 2:615--637.

Cited By

View all
  • (2019)Named entity recognition with multiple segment representationsInformation Processing and Management: an International Journal10.1016/j.ipm.2013.03.00249:4(954-965)Online publication date: 10-Dec-2019
  • (2011)Developing Position Structure-Based Framework for Chinese Entity Relation ExtractionACM Transactions on Asian Language Information Processing10.1145/2002980.200298410:3(1-22)Online publication date: 1-Sep-2011
  1. Minority vote: at-least-N voting improves recall for extracting relations

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      COLING-ACL '06: Proceedings of the COLING/ACL on Main conference poster sessions
      July 2006
      992 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 17 July 2006

      Qualifiers

      • Article

      Acceptance Rates

      COLING-ACL '06 Paper Acceptance Rate 126 of 126 submissions, 100%;
      Overall Acceptance Rate 1,537 of 1,537 submissions, 100%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)62
      • Downloads (Last 6 weeks)10
      Reflects downloads up to 19 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)Named entity recognition with multiple segment representationsInformation Processing and Management: an International Journal10.1016/j.ipm.2013.03.00249:4(954-965)Online publication date: 10-Dec-2019
      • (2011)Developing Position Structure-Based Framework for Chinese Entity Relation ExtractionACM Transactions on Asian Language Information Processing10.1145/2002980.200298410:3(1-22)Online publication date: 1-Sep-2011

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media