Minority vote: at-least-N voting improves recall for extracting relations

Author:

Nanda Kambhatla

COLING-ACL '06: Proceedings of the COLING/ACL on Main conference poster sessions

Pages 460 - 466

Published: 17 July 2006

Abstract

Several NLP tasks are characterized by asymmetric data in which one class label, NONE, signifying the absence of any structure (named entity, coreference, relation, etc.), dominates all other classes. Classifiers built on such data typically have higher precision and lower recall, and tend to overproduce the NONE class. We present a novel scheme for voting among a committee of classifiers that can significantly boost recall in such situations. We demonstrate results showing up to a 16% relative improvement in ACE value on the 2004 ACE relation extraction task for English, Arabic, and Chinese.
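The abstract does not spell out the voting rule, so the following is only a minimal sketch of one plausible reading of "at-least-N" voting: a non-NONE label is emitted whenever at least N committee members agree on it, even if NONE holds the majority. The function names, the tie-breaking behavior, and the `LOCATED_AT` label are assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter

NONE = "NONE"  # label for "no relation"; the dominant class in the abstract


def majority_vote(votes):
    """Baseline: return the label with the most votes, NONE included."""
    return Counter(votes).most_common(1)[0][0]


def at_least_n_vote(votes, n):
    """Assumed rule: return the most-voted non-NONE label if at least
    n committee members chose it; otherwise fall back to NONE."""
    counts = Counter(v for v in votes if v != NONE)
    if not counts:
        return NONE
    label, count = counts.most_common(1)[0]
    return label if count >= n else NONE


# Hypothetical committee of five classifiers voting on one candidate pair.
votes = [NONE, NONE, NONE, "LOCATED_AT", "LOCATED_AT"]

print(majority_vote(votes))       # NONE
print(at_least_n_vote(votes, 2))  # LOCATED_AT
print(at_least_n_vote(votes, 3))  # NONE
```

Under this reading, lowering N trades precision for recall: a minority of the committee is enough to override the NONE majority, which is the situation the abstract says plain classifiers handle poorly.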

Cited By

Cho H, Okazaki N, Miwa M, Tsujii J (2019). Named entity recognition with multiple segment representations. Information Processing and Management: an International Journal, 49:4 (954-965). DOI: 10.1016/j.ipm.2013.03.002. Online publication date: 10-Dec-2019.
https://dl.acm.org/doi/10.1016/j.ipm.2013.03.002

Zhang P, Li W, Hou Y, Song D (2011). Developing Position Structure-Based Framework for Chinese Entity Relation Extraction. ACM Transactions on Asian Language Information Processing, 10:3 (1-22). DOI: 10.1145/2002980.2002984. Online publication date: 1-Sep-2011.
https://dl.acm.org/doi/10.1145/2002980.2002984

Minority vote: at-least-N voting improves recall for extracting relations
1. Computing methodologies
  1. Artificial intelligence
2. Hardware
  1. Power and energy
    1. Power estimation and optimization

Information & Contributors

Information

Published In

COLING-ACL '06: Proceedings of the COLING/ACL on Main conference poster sessions

July 2006

992 pages

Publisher

Association for Computational Linguistics

United States

Qualifiers

Article

Acceptance Rates

COLING-ACL '06 Paper Acceptance Rate 126 of 126 submissions, 100%;

Overall Acceptance Rate 1,537 of 1,537 submissions, 100%

Article Metrics

Total Citations: 2
Total Downloads: 305
Downloads (Last 12 months): 62
Downloads (Last 6 weeks): 10

Reflects downloads up to 19 Dec 2024
