[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3018009.3018054acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccipConference Proceedingsconference-collections
research-article

Exploiting collaborative learning for concept extraction in the medical field

Published: 26 November 2016 Publication History

Abstract

With the increasing interests of second use of medical data, concept extraction in Electronic Medical Records has drawn more and more scholars' attention. Owing to the artificial data annotation task is labor intensive, the method of concept extraction is mainly to use the fully labeled documents as training data in order to build a concept instance identifier. However, in many cases, the available training data are sparse labeling. This fact makes the performance of the constructed classifier is poor. Existing methods for extracting concepts either considered the diversity of datasets or considered the various learning models. Therefore, this paper proposes a novel approach to improve the performance of concept extraction from electronic medical records by combining the diversity of datasets with the various learning models. The large sparsely labeled dataset is split into multiple subsets. Then the different subsets are trained by different learning models, such as HMM, MEMM, and CRF, in an iterative way. Our technique leverages off the fact that different learning algorithms have different inductive biases and that better predictions can be made by the voted majority.

References

[1]
Yang, J., Li, J., Mulder, J., Wang, Y., and Wang, Q. 2015. Emerging technologies for enhanced healthcare. Computer in Industry. 69, 1, 3--11.
[2]
Fridsma, D. 2012. Electronic health records: The HHS perspective. Computer. 45, 11, 24--26. http://dx.doi.org/10.1109/MC.2012.371
[3]
Hersh, W. R. Information Retrieval: A Health and Biomedical Perspective. 3rd ed. Springer, 2009.
[4]
Uzuner, O., South, B. R., Shen, S., et al. 2011. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. Journal of the America Medical Informatics Association. 18, 5, 552--556.
[5]
J. Li, F. Wang. February 2016. SemiSupervised learning via mean field methods. Neurocomputing. 177, 12, 385--393.
[6]
Jonnalagadda, S., Cohen, T., Wu, S., et al. 2012. Enhancing clinical concept extraction with distributional semantics. Journal of Biomedical Informatics. 45, 1, 129--140.
[7]
Roberts, K., Rink, B., and Harabagiu. S. 2010. Extraction of medical concepts, assertions, and relations from discharge summaries for the fourth i2b2/VA shared task. In Proceedings of the 2010 i2b2/VA Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA.
[8]
Bruijn, B. D., Cherry, C., kiritchenko, S., et al. 2011. Machine-learned solutions for three stages of clinical information extraction: The state of the art at i2b2 2010. Journal of the American Medical Informatics Association. 18, 5, 777--562.
[9]
McCallum, A., Freitag, D., and Pereira, F. 2000. Maximum entropy Markov models for information extraction and segmentation. Proceedings of the Seventeenth International Conference on Machine Learning. 951, 591--598.
[10]
Sobel, A. E. K. 2012. The move toward electronic health records. Computer. 45, 11, 22--23.
[11]
Li, J., et al. 2014. Diversity-aware retrieval of medical records. Computer. Industry. http://dx.doi.org/10.1016/j.compind.09.004
[12]
Shen, W., Doan, A. H., Naughton, J. F. et al. 2007. Declarative information extraction using datalog with embedded extraction predicates. In Proceedings of the 33rd International Conference on Very Large Data Bases. VLDB Endowment, 1033--1044.
[13]
Abdallah, S., Shaalan, K., and Shoaib, M. 2012. Integrating rule-based system with classification for Arabic named entity recognition. In Computational Linguistics and Intelligent Text Processing. Springer Berlin Heidelberg, 311--322.
[14]
Hui, N. I. E. 2012. Person-specific named entity recognition using SVM with rich feature sets. Chinese Journal of Library and Information Science. 5, 3.
[15]
Morwal, S. and Jahan, N. 2013. Named entity recognition using Hidden Markov Model (HMM): An experimental result on Hindi, Urdu and Marathi Languages. International Journal. 3, 4.
[16]
Nabende, P., Tiedemann, J., and Nerbonne, J. 2010. Pair hidden Markov model for named entity matching. In Innovations and Advances in Computer Sciences and Engineering. Springer Netherlands, 497--502.
[17]
Li, J., Yang, J., Liu, C., Liu, B., Zhao, Y., and Shi, Y. 2014. Exploiting semantic linkages among multiple sources for semantic information retrieval. Enterprise Information Systems. 8, 3, 464--489.
[18]
Sutton, C. and McCallum, A. 2010. An introduction to conditional random fields. arXiv preprint arXiv: 1011.4088.
[19]
Li, J. and Liu, C. 2012. A cooperative co-learning approach for concept detection in documents. In Proceedings of ICSC. IEEE, P310--317.
[20]
Zhou, Y. and Goldman, S. 2004. Democratic co-learning. In Proceedings of ICTAL. IEEE 1082--3409/04.

Index Terms

  1. Exploiting collaborative learning for concept extraction in the medical field

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCIP '16: Proceedings of the 2nd International Conference on Communication and Information Processing
    November 2016
    272 pages
    ISBN:9781450348195
    DOI:10.1145/3018009
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 November 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. classifiers combination
    2. concept extraction
    3. electronic medical records
    4. entity recognition

    Qualifiers

    • Research-article

    Conference

    ICCIP '16

    Acceptance Rates

    Overall Acceptance Rate 61 of 301 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 82
      Total Downloads
    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 15 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media