[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Disambiguation of semantic types in complex noun phrases for extracting candidate terms

Published: 01 July 2015 Publication History

Abstract

Mapping concepts from medical resources to structured medical documents is a prerequisite for many automatic document processing tasks. These resources are characterised by an abundance of material to represent any given concept. Moreover, the resources may include ambiguous terms in unstructured form that lead to distorted results in automating biomedical text mining. This paper is an exploratory study on disambiguation of semantic types for extracting a structured taxonomy from unstructured reports. Specifically, the terms that will be disambiguated are terms that have more than one semantic type in the Unified Medical Language System UMLS Metathesaurus. We suggest a word sense disambiguation algorithm that utilises the UMLS is-a hierarchy, augmented with a higher level representing semantic groups, as a knowledge base. The purpose is to explore all possible commonalities to classify simple or composed candidate terms with the Nearest Common Kinship NCK. Experiments with the training corpora provide encouraging results.

References

[1]
Basile, P., Caputo, A. and Semeraro, G. (2014) 'An enhanced Lesk word sense disambiguation algorithm through a distributional semantic model', COLING 2014: Proceedings of the 25th International Conference on Computational Linguistics, Dublin, Ireland, pp.1591-1600.
[2]
Basile, P., de Gemmis, M., Gentile, A.L., Lops, P. and Semeraro, G. (2007) 'UNIBA: JIGSAW algorithm for word sense disambiguation', Proceedings of the 4th International Workshop on Semantic Evaluations, Stroudsburg, PA, USA, pp.398-401.
[3]
Bentounsi, I. and Boufaïda, Z. (2013) 'Extracting candidate terms from medical texts', ACS/IEEE-AICCSA 2013: Proceedings of the International Conference on Computer Systems and Applications, Fes/Ifrane, Morocco, pp.1-4.
[4]
El-Rab, W.G., Zaïane, O.R. and El-Hajj, M. (2013) 'Biomedical text disambiguation using UMLS', ASONAM 2013: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, IEEE and ACM SIGKDD, Niagara Falls, Canada, pp.943-947.
[5]
Jiang, J. and Conrath, D.W. (1997) 'Semantic similarity based on corpus statistics and lexical taxonomy', ROCLING 1997: Proceedings of the International Conference on Research in Computational Linguistics, Taipei, Taiwan, pp.19-33.
[6]
Leacock, C. and Chodorow, M. (1998) 'Combining local context and WordNet similarity for word sense identification', Journal of WordNet: An Electronic Lexical Database, Vol. 49 No. 2, pp.265-283.
[7]
Lee, J.H., Kim, M.H. and Lee, Y.J. (1993) 'Information retrieval based on conceptual distance in is-a hierarchies', Journal of Documentation, Vol. 49, No. 2, pp.188-207.
[8]
Lin, D. (1998) 'An information-theoretic definition of similarity', ICML 1998: Proceedings of the 15th International Conference on Machine Learning, International Machine Learning Society, Madison, Wisconsin, USA, pp.296-304.
[9]
Lesk, M.E. (1986) 'Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone', SIGDOC 1986: Proceedings of the 5th Annual International Conference on Systems Documentation, Association for Computer Machinery, Toronto, Canada, pp.24-26.
[10]
McInnes, B. (2008) 'An unsupervised vector approach to biomedical term disambiguation: integrating UMLS and Medline', ACL-08 (Student Research Workshop): Proceedings of the Association for Computational Linguistics, Association for Computational Linguistics, Columbus, Ohio, USA, pp.49-54.
[11]
Mervis, C. and Rosch, E. (1981) 'Categorization of natural objects', Journal of Annual Review of Psychology, Vol. 32 No. 1, pp.89-113.
[12]
Navigli, R. (2009) 'Word sense disambiguation: a survey', Journal of ACM Computing Surveys, Vol. 41, No. 2, pp.1-69.
[13]
Palomar, M., Saiz-Noeda, M., Muñoz, R., Suárez, A., Martínez-Barco, P. and Montoyo, A. (2001) 'PHORA: a NLP system for Spanish', CICLing 2001: Proceedings of the Computational Linguistics and Intelligent Text Processing, Mexico City, Mexico, pp.126-139.
[14]
Rada, R., Mili, H., Bicknell, E. and Blettner, M. (1989) 'Development and application of a metric on semantic nets', Journal of IEEE Transactions on Systems, Man, and Cybernetics, Vol. 19, No. 1, pp.17-30.
[15]
Resnik, P. (1995) 'Using information content to evaluate semantic similarity in a taxonomy', IJCAI 1995: Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, Canada, pp.448-453.
[16]
Sussna, M. (1993) 'Word sense disambiguation for free-text indexing using a massive semantic network', CIKM-93: Proceedings of the 2nd International Conference on Information and Knowledge Management, Association for Computing Machinery, Arlington, Virginia, USA, pp.67-74.
[17]
Wu, Z. and Palmer, M. (1994) 'Verbs semantics and lexical selection', ACL'94: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, USA, pp.133-138.

Cited By

View all
  • (2022)Named entity disambiguation in short texts over knowledge graphsKnowledge and Information Systems10.1007/s10115-021-01642-964:2(325-351)Online publication date: 1-Feb-2022
  1. Disambiguation of semantic types in complex noun phrases for extracting candidate terms

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image International Journal of Metadata, Semantics and Ontologies
    International Journal of Metadata, Semantics and Ontologies  Volume 10, Issue 2
    July 2015
    79 pages
    ISSN:1744-2621
    EISSN:1744-263X
    Issue’s Table of Contents

    Publisher

    Inderscience Publishers

    Geneva 15, Switzerland

    Publication History

    Published: 01 July 2015

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Named entity disambiguation in short texts over knowledge graphsKnowledge and Information Systems10.1007/s10115-021-01642-964:2(325-351)Online publication date: 1-Feb-2022

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media