[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2479787.2479807acmotherconferencesArticle/Chapter ViewAbstractPublication PageswimsConference Proceedingsconference-collections
research-article

A genetic graph-based clustering approach to biomedical summarization

Published: 12 June 2013 Publication History

Abstract

Summarization techniques have become increasingly important over the last few years, specially in biomedical research, where information overload is major problem. Researchers of this area need a shorter version of the texts which contains all the important information while discarding irrelevant one. There are several applications which deal with this problem, however, these applications are sometimes less informative than the user needs. This work deals with this problem trying to improve a summarization graph-based process using genetic clustering techniques. Our automatic summaries are compared to those produced by several commercial and research summarizers, and demonstrate the appropriateness of using genetic techniques in automatic summarization.

References

[1]
Medical subject headings. http://www.nlm.nih.gov/mesh/.
[2]
Metamap. http://metamap.nlm.nih.gov/.
[3]
Unified medical language system. http://www.nlm.nih.gov/research/umls/.
[4]
Biomed central corpus, 2012. http://www.biomedcentral.com/about/datamining.
[5]
R. Barzilay and M. Elhadad. Using lexical chains for text summarization. In Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization, pages 10--17, 1997.
[6]
R. Brandow, K. Mitze, and L. Rau. Automatic condensation of electronic publications by sentence selection. Information Processing and Management, 5(31):675--685, 1995.
[7]
Coley. An Introduction to Genetic Algorithms for scientists and engineers. World Scientific Publishing, 1999.
[8]
M. Dehmer, editor. Structural Analysis of Complex Networks. Birkhäuser Publishing, 2010.
[9]
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1--38, 1977.
[10]
H. P. Edmundson. New Methods in Automatic Extracting. Journal of the Association for Computing Machinery, 2(16):264--285, 1969.
[11]
G. Erkan and D. R. Radev. LexRank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research (JAIR), 22:457--479, 2004.
[12]
S. Fortunato, V. Latora, and M. Marchiori. Method to find community structures based on information centrality. Physical Review E (Statistical, Nonlinear, and Soft Matter Physics), 70(5):056104:1--13, 2004.
[13]
S. Geisser. Predictive Inference: An Introduction. Monographs on Statistics and Applied Probability. Chapman & Hall, 1993.
[14]
E. Hartuv and R. Shamir. A clustering algorithm based on graph connectivity. Information Processing Letters, 76(4--6):175--181, 2000.
[15]
E. Hruschka, R. Campello, A. Freitas, and A. de Carvalho. A survey of evolutionary algorithms for clustering. Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, 39(2):133--155, march 2009.
[16]
S. M. Humphrey, W. J. Rogers, H. Kilicoglu, D. Demner-Fushman, and T. C. Rindflesch. Word sense disambiguation by selecting the best semantic type based on journal descriptor indexing: Preliminary experiment. J. Am. Soc. Inf. Sci. Technol., 57(1):96--113, Jan. 2006.
[17]
K. S. Jones and J. R. Galliers. Evaluating Natural Language Processing Systems: An Analysis and Review. Springer-Verlag New York, Inc., Secaucus, NJ, USA, 1996.
[18]
D. T. Larose. Discovering Knowledge in Data. John Wiley & Sons, 2005.
[19]
C. Y. Lin. Looking for a few good metrics: Automatic summarization evaluation - how many samples are enough? In Proceedings of the NTCIR Workshop 4, 2004.
[20]
C. Y. Lin. Rouge: A Package for Automatic Evaluation of Summaries. In M. F. Moens and S. Szpakowicz, editors, Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, pages 74--81, Barcelona, Spain, 2004. Association for Computational Linguistics.
[21]
H. P. Luhn. The Automatic Creation of Literature Abstracts. IBM Journal of Research Development, 2(2):159--165, 1958.
[22]
J. B. Macqueen. Some methods of classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pages 281--297, 1967.
[23]
I. Mani. Summarization evaluation: An overview. In Proceedings of the 2nd NTCIR workshop on research in Chinese and Japonese text retrieval and text summarization. Tokio, Japan: National Institute of Informatics, 2001.
[24]
H. D. Menéndez and D. Camacho. A genetic graph-based clustering algorithm. In H. Yin, J. Costa, and G. Barreto, editors, Intelligent Data Engineering and Automated Learning - IDEAL 2012, volume 7435 of Lecture Notes in Computer Science, pages 216--225. Springer Berlin/Heidelberg, 2012.
[25]
R. Mihalcea and P. Tarau. TextRank - Bringing order into text. In Proceedings of the Conference EMNLP 2004, pages 404--411, 2004.
[26]
M. C. V. Nascimento and A. C. P. L. F. Carvalho. A graph clustering algorithm based on a clustering coefficient for weighted graphs. J. Braz. Comp. Soc., 17(1):19--29, 2011.
[27]
S. Nelson, T. Powell, and B. Humphreys. The unified medical language system (umls) project. Encyclopedia of library and information science., pages 368--378, 2002.
[28]
E. Pitler and A. Nenkova. Revisiting readability: A unified framework for predicting text quality. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 186--195, Honolulu, Hawaii, October 2008. Association for Computational Linguistics.
[29]
L. Plaza, A. Díaz, and P. Gervás. A semantic graph-based approach to biomedical summarisation. Artif. Intell. Med., 53(1):1--14, Sept. 2011.
[30]
D. R. Radev, S. Teufel, H. Saggion, W. Lam, J. Blitzer, H. Qi, A. Çelebi, D. Liu, and E. Drabek. Evaluation challenges in large-scale document summarization. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1, ACL '03, pages 375--382, Stroudsburg, PA, USA, 2003. Association for Computational Linguistics.
[31]
L. Reeve, H. Han, and A. Brooks. The use of domain-specific concepts in biomedical text summarization. Information Processing and Management, 43:1765--1776, 2007.
[32]
S. E. Schaeffer. Graph clustering. Computer Science Review, 1(1):27--64, 2007.
[33]
Y. Shang, Y. Li, H. Lin, and Z. Yang. Enhancing biomedical text summarization using semantic relation extraction. PLoS one, 6(8), 2011.
[34]
Z. Shi, G. Melli, Y. Wang, Y. Liu, B. Gu, M. M. Kashani, A. Sarkar, and F. Popowich. Question answering summarization of multiple biomedical documents. In Proceedings of the Canadian Conference on Artificial Intelligence, pages 284--295, 2007.
[35]
M. SLitvak and M. Last. Graph-based keyword extraction for single-document summarization. In Proceedings of the International Conference on Computational Linguistics, Workshop on Multi-source Multilingual Information Extraction and Summarization, 2008.
[36]
S. Tratz and E. Hovy. Summarization evaluation using transformed basic elements. In In Proceedings of the 1st Text Analysis Conference (TAC, 2008.
[37]
U. von Luxburg. A tutorial on spectral clustering. Statistics and Computing, 17(4):395--416, Dec. 2007.
[38]
D. J. Watts. Small worlds: The dynamics of networks between order and randomness. Princeton University Press, Princeton, NJ, 1999.
[39]
I. Yoo, X. Hu, and I.-Y. Song. A Coherent Graph-Based Semantic Clustering and Summarization Approach for Biomedical Literature and a New Summarization Evaluation Method. BMC Bioinformatics, 8(9):S4, 2007.
[40]
P. Zweigenbaum, D. Demner-Fushman, H. Yu, and K. B. Cohen. Frontiers of biomedical text mining: current progress. Briefings in Bioinformatics, 8(5):358--375, 2007.

Cited By

View all

Index Terms

  1. A genetic graph-based clustering approach to biomedical summarization

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WIMS '13: Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
    June 2013
    408 pages
    ISBN:9781450318501
    DOI:10.1145/2479787
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • UAM: Autonomous University of Madrid

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 June 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. automatic summarization
    2. clustering
    3. genetic algorithms
    4. natural language processing
    5. summarization

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    WIMS '13
    Sponsor:
    • UAM

    Acceptance Rates

    WIMS '13 Paper Acceptance Rate 28 of 72 submissions, 39%;
    Overall Acceptance Rate 140 of 278 submissions, 50%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 12 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)MultiGBSJournal of Biomedical Informatics10.1016/j.jbi.2021.103706116:COnline publication date: 22-Apr-2022
    • (2021)Designing large quantum key distribution networks via medoid-based algorithmsFuture Generation Computer Systems10.1016/j.future.2020.09.037115(814-824)Online publication date: Feb-2021
    • (2019)Word Embedding-Based Biomedical Text SummarizationEmerging Trends in Intelligent Computing and Informatics10.1007/978-3-030-33582-3_28(288-297)Online publication date: 2-Nov-2019
    • (2019)A New Biomedical Text Summarization Method Based on Sentence Clustering and Frequent Itemsets MiningProceedings of the 8th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT’18), Vol.110.1007/978-3-030-21005-2_14(144-152)Online publication date: 11-Jul-2019
    • (2018)Different approaches for identifying important concepts in probabilistic biomedical text summarizationArtificial Intelligence in Medicine10.1016/j.artmed.2017.11.00484(101-116)Online publication date: Jan-2018
    • (2018)Evaluating Different Similarity Measures for Automatic Biomedical Text SummarizationIntelligent Systems Design and Applications10.1007/978-3-319-76348-4_30(305-314)Online publication date: 22-Mar-2018
    • (2014)Combining graph connectivity and genetic clustering to improve biomedical summarization2014 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC.2014.6900370(2740-2747)Online publication date: Jul-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media