More Web Proxy on the site http://driver.im/

research-article

A genetic graph-based clustering approach to biomedical summarization

Authors:

Héctor D. Menéndez,

David CamachoAuthors Info & Claims

WIMS '13: Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics

Article No.: 10, Pages 1 - 8

https://doi.org/10.1145/2479787.2479807

Published: 12 June 2013 Publication History

Abstract

Summarization techniques have become increasingly important over the last few years, specially in biomedical research, where information overload is major problem. Researchers of this area need a shorter version of the texts which contains all the important information while discarding irrelevant one. There are several applications which deal with this problem, however, these applications are sometimes less informative than the user needs. This work deals with this problem trying to improve a summarization graph-based process using genetic clustering techniques. Our automatic summaries are compared to those produced by several commercial and research summarizers, and demonstrate the appropriateness of using genetic techniques in automatic summarization.

References

[1]

Medical subject headings. http://www.nlm.nih.gov/mesh/.

[2]

Metamap. http://metamap.nlm.nih.gov/.

[3]

Unified medical language system. http://www.nlm.nih.gov/research/umls/.

[4]

Biomed central corpus, 2012. http://www.biomedcentral.com/about/datamining.

[5]

R. Barzilay and M. Elhadad. Using lexical chains for text summarization. In Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization, pages 10--17, 1997.

[6]

R. Brandow, K. Mitze, and L. Rau. Automatic condensation of electronic publications by sentence selection. Information Processing and Management, 5(31):675--685, 1995.

Digital Library

[7]

Coley. An Introduction to Genetic Algorithms for scientists and engineers. World Scientific Publishing, 1999.

Digital Library

[8]

M. Dehmer, editor. Structural Analysis of Complex Networks. Birkhäuser Publishing, 2010.

Digital Library

[9]

A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1--38, 1977.

[10]

H. P. Edmundson. New Methods in Automatic Extracting. Journal of the Association for Computing Machinery, 2(16):264--285, 1969.

Digital Library

[11]

G. Erkan and D. R. Radev. LexRank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research (JAIR), 22:457--479, 2004.

Digital Library

[12]

S. Fortunato, V. Latora, and M. Marchiori. Method to find community structures based on information centrality. Physical Review E (Statistical, Nonlinear, and Soft Matter Physics), 70(5):056104:1--13, 2004.

[13]

S. Geisser. Predictive Inference: An Introduction. Monographs on Statistics and Applied Probability. Chapman & Hall, 1993.

[14]

E. Hartuv and R. Shamir. A clustering algorithm based on graph connectivity. Information Processing Letters, 76(4--6):175--181, 2000.

Digital Library

[15]

E. Hruschka, R. Campello, A. Freitas, and A. de Carvalho. A survey of evolutionary algorithms for clustering. Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, 39(2):133--155, march 2009.

Digital Library

[16]

S. M. Humphrey, W. J. Rogers, H. Kilicoglu, D. Demner-Fushman, and T. C. Rindflesch. Word sense disambiguation by selecting the best semantic type based on journal descriptor indexing: Preliminary experiment. J. Am. Soc. Inf. Sci. Technol., 57(1):96--113, Jan. 2006.

Digital Library

[17]

K. S. Jones and J. R. Galliers. Evaluating Natural Language Processing Systems: An Analysis and Review. Springer-Verlag New York, Inc., Secaucus, NJ, USA, 1996.

Digital Library

[18]

D. T. Larose. Discovering Knowledge in Data. John Wiley & Sons, 2005.

[19]

C. Y. Lin. Looking for a few good metrics: Automatic summarization evaluation - how many samples are enough? In Proceedings of the NTCIR Workshop 4, 2004.

[20]

C. Y. Lin. Rouge: A Package for Automatic Evaluation of Summaries. In M. F. Moens and S. Szpakowicz, editors, Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, pages 74--81, Barcelona, Spain, 2004. Association for Computational Linguistics.

[21]

H. P. Luhn. The Automatic Creation of Literature Abstracts. IBM Journal of Research Development, 2(2):159--165, 1958.

Digital Library

[22]

J. B. Macqueen. Some methods of classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pages 281--297, 1967.

[23]

I. Mani. Summarization evaluation: An overview. In Proceedings of the 2nd NTCIR workshop on research in Chinese and Japonese text retrieval and text summarization. Tokio, Japan: National Institute of Informatics, 2001.

[24]

H. D. Menéndez and D. Camacho. A genetic graph-based clustering algorithm. In H. Yin, J. Costa, and G. Barreto, editors, Intelligent Data Engineering and Automated Learning - IDEAL 2012, volume 7435 of Lecture Notes in Computer Science, pages 216--225. Springer Berlin/Heidelberg, 2012.

Digital Library

[25]

R. Mihalcea and P. Tarau. TextRank - Bringing order into text. In Proceedings of the Conference EMNLP 2004, pages 404--411, 2004.

[26]

M. C. V. Nascimento and A. C. P. L. F. Carvalho. A graph clustering algorithm based on a clustering coefficient for weighted graphs. J. Braz. Comp. Soc., 17(1):19--29, 2011.

[27]

S. Nelson, T. Powell, and B. Humphreys. The unified medical language system (umls) project. Encyclopedia of library and information science., pages 368--378, 2002.

[28]

E. Pitler and A. Nenkova. Revisiting readability: A unified framework for predicting text quality. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 186--195, Honolulu, Hawaii, October 2008. Association for Computational Linguistics.

Digital Library

[29]

L. Plaza, A. Díaz, and P. Gervás. A semantic graph-based approach to biomedical summarisation. Artif. Intell. Med., 53(1):1--14, Sept. 2011.

Digital Library

[30]

D. R. Radev, S. Teufel, H. Saggion, W. Lam, J. Blitzer, H. Qi, A. Çelebi, D. Liu, and E. Drabek. Evaluation challenges in large-scale document summarization. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1, ACL '03, pages 375--382, Stroudsburg, PA, USA, 2003. Association for Computational Linguistics.

Digital Library

[31]

L. Reeve, H. Han, and A. Brooks. The use of domain-specific concepts in biomedical text summarization. Information Processing and Management, 43:1765--1776, 2007.

Digital Library

[32]

S. E. Schaeffer. Graph clustering. Computer Science Review, 1(1):27--64, 2007.

Digital Library

[33]

Y. Shang, Y. Li, H. Lin, and Z. Yang. Enhancing biomedical text summarization using semantic relation extraction. PLoS one, 6(8), 2011.

[34]

Z. Shi, G. Melli, Y. Wang, Y. Liu, B. Gu, M. M. Kashani, A. Sarkar, and F. Popowich. Question answering summarization of multiple biomedical documents. In Proceedings of the Canadian Conference on Artificial Intelligence, pages 284--295, 2007.

Digital Library

[35]

M. SLitvak and M. Last. Graph-based keyword extraction for single-document summarization. In Proceedings of the International Conference on Computational Linguistics, Workshop on Multi-source Multilingual Information Extraction and Summarization, 2008.

Digital Library

[36]

S. Tratz and E. Hovy. Summarization evaluation using transformed basic elements. In In Proceedings of the 1st Text Analysis Conference (TAC, 2008.

[37]

U. von Luxburg. A tutorial on spectral clustering. Statistics and Computing, 17(4):395--416, Dec. 2007.

Digital Library

[38]

D. J. Watts. Small worlds: The dynamics of networks between order and randomness. Princeton University Press, Princeton, NJ, 1999.

Digital Library

[39]

I. Yoo, X. Hu, and I.-Y. Song. A Coherent Graph-Based Semantic Clustering and Summarization Approach for Biomedical Literature and a New Summarization Evaluation Method. BMC Bioinformatics, 8(9):S4, 2007.

[40]

P. Zweigenbaum, D. Demner-Fushman, H. Yu, and K. B. Cohen. Frontiers of biomedical text mining: current progress. Briefings in Bioinformatics, 8(5):358--375, 2007.

Cited By

Menéndez HDakhama A(2025)Automatic Summarization Evaluation: Methods and PracticesTesting Software and Systems10.1007/978-3-031-80889-0_12(169-181)Online publication date: 25-Jan-2025
https://doi.org/10.1007/978-3-031-80889-0_12
Davoodijam EGhadiri NLotfi Shahreza MRinaldi F(2022)MultiGBSJournal of Biomedical Informatics10.1016/j.jbi.2021.103706116:COnline publication date: 22-Apr-2022
https://dl.acm.org/doi/10.1016/j.jbi.2021.103706
García-Cobo IMenéndez H(2021)Designing large quantum key distribution networks via medoid-based algorithmsFuture Generation Computer Systems10.1016/j.future.2020.09.037115(814-824)Online publication date: Feb-2021
https://doi.org/10.1016/j.future.2020.09.037
Show More Cited By

Index Terms

A genetic graph-based clustering approach to biomedical summarization
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

An Ontology-Based Approach to Text Summarization
WI-IAT '08: Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03

Extractive text summarization aims to create a condensed version of one or more source documents by selecting the most informative sentences. Research in text summarization has therefore often focused on measures of the usefulness of sentences for a ...
Extractive text summarization using clustering-based topic modeling
Abstract
Text summarization is the process of converting the input document into a short form, provided that it preserves the overall meaning associated with it. Primarily, text summarization is achieved in two ways, i.e., abstractive and extractive. ...
Deep Learning-Based Abstractive Summarization for Brazilian Portuguese Texts
Intelligent Systems
Abstract
Automatic summarization captures the most relevant information and condenses it into an understandable text in natural language. Such a task can be classified as either extractive or abstractive summarization. Research on Brazilian Portuguese-...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WIMS '13: Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics

June 2013

408 pages

ISBN:9781450318501

DOI:10.1145/2479787

Conference Chair:
David Camacho
Autonomous University of Madrid, Spain
,
Program Chairs:
Rajendra Akerkar
Western Norway Research Institute, Norway
,
Maria D. Rodriguez Moreno
University of Alcalá, Spain

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

UAM: Autonomous University of Madrid

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 June 2013

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Ministerio de Educación, Cultura y Deporte

Conference

WIMS '13

Sponsor:

UAM

WIMS '13: 3rd International Conference on Web Intelligence, Mining and Semantics

June 12 - 14, 2013

Madrid, Spain

Acceptance Rates

WIMS '13 Paper Acceptance Rate 28 of 72 submissions, 39%;

Overall Acceptance Rate 140 of 278 submissions, 50%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

8
Total Citations
View Citations
158
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Menéndez HDakhama A(2025)Automatic Summarization Evaluation: Methods and PracticesTesting Software and Systems10.1007/978-3-031-80889-0_12(169-181)Online publication date: 25-Jan-2025
https://doi.org/10.1007/978-3-031-80889-0_12
Davoodijam EGhadiri NLotfi Shahreza MRinaldi F(2022)MultiGBSJournal of Biomedical Informatics10.1016/j.jbi.2021.103706116:COnline publication date: 22-Apr-2022
https://dl.acm.org/doi/10.1016/j.jbi.2021.103706
García-Cobo IMenéndez H(2021)Designing large quantum key distribution networks via medoid-based algorithmsFuture Generation Computer Systems10.1016/j.future.2020.09.037115(814-824)Online publication date: Feb-2021
https://doi.org/10.1016/j.future.2020.09.037
Rouane OBelhadef HBouakkaz M(2019)Word Embedding-Based Biomedical Text SummarizationEmerging Trends in Intelligent Computing and Informatics10.1007/978-3-030-33582-3_28(288-297)Online publication date: 2-Nov-2019
https://doi.org/10.1007/978-3-030-33582-3_28
Rouane OBelhadef HBouakkaz M(2019)A New Biomedical Text Summarization Method Based on Sentence Clustering and Frequent Itemsets MiningProceedings of the 8th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT’18), Vol.110.1007/978-3-030-21005-2_14(144-152)Online publication date: 11-Jul-2019
https://doi.org/10.1007/978-3-030-21005-2_14
Moradi MGhadiri N(2018)Different approaches for identifying important concepts in probabilistic biomedical text summarizationArtificial Intelligence in Medicine10.1016/j.artmed.2017.11.00484(101-116)Online publication date: Jan-2018
https://doi.org/10.1016/j.artmed.2017.11.004
Nasr Azadani MGhadiri N(2018)Evaluating Different Similarity Measures for Automatic Biomedical Text SummarizationIntelligent Systems Design and Applications10.1007/978-3-319-76348-4_30(305-314)Online publication date: 22-Mar-2018
https://doi.org/10.1007/978-3-319-76348-4_30
Menendez HPlaza LCamacho D(2014)Combining graph connectivity and genetic clustering to improve biomedical summarization2014 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC.2014.6900370(2740-2747)Online publication date: Jul-2014
https://doi.org/10.1109/CEC.2014.6900370

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten