Abstract
It is fundamental and important task to extract keyword form documents. Existing methods mainly use statistical or linguistic information to extract the most salient keyword from document. However, those methods ignore the relationship between different granularities (i.e., relationship between word, sentence, and topic). In order to capture and make better use of their relationships between these granularities, this paper proposed an iterative approach for keyword extraction. The method is first implemented by constructing a graph which reflect relationship between different size of granularity nodes, and then using iterative algorithm to calculate score of keywords. Finally, highest score of words in the document will be chosen as keywords. Experimental results show that our approach outperforms baseline methods.
This paper is funded by College science and technology projects in Shanxi Province china under grant 20110015.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Turney, P.D.: Learning algorithms for keyphrase extraction. Information Retrieval 2, 303–336 (2000)
Barker, K., Cornacchia, N.: Using nounphrase heads to extract document keyphrases. In: Canadian Conference on AI (2000)
Frank, E., Paynter, G.W., Witten, I.H., Gutwin, C., Nevill-Manning, C.G.: Domain-specific keyphrase extraction. In: Proceedings of IJCAI 1999, pp. 668–673 (1999)
Mihalcea, R., Tarau, P.: TextRank: Bringing order into texts. In: Proceedings of EMNL 2004 (2004)
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Libraries (1998)
Muñoz, A.: Compound key word generation from document databases using a hierarchical clustering ART model. Intelligent Data Analysis 1(1) (1996)
Pedersen, T., Patwardhan, S., Michelizzi, J.: WordNet:Similarity – Measuring the relatedness of concepts. In: Proceedings of AAAI 2004 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wei, Y. (2012). An Iterative Approach to Keywords Extraction. In: Tan, Y., Shi, Y., Ji, Z. (eds) Advances in Swarm Intelligence. ICSI 2012. Lecture Notes in Computer Science, vol 7332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31020-1_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-31020-1_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31019-5
Online ISBN: 978-3-642-31020-1
eBook Packages: Computer ScienceComputer Science (R0)