Abstract
In vector space model, a document is represented by words. As the new words appear dramatically in the Internet era, this kind of method draws back the IR systems performance. This paper puts forward a new approach to present the concepts, query expressions, and documents based on the ontology. The approach has two levels, the Word-Concept level and the Concept-Document level. In the first level, the transition probability matrix is constructed by using the appearing times of word-word pairs in documents. The biggest eigenvector of matrix is computed, and it reflects the importance of words to the concept. In the second level, the distance matrix is constructed by using the distance between words in a given ontology, and the average variance value of elements is computed. It reflects the relevance of documents to concepts. In the last section, the query expansion is discussed by using the personal information profile of the user. It is proofed to be more effective than previous one.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kara, S., Alan, O., Sabuncu, O., Akpınar, S., Cicekli, N.K., Alpaslan, F.N.: An ontology-based retrieval system using semantic indexing. Information Systems 37(4), 294–305 (2012)
Kang, X., Li, D., Wang, S.: Research on domain ontology in different granulations based on concept lattice. Knowledge-Based Systems 27, 152–161 (2012)
Dragoni, M., da Costa Pereira, C., Tettamanzi, A.G.: A conceptual representation of documents and queries for information retrieval systems by using light ontologies. Expert Systems with Applications (2012) 10.1016/j.eswa.2012.01.188
Qu, S., Wang, S., Zou, Y.: Improvement of text feature selection method based on tfidf. In: International Seminar on Future Information Technology and Management Engineering, FITME 2008, pp. 79–81 (November 2008)
Kayed, A., Colomb, R.M.: Using ontologies to index conceptual structures for tendering automation. In: Proceedings of the 13th Australasian Database Conference, ADC 2002, vol. 5, pp. 95–101. Australian Computer Society, Inc., Darlinghurst (2002)
Kim, M.-C., Choi, K.-S.: A comparison of collocation-based similarity measures in query expansion. Information Processing and Management 35(1), 19–30 (1999)
Efthimiadis, E.N.: Query expansion. Annual Review of Information Science and Technology 31, 121–187 (1996)
Cronen-townsend, S., Zhou, Y., Croft, W.B.: A framework for selective query expansion. In: Proceedings of 13th International Conference on Information and Knowledge Management, CIKM 2004, pp. 236–237. ACM (2004)
Gardiner, C.: Stochastic Methods: A Handbook for the Natural and Social Sciences. Springer Series in Synergetics. Springer (2009)
Mian, R., Khan, S.: Markov Chain. VDM Verlag Dr. Muller (2010)
Serre, D.: Matrices: theory and applications. Graduate texts in mathematics. Springer (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, H., Guo, Y., Shi, X., Yang, F. (2012). Conceptual Representing of Documents and Query Expansion Based on Ontology. In: Wang, F.L., Lei, J., Gong, Z., Luo, X. (eds) Web Information Systems and Mining. WISM 2012. Lecture Notes in Computer Science, vol 7529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33469-6_61
Download citation
DOI: https://doi.org/10.1007/978-3-642-33469-6_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33468-9
Online ISBN: 978-3-642-33469-6
eBook Packages: Computer ScienceComputer Science (R0)