Abstract
In this paper, we study how to better merge a WordNet-like ontology with an online encyclopedia. We first eliminate the noises with some heuristic rules, and then adopt a domain-dependent strategy to trim the encyclopedia structure. Finally, we integrate entities from the trimmed structure into the original ontology, and construct a refinedly-enriched ontology. The experimental results show that this ontology can achieve better performance than the original version as well as a coarsely-enriched version constructed without pruning and trimming.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Anderka, M., Stein, B., Lipka, N.: Predicting quality flaws in user-generated content: The case of wikipedia. In: Proc. of SIGIR 2012, pp. 981–990 (2012)
Artola, X., Soroa, A.: Elhisa: An architecture for the integration of heterogeneous lexical information. Natural Language Engineering 14(2) (April 2008)
Bentivogli, L., Pianta, E.: Extending wordnet with syntagmatic information. In: Proc. of GWC 2004, pp. 47–53 (2004)
Borodin, A., Roberts, G.O., Rosenthal, J.S., Tsaparas, P.: Finding authorities and hubs from link structures on the world wide web. In: Proc. of WWW 2001, pp. 415–429 (2001)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001)
Fellbaum, C.: WordNet: An Electronic Lexical Database. The MIT Press, Cambridge (1998)
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proc. of COLING 1992, pp. 539–545 (1992)
Jiang, S., Bing, L., Sun, B., Zhang, Y., Lam, W.: Ontology enhancement and concept granularity learning: Keeping yourself current and adaptive. In: Proc. of KDD 2011, pp. 1244–1252 (2011)
Kamps, J., Koolen, M.: Is wikipedia link structure different? In: Proc. of WSDM 2009 (2009)
Kozareva, Z., Hovy, E.: A semi-supervised method to learn and construct taxonomies using the web. In: Proc. of EMNLP 2010, pp. 1110–1118 (2010)
Liu, Y., Yu, S., Yu, J.: Building a bilingual wordnet-like lexicon: the new approach and algorithms. In: Proc. of COLING 2002 (2002)
Mandhani, B., Soderland, S.: Exploiting hyponymy in extracting relations and enhancing ontologies. In: Proc. of WIIAT 2008, pp. 325–329 (2008)
McCrae, J., Spohr, D., Cimiano, P.: Linking lexical resources and ontologies on the semantic web with lemon. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011, Part I. LNCS, vol. 6643, pp. 245–259. Springer, Heidelberg (2011)
McGuinness, D., Fikes, R., Rice, J., Wilder, S.: An environment for merging and testing large ontologies. In: Proc. of KR 2000 (2000)
Melo, G.D., Weikum, G.: Towards a universal wordnet by learning from combined evidence. In: Proc. of CIKM 2009, pp. 513–522 (2009)
Morato, J.Á., Marzal, M., Lloréns, J., Moreiro, J.: Wordnet applications. In: Proc. of GWC 2004 (2004)
Navigli, R., Crisafulli, G.: Inducing word senses to improve web search result clustering. In: Proc. of EMNLP 2010, pp. 116–126 (2010)
Navigli, R., Ponzetto, S.P.: Babelnet: Building a very large multilingual semantic network. In: Proc. of ACL 2010, pp. 216–225 (2010)
Navigli, R., Velardi, P., Cucchiarelli, A., Neri, F.: Extending and enriching wordnet with ontolearn. In: Proc. of GWC 2004, pp. 279–284 (2004)
Pociello, E., Agirre, E., Aldezabal, I.: Methodology and construction of the basque wordnet. Language Resources and Evaluation 45(2), 121–142 (2011)
Ponzetto, S.P., Navigli, R.: Large-scale taxonomy mapping for restructuring and integrating wikipedia. In: Proc. of IJCAI 2009, pp. 2083–2088 (2009)
Ramírez, J., Asahara, M., Matsumoto, Y.: Japanese-spanish thesaurus construction using english as a pivot. In: Proc. of IJCNLP 2008, pp. 473–480 (2008)
Shi, L., Mihalcea, R.: Putting pieces together: Combining framenet, verbnet and wordnet for robust semantic parsing. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 100–111. Springer, Heidelberg (2005)
Subramaniam, L.V., Nanavati, A.A., Mukherjea, S.: Enriching one taxonomy using another. IEEE TKDE 22(10) (October 2010)
Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: Proc. of WWW 2007, pp. 697–706 (2007)
Veale, T., Hao, Y.: Enriching wordnet with folk knowledge and stereotypes. In: Proc. of GWC 2008 (2008)
Wang, P., Hu, J., Zeng, H., Chen, Z.: Using wikipedia knowledge to improve text classification. Knowledge and Information Systems 19(3), 265–281 (2009)
Wu, F., Weld, D.S.: Automatically refining the wikipedia infobox ontology. In: Proc. of WWW 2008, pp. 635–644 (2008)
Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proc. of SIGIR 1999, pp. 42–49 (1999)
Yu, J., Yu, S., Liu, Y., Zhang, H.: Introduction to chineses concept dictionary. In: Proc. of ICCC 2001, pp. 361–366 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, S., Nian, J., Zhao, S., Zhang, Y. (2013). Small Is Powerful! Towards a Refinedly Enriched Ontology by Careful Pruning and Trimming. In: Motoda, H., Wu, Z., Cao, L., Zaiane, O., Yao, M., Wang, W. (eds) Advanced Data Mining and Applications. ADMA 2013. Lecture Notes in Computer Science(), vol 8346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-53914-5_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-53914-5_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-53913-8
Online ISBN: 978-3-642-53914-5
eBook Packages: Computer ScienceComputer Science (R0)