Abstract
Developing linguistically data-compliant rules for entity extraction is usually an intensive and time-consuming process for any ontology engineer. Thus, an automated mechanism to convert textual data into ontology instances (Ontology Population) may be crucial. In this context, this paper presents an inductive logic programming-based method that induces rules for extracting instances of various entity classes. This method uses two sources of evidence: domain-independent linguistic patterns for identifying candidates of class instances, and a WordNet semantic similarity measure. These two evidences are integrated as background knowledge to automatically generate extractions rules by a generic inductive logic programming system. Some experiments were conducted on the class instance classification problem with encouraging results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bing Search Engine API. API Basics, http://www.bing.com/developers/s/APIBasics.html
Cimiano, P.: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications. Springer, New York (2006)
De Raedt, L.: Inductive Logic Programming. In: Encyclopedia of Machine Learning, pp. 529–537 (2010)
Downey, D., et al.: Learning Text Patterns for Web Information Extraction and Assessment. In: Proceedings of the 19th National Conference on Artificial Intelligence Workshop on Adaptive Text Extraction and Mining, San Jose, USA (2004)
Etzioni, O., et al.: Web-Scale Information Extraction in KnowItAll. In: Proc. of the 13th International World Wide Web Conference (WWW 2004), New York, USA, pp. 100–110 (2004)
Finn, A.: A Multi-Level Boundary Classification Approach to Information Extraction. Phd thesis, University College Dublin (2006)
Lavrac, N., Dzeroski, S.: Inductive Logic Programming: Techniques and Applications. Ellis Horwood, New York (1994)
Miller, G.A.: WordNet: a Lexical Database for English. Communications of the ACM 38(11), 39–41 (1995)
Patel, A., Ramakrishnan, G., Bhattacharya, P.: Incorporating Linguistic Expertise Using ILP for Named Entity Recognition in Data Hungry Indian Languages. In: De Raedt, L. (ed.) ILP 2009. LNCS, vol. 5989, pp. 178–185. Springer, Heidelberg (2010)
Santos, J.: Efficient Learning and Evaluation of Complex Concepts in Inductive Logic Programming. Ph.D. Thesis, Imperial College (2010)
Stanford CoreNLP Tools. The Stanford Natural Language Processing Group, http://nlp.stanford.edu/software/corenlp.shtml
Wu, Z., Palmer, M.: Verb Semantics and Lexical Selection. In: Proc. of the 32nd Annual Meeting of the Association for Comp. Linguistics, New Mexico, USA, pp. 133–138 (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lima, R. et al. (2013). An Inductive Logic Programming-Based Approach for Ontology Population from the Web. In: Decker, H., Lhotská, L., Link, S., Basl, J., Tjoa, A.M. (eds) Database and Expert Systems Applications. DEXA 2013. Lecture Notes in Computer Science, vol 8055. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40285-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-40285-2_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40284-5
Online ISBN: 978-3-642-40285-2
eBook Packages: Computer ScienceComputer Science (R0)