Abstract
Many attempts have been made to extract structured data from Web resources, exposing them as RDF triples and interlinking them with other RDF datasets: in this way it is possible to create clouds of highly integrated Semantic Web data collections. In this paper we describe an approach to enhance the extraction of semantic contents from unstructured textual documents, in particular considering Wikipedia articles and focusing on event mining. Starting from the deep parsing of a set of English Wikipedia articles, we produce a semantic annotation compliant with the Knowledge Annotation Format (KAF). We extract events from the KAF semantic annotation and then we structure each event as a set of RDF triples linked to both DBpedia and WordNet. We point out examples of automatically mined events, providing some general evaluation of how our approach may discover new events and link them to existing contents.
Chapter PDF
Similar content being viewed by others
Keywords
References
RDF W3C Web Page, http://www.w3.org/RDF/
OWL W3C Recomm., http://www.w3.org/TR/owl-features/
Urbansky, D., Thom, J.A.: WebKnox: Web Knowledge Extraction. In: 13th Australasian Document Computing Symposium, Hobart (2008)
Zhao, S., Betx, J.: Corroborate and Learn Facts from the Web. In: 13th International Conference on Knowledge Discovery and Data Mining, San Josè (2007)
Banko, M., Etzioni, O.: The Tradeoffs Between Open and Traditional Relation Extraction. In: 46th ACL: Human Language Technologies, Columbus (2008)
Linked Data Web Site, http://linkeddata.org/
DBpedia Web Site, http://dbpedia.org/About
Open Calais Web Site, http://www.opencalais.com/
Wikify! Web Site, http://www.wikifyer.com/
Faviki Web Site, http://www.faviki.com/
Passant, A.: LODr - A Linking Open Data Tagging System. In: Social Data on the Web Workshop at the 7th Int. Semantic Web Conference, Karlsrhue (2008)
Tesconi, M., Ronzano, F., Marchetti, A., Minutoli, S.: Semantify del.icio.us: automatically turn your tags into senses. In: Social Data on the Web Workshop at the 7th International Semantic Web Conference, Karlsrhue (2008)
Tagpedia Web Site, http://www.tagpedia.org/
Nakayama, K.: Extracting Structured Knowledge for Semantic Web by Mining Wikipedia. In: Social Data on the Web Workshop at the 7th International Semantic Web Conference, Karlsrhue (2008)
Ronzano, F., Marchetti, A., Tesconi, M., Minutoli, S.: Tagpedia: a Semantic Reference to Describe and Search for Web Resources. In: Social Web and Knowledge Management Workshop at the 17th World Wide Web Conference, WWW 2008, Beijing (2008)
Adafre, S.F., Jijkoun, V., de Rijke, M.: Fact Discovery in Wikipedia. In: IEEE/WIC/ACM International Conference on Web Intelligence, Silicon Valley (2007)
Bhole, A., Fortuna, B., Grobelnik, M., Mladenic, D.: Mining Wikipedia and Relating Named Entities over Time. In: 13th International Conference on Knowledge Discovery and Data Mining, San Josè (2007)
Asterias, J., Zaragoza, H., Ciaramita, M., Attardi, G.: Semantically Annotated Snapshot of the English Wikipedia. In: 6th International Language Resources and Evaluation Conference LREC 2008, Marrakech (2008)
Suh, S., Halpin, H., Klein, E.: Extracting Common Sense Knowledge from Wikipedia. In: 6th International Semantic Web Conference, Athens, GA, USA (2006)
Bosma, W., Vossen, P., Soroa, A., Rigau, G., Tesconi, M., Marchetti, A., Aliprandi, C., Monachini, M.: KAF: a generic semantic annotation format. In: 5th International Conference on Generative Approaches to the Lexicon, Pisa (2009)
McCord, M.C.: Slot Grammar: A System for Simpler Construction of Practical Natural Language Grammars. Natural Language and Logic, 118–145 (1989)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aliprandi, C., Ronzano, F., Marchetti, A., Tesconi, M., Minutoli, S. (2011). Extracting Events from Wikipedia as RDF Triples Linked to Widespread Semantic Web Datasets. In: Ozok, A.A., Zaphiris, P. (eds) Online Communities and Social Computing. OCSC 2011. Lecture Notes in Computer Science, vol 6778. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21796-8_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-21796-8_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21795-1
Online ISBN: 978-3-642-21796-8
eBook Packages: Computer ScienceComputer Science (R0)