Abstract
The relevance of Named-Entity Recognition and Entity Linking for cultural heritage institutions is evaluated through a case-study involving the semantic enrichment of historical periodicals. A language-independent approach is proposed in order to improve the search experience of end-users with the mapping of entities to the Linked Open Data (LOD) cloud. Preliminary results show that a precision rate of almost 90% can be achieved with very little fine-tuning, while an increase in recall remains necessary.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lin, Y., Ahn, J.W., Brusilovsky, P., He, D., Real, W.: ImageSieve: Exploratory Search of Museum Archives with Named Entity-Based Faceted Browsing. Proceedings of the American Society for Information Science and Technology 47, 1–10 (2010)
Segers, R., van Erp, M., van der Meij, L., Aroyo, L., Schreiber, G., Wielinga, B., van Ossenbruggen, J., Oomen, J., Jacobs, G.: Hacking History: Automatic Historical Event Extraction for Enriching Cultural Heritage Multimedia Collections. In: Proceedings of the 6th International Conference on Knowledge Capture (K-CAP), Banff, Alberta, Canada (2011)
Rodriquez, K.J., Bryant, M., Blanke, T., Luszczynska, M.: Comparison of Named Entity Recognition Tools for Raw OCR Text. In: Proceedings of KONVENS 2012, Vienna, pp. 410–414 (2012)
van Hooland, S., De Wilde, M., Verborgh, R., Steiner, T., Van de Walle, R.: Exploring Entity Recognition and Disambiguation for Cultural Heritage Collections. Digital Scholarship in the Humanities 30, 262–279 (2015)
Raimond, Y., Smethurst, M., McParland, A., Lowis, C.: Using the Past to Explain the Present: Interlinking Current Affairs with Archives via the Semantic Web. In: Alani, H., et al. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 146–161. Springer, Heidelberg (2013)
Bingel, J., Haider, T.: Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC), Reykjavik, Iceland (2014)
Kupietz, M., Belica, C., Keibel, H., Witt, A.: The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC), Valletta, Malta (2010)
Agirre, E., Barrena, A., De Lacalle, O.L., Soroa, A., Fernando, S., Stevenson, M.: Matching Cultural Heritage Items to Wikipedia. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC), pp. 1729–1735 (2012)
Fernando, S., Stevenson, M.: Adapting wikification to cultural heritage. In: Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pp. 101–106. ACL (2012)
Shen, W., Wang, J., Han, J.: Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions. IEEE Transactions on Knowledge and Data Engineering 27, 443–460 (2015)
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - A Crystallization Point for the Web of Data. Web Semantics: Science, Services and Agents on the World Wide Web 7, 154–165 (2009)
Ruiz, P., Poibeau, T.: Combining Open Source Annotators for Entity Linking through Weighted Voting. In: Proceedings of the 4th Joint Conference on Lexical and Computational Semantics (*SEM), Denver, CO, USA (2015)
Cornolti, M., Ferragina, P., Ciaramita, M.: A Framework for Benchmarking Entity-Annotation Systems. In: Proceedings of the 22nd International Conference on the World Wide Web, pp. 249–260 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
De Wilde, M. (2015). Improving Retrieval of Historical Content with Entity Linking. In: Morzy, T., Valduriez, P., Bellatreche, L. (eds) New Trends in Databases and Information Systems. ADBIS 2015. Communications in Computer and Information Science, vol 539. Springer, Cham. https://doi.org/10.1007/978-3-319-23201-0_50
Download citation
DOI: https://doi.org/10.1007/978-3-319-23201-0_50
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23200-3
Online ISBN: 978-3-319-23201-0
eBook Packages: Computer ScienceComputer Science (R0)