[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Improving Retrieval of Historical Content with Entity Linking

  • Conference paper
  • First Online:
New Trends in Databases and Information Systems (ADBIS 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 539))

Included in the following conference series:

  • East European Conference on Advances in Databases and Information Systems
  • 1283 Accesses

Abstract

The relevance of Named-Entity Recognition and Entity Linking for cultural heritage institutions is evaluated through a case-study involving the semantic enrichment of historical periodicals. A language-independent approach is proposed in order to improve the search experience of end-users with the mapping of entities to the Linked Open Data (LOD) cloud. Preliminary results show that a precision rate of almost 90% can be achieved with very little fine-tuning, while an increase in recall remains necessary.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 35.99
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 44.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Lin, Y., Ahn, J.W., Brusilovsky, P., He, D., Real, W.: ImageSieve: Exploratory Search of Museum Archives with Named Entity-Based Faceted Browsing. Proceedings of the American Society for Information Science and Technology 47, 1–10 (2010)

    Article  Google Scholar 

  2. Segers, R., van Erp, M., van der Meij, L., Aroyo, L., Schreiber, G., Wielinga, B., van Ossenbruggen, J., Oomen, J., Jacobs, G.: Hacking History: Automatic Historical Event Extraction for Enriching Cultural Heritage Multimedia Collections. In: Proceedings of the 6th International Conference on Knowledge Capture (K-CAP), Banff, Alberta, Canada (2011)

    Google Scholar 

  3. Rodriquez, K.J., Bryant, M., Blanke, T., Luszczynska, M.: Comparison of Named Entity Recognition Tools for Raw OCR Text. In: Proceedings of KONVENS 2012, Vienna, pp. 410–414 (2012)

    Google Scholar 

  4. van Hooland, S., De Wilde, M., Verborgh, R., Steiner, T., Van de Walle, R.: Exploring Entity Recognition and Disambiguation for Cultural Heritage Collections. Digital Scholarship in the Humanities 30, 262–279 (2015)

    Article  Google Scholar 

  5. Raimond, Y., Smethurst, M., McParland, A., Lowis, C.: Using the Past to Explain the Present: Interlinking Current Affairs with Archives via the Semantic Web. In: Alani, H., et al. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 146–161. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  6. Bingel, J., Haider, T.: Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC), Reykjavik, Iceland (2014)

    Google Scholar 

  7. Kupietz, M., Belica, C., Keibel, H., Witt, A.: The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC), Valletta, Malta (2010)

    Google Scholar 

  8. Agirre, E., Barrena, A., De Lacalle, O.L., Soroa, A., Fernando, S., Stevenson, M.: Matching Cultural Heritage Items to Wikipedia. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC), pp. 1729–1735 (2012)

    Google Scholar 

  9. Fernando, S., Stevenson, M.: Adapting wikification to cultural heritage. In: Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pp. 101–106. ACL (2012)

    Google Scholar 

  10. Shen, W., Wang, J., Han, J.: Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions. IEEE Transactions on Knowledge and Data Engineering 27, 443–460 (2015)

    Article  Google Scholar 

  11. Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - A Crystallization Point for the Web of Data. Web Semantics: Science, Services and Agents on the World Wide Web 7, 154–165 (2009)

    Article  Google Scholar 

  12. Ruiz, P., Poibeau, T.: Combining Open Source Annotators for Entity Linking through Weighted Voting. In: Proceedings of the 4th Joint Conference on Lexical and Computational Semantics (*SEM), Denver, CO, USA (2015)

    Google Scholar 

  13. Cornolti, M., Ferragina, P., Ciaramita, M.: A Framework for Benchmarking Entity-Annotation Systems. In: Proceedings of the 22nd International Conference on the World Wide Web, pp. 249–260 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Max De Wilde .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

De Wilde, M. (2015). Improving Retrieval of Historical Content with Entity Linking. In: Morzy, T., Valduriez, P., Bellatreche, L. (eds) New Trends in Databases and Information Systems. ADBIS 2015. Communications in Computer and Information Science, vol 539. Springer, Cham. https://doi.org/10.1007/978-3-319-23201-0_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23201-0_50

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23200-3

  • Online ISBN: 978-3-319-23201-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics