[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Using Clustering Across Union Catalogues to Enrich Entries with Indexing Information

  • Conference paper
  • First Online:
Data Analysis, Machine Learning and Knowledge Discovery

Abstract

The federal system in Germany has created a segmented library landscape. Instead of a central entity responsible for cataloguing and indexing, regional library unions share the workload cooperatively among their members. One result of this approach is limited sharing of cataloguing and indexing information across union catalogues as well as heterogeneous indexing of items with almost equivalent content: different editions of the same work. In this paper, a method for clustering entries in library catalogues is proposed that can be used to reduce this heterogeneity as well as share indexing information across catalogue boundaries. In two experiments, the method is applied to several union catalogues and the results show that a surprisingly large number of previously not indexed entries can be enriched with indexing information. The quality of the indexing has been positively evaluated by human professionals and the results have already been imported into the production catalogues of two library unions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 71.50
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 89.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Subject Headings Authority File. In a recent development, the SWD is combined with authority files for persons and corporate bodies to form the Gemeinsame Normdatei (GND) (engl.: Universal Authority File). This file will be suitable to be used with the proposed RDA cataloguing rules. The catalogue data used for the experiments described in this paper still used the SWD, but the results can be applied to catalogues using the GND.

  2. 2.

    Rules the for subject catalogue. See Scheven et al. (2012) for a complete reference.

  3. 3.

    Regensburg Union Classification System. See Lorenz (2008) for an introduction.

  4. 4.

    Library Union of South-West Germany. Catalogue: http://swb.bsz-bw.de/.

  5. 5.

    Hessian Library Information System. Catalogue: http://www.portal.hebis.de/.

  6. 6.

    University Library Center of North-Rhine Westphalia. Catalogue: http://okeanos-www.hbz-nrw.de/F/.

  7. 7.

    Bavarian Library Union. Catalogue: http://www.gateway-bayern.de/.

  8. 8.

    Common Library Network. Catalogue: http://gso.gbv.de/.

  9. 9.

    Reprository: http://culturegraph.sf.net.

References

  • Dewey, M. (2005). Dewey-Dezimalklassifikation und Register, Mitchell J. S. (Ed.). Munich: Saur.

    Google Scholar 

  • Dickey, T. J. (2008). FRBRization of a library catalog: better collocation of records, leading to enhanced search, retrieval, and display. Information Technology and Libraries, 27(1), 23–32.

    Google Scholar 

  • Eckert, K. (2010). Linked open projects: Nachnutzung von Projektergebnissen als Linked Data. In: M. Ockenfeld et al. (Eds.), Semantic Web & Linked Data: Elemente zukünftiger Informationsstrukturen; 1. DGI-Konferenz, proceedings (pp. 231–236). Frankfurt: DGI.

    Google Scholar 

  • Hickey, T. B., O’Neill, E. T., & Toves, J. (2002). Experiments with the IFLA functional requirements for bibliographic records (FRBR). D-Lib Magazine, 8(9).

    Google Scholar 

  • Lorenz, B. (2008). Handbuch zur Regensburger Verbundklassifikation: Materialien zur Einführung (2nd edn). Wiesbaden: Harrassowitz.

    Google Scholar 

  • Lux, C. (2003) The German library system: structure and new developments. IFLA Journal, 29(2), 113–128.

    Article  Google Scholar 

  • Pfeffer, M. (2010). Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen. In: U. Hohoff et al. (Eds.), 97. Deutscher Bibliothekartag in Mannheim 2008 - Wissen bewegen. Bibliotheken in der Informationsgesellschaft, proceedings (pp. 245–254). Frankfurt: Klostermann.

    Google Scholar 

  • Scheven, E., Kunz, M., & Bellgardt, S. (Eds.) (2012). Regeln für den Schlagwortkatalog: RSWK. Frankfurt: Dt. Nationalbibliothek.

    Google Scholar 

  • Sitas, A., & Kapidakis, S. (2008). Duplicate detection algorithms of bibliographic descriptions. Library Hi Tech, 26(2), 287–301.

    Article  Google Scholar 

  • Taniguchi, S. (2009). Automatic identification of “Works” toward construction of FRBRized OPACs: An experiment on JAPAN/MARC bibliographic records. Library and Information Science, 61, 119–151.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Magnus Pfeffer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Pfeffer, M. (2014). Using Clustering Across Union Catalogues to Enrich Entries with Indexing Information. In: Spiliopoulou, M., Schmidt-Thieme, L., Janning, R. (eds) Data Analysis, Machine Learning and Knowledge Discovery. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-01595-8_47

Download citation

Publish with us

Policies and ethics