Abstract
The federal system in Germany has created a segmented library landscape. Instead of a central entity responsible for cataloguing and indexing, regional library unions share the workload cooperatively among their members. One result of this approach is limited sharing of cataloguing and indexing information across union catalogues as well as heterogeneous indexing of items with almost equivalent content: different editions of the same work. In this paper, a method for clustering entries in library catalogues is proposed that can be used to reduce this heterogeneity as well as share indexing information across catalogue boundaries. In two experiments, the method is applied to several union catalogues and the results show that a surprisingly large number of previously not indexed entries can be enriched with indexing information. The quality of the indexing has been positively evaluated by human professionals and the results have already been imported into the production catalogues of two library unions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Subject Headings Authority File. In a recent development, the SWD is combined with authority files for persons and corporate bodies to form the Gemeinsame Normdatei (GND) (engl.: Universal Authority File). This file will be suitable to be used with the proposed RDA cataloguing rules. The catalogue data used for the experiments described in this paper still used the SWD, but the results can be applied to catalogues using the GND.
- 2.
Rules the for subject catalogue. See Scheven et al. (2012) for a complete reference.
- 3.
Regensburg Union Classification System. See Lorenz (2008) for an introduction.
- 4.
Library Union of South-West Germany. Catalogue: http://swb.bsz-bw.de/.
- 5.
Hessian Library Information System. Catalogue: http://www.portal.hebis.de/.
- 6.
University Library Center of North-Rhine Westphalia. Catalogue: http://okeanos-www.hbz-nrw.de/F/.
- 7.
Bavarian Library Union. Catalogue: http://www.gateway-bayern.de/.
- 8.
Common Library Network. Catalogue: http://gso.gbv.de/.
- 9.
Reprository: http://culturegraph.sf.net.
References
Dewey, M. (2005). Dewey-Dezimalklassifikation und Register, Mitchell J. S. (Ed.). Munich: Saur.
Dickey, T. J. (2008). FRBRization of a library catalog: better collocation of records, leading to enhanced search, retrieval, and display. Information Technology and Libraries, 27(1), 23–32.
Eckert, K. (2010). Linked open projects: Nachnutzung von Projektergebnissen als Linked Data. In: M. Ockenfeld et al. (Eds.), Semantic Web & Linked Data: Elemente zukünftiger Informationsstrukturen; 1. DGI-Konferenz, proceedings (pp. 231–236). Frankfurt: DGI.
Hickey, T. B., O’Neill, E. T., & Toves, J. (2002). Experiments with the IFLA functional requirements for bibliographic records (FRBR). D-Lib Magazine, 8(9).
Lorenz, B. (2008). Handbuch zur Regensburger Verbundklassifikation: Materialien zur Einführung (2nd edn). Wiesbaden: Harrassowitz.
Lux, C. (2003) The German library system: structure and new developments. IFLA Journal, 29(2), 113–128.
Pfeffer, M. (2010). Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen. In: U. Hohoff et al. (Eds.), 97. Deutscher Bibliothekartag in Mannheim 2008 - Wissen bewegen. Bibliotheken in der Informationsgesellschaft, proceedings (pp. 245–254). Frankfurt: Klostermann.
Scheven, E., Kunz, M., & Bellgardt, S. (Eds.) (2012). Regeln für den Schlagwortkatalog: RSWK. Frankfurt: Dt. Nationalbibliothek.
Sitas, A., & Kapidakis, S. (2008). Duplicate detection algorithms of bibliographic descriptions. Library Hi Tech, 26(2), 287–301.
Taniguchi, S. (2009). Automatic identification of “Works” toward construction of FRBRized OPACs: An experiment on JAPAN/MARC bibliographic records. Library and Information Science, 61, 119–151.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Pfeffer, M. (2014). Using Clustering Across Union Catalogues to Enrich Entries with Indexing Information. In: Spiliopoulou, M., Schmidt-Thieme, L., Janning, R. (eds) Data Analysis, Machine Learning and Knowledge Discovery. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-01595-8_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-01595-8_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01594-1
Online ISBN: 978-3-319-01595-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)