Abstract
The development of an online version of the Trinity College Dublin Printed Catalogue, which list books from the 14th C to 1872, is described. The principal benefit of the system is the ability to search on words and word stems in the title field. As the entries are in at least fourteen languages the language of each Roman script entry was determined, with a success rate of over 90%. The image of the entry from the catalogue is displayed. This hides the OCR errors.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Catalogus Librorum Impressorum qui in Bibliotheca Collegii Sacrosanctae et Individuae Trinitatis, Reginae Elizabethae, juxta Dublin. 9 vols (1864-1886)
Clarke, R.M.: OCROC Optical Character Recognition Output Corrector. Final year project Trinity College Dublin (May 1993)
Culligan, B.T.: Design of an On-Line Database Query System for the 1872 Printed Catalogue. Final year project Trinity College Dublin (May 1993)
Anderson, G.: Computerising a Library Catalogue using Optical Character Recognition. M.Sc. thesis, University of Dublin (1992)
Clarke, R.M.: User-Oriented Access to a Multilingual Database. M.Sc. thesis, University of Dublin (1995)
Kinane, V., Walsh, A. (eds.): Essays on the History of the Trinity College Library Dublin. Four Courts Press, Dublin (2000)
Bandinel, B.: Catalogus Librorum Impressorum in Bibliotheca Bodleiana, Oxford (1843)
Emmer, M.B., Quillen, E.K., Dewar, R.B.K.: MACRO SPITBOL The High-Performance SNOBOL Language. Catspaw Inc. (1991)
Zipf, G.K.: HumanBehaviour and the Principle of Least Effort. Addison-Wesley, Reading (1949)
Nic Gerailt, D., Byrne, J.G.: Error Detection in Several Languages for an OCR-Generated Multilingual Database. In: Proc. Third International Workshop on Applications of Natural Language to Information Systems, Simon Fraser University, Canada, June 26-27 (1997)
Smith, F.J., Devine, K.: BIRD, QUILL and MicroBIRD - A successful family of text retrieval systems. Literary and Linguistic Computing 4(2), 115–120 (1989)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Byrne, J.G. (2004). The Trinity College Dublin 1872 Online Catalogue. In: Marinai, S., Dengel, A.R. (eds) Document Analysis Systems VI. DAS 2004. Lecture Notes in Computer Science, vol 3163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28640-0_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-28640-0_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23060-1
Online ISBN: 978-3-540-28640-0
eBook Packages: Springer Book Archive