More Web Proxy on the site http://driver.im/

Article

A Convolutional Recurrent Neural Network for the Handwritten Text Recognition of Historical Greek Manuscripts

Authors:

L. Tsochatzidis,

X. Karagiannis,

I. PratikakisAuthors Info & Claims

Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10-15, 2021, Proceedings, Part VII

Pages 249 - 262

https://doi.org/10.1007/978-3-030-68787-8_18

Published: 10 January 2021 Publication History

Abstract

In this paper, a Convolutional Recurrent Neural Network architecture for offline handwriting recognition is proposed. Specifically, a Convolutional Neural Network is used as an encoder for the input which is a textline image, while a Bidirectional Long Short-Term Memory (BLSTM) network followed by a fully connected neural network acts as the decoder for the prediction of a sequence of characters. This work was motivated by the need to transcribe historical Greek manuscripts that entail several challenges which have been extensively analysed. The proposed architecture has been tested for standard datasets, namely the IAM and RIMES, as well as for a newly created dataset, namely EPARCHOS, which contains historical Greek manuscripts and has been made publicly available for research purposes. Our experimental work relies upon a detailed ablation study which shows that the proposed architecture outperforms state-of-the-art approaches.

References

[1]

Chen, Z., Wu, Y., Yin, F., Liu, C.: Simultaneous script identification and handwriting recognition via multi-task learning of recurrent neural networks. In: 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017, Kyoto, Japan, 9–15 November 2017, pp. 525–530. IEEE (2017)

[2]

Dutta, K., Krishnan, P., Mathew, M., Jawahar, C.V.: Improving CNN-RNN hybrid networks for handwriting recognition. In: 16th International Conference on Frontiers in Handwriting Recognition, ICFHR 2018, Niagara Falls, NY, USA, 5–8 August 2018, pp. 80–85. IEEE Computer Society (2018)

[3]

Graves, A., Fernández, S., Gomez, F.J., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: Cohen, W.W., Moore, A.W. (eds.) Machine Learning, Proceedings of the Twenty-Third International Conference (ICML 2006), Pittsburgh, Pennsylvania, USA, 25–29 June 2006. ACM International Conference Proceeding Series, vol. 148, pp. 369–376. ACM (2006)

[4]

Grosicki, E., Carré, M., Brodin, J., Geoffrois, E.: Results of the RIMES evaluation campaign for handwritten mail processing. In: 10th International Conference on Document Analysis and Recognition, ICDAR 2009, Barcelona, Spain, 26–29 July 2009, pp. 941–945. IEEE Computer Society (2009)

[5]

Ingle, R.R., Fujii, Y., Deselaers, T., Baccash, J., Popat, A.C.: A scalable handwritten text recognition system. In: 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia, 20–25 September 2019, pp. 17–24. IEEE (2019)

[6]

Krishnan, P., Dutta, K., Jawahar, C.V.: Word spotting and recognition using deep embedding. In: 13th IAPR International Workshop on Document Analysis Systems, DAS 2018, Vienna, Austria, 24–27 April 2018, pp. 1–6. IEEE Computer Society (2018)

[7]

Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30, p. 3 (2013)

[8]

Marti U and Bunke H The iam-database: an english sentence database for offline handwriting recognition IJDAR 2002 5 1 39-46

[9]

Papazoglou, A., Pratikakis, I., Markou, K., Tsochatzidis, L.: Eparchos - historical Greek handwritten document dataset (version 1.0) [data set] (2020).

[10]

Pham, V., Bluche, T., Kermorvant, C., Louradour, J.: Dropout improves recurrent neural networks for handwriting recognition. In: 14th International Conference on Frontiers in Handwriting Recognition, ICFHR 2014, Crete, Greece, 1–4 September 2014, pp. 285–290. IEEE Computer Society (2014)

[11]

Puigcerver, J.: Are multidimensional recurrent layers really necessary for handwritten text recognition? In: 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017, Kyoto, Japan, 9–15 November 2017, pp. 67–72. IEEE (2017)

[12]

Puigcerver, J.: PyLaia Toolkit (2017). https://github.com/jpuigcerver/PyLaia. Accessed 7 Apr 2020

[13]

Ruder, S.: An overview of gradient descent optimization algorithms. CoRR abs/1609.04747 (2016)

[14]

Sainath, T.N., Vinyals, O., Senior, A.W., Sak, H.: Convolutional, long short-term memory, fully connected deep neural networks. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, 19–24 April 2015. pp. 4580–4584. IEEE (2015)

[15]

Shi B, Bai X, and Yao C An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition IEEE Trans. Pattern Anal. Mach. Intell. 2017 39 11 2298-2304

[16]

Voigtlaender, P., Doetsch, P., Ney, H.: Handwriting recognition with large multidimensional long short-term memory recurrent neural networks. In: 15th International Conference on Frontiers in Handwriting Recognition, ICFHR 2016, Shenzhen, China, 23–26 October 2016, pp. 228–233. IEEE Computer Society (2016)

[17]

Yu Y, Si X, Hu C, and Zhang J A review of recurrent neural networks: LSTM cells and network architectures Neural Comput. 2019 31 7 1235-1270

Cited By

Retsinas GNikolaidou KSfikas G(2024)Enhancing CRNN HTR Architectures with Transformer BlocksDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70546-5_25(425-440)Online publication date: 30-Aug-2024
https://dl.acm.org/doi/10.1007/978-3-031-70546-5_25
Sfikas GRetsinas GDimitrakopoulos PGatos BNikou C(2023)Shared-Operation Hypercomplex Networks for Handwritten Text RecognitionDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41685-9_13(200-216)Online publication date: 21-Aug-2023
https://dl.acm.org/doi/10.1007/978-3-031-41685-9_13
Mohammed HMalik JAl-Maadeed SKiranyaz S(2022)2D Self-organized ONN model for Handwritten Text RecognitionApplied Soft Computing10.1016/j.asoc.2022.109311127:COnline publication date: 1-Sep-2022
https://dl.acm.org/doi/10.1016/j.asoc.2022.109311
Show More Cited By

Recommendations

Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network
Abstract
Digitizing ancient manuscripts and making them accessible to a broader audience is a crucial step in unlocking the wealth of information they hold. However, automatic recognition of handwritten text and the extraction of relevant information such ...
Low resolution Arabic recognition with multidimensional recurrent neural networks
MOCR '13: Proceedings of the 4th International Workshop on Multilingual OCR

OCR of multi-font Arabic text is difficult due to large variations in character shapes from one font to another. It becomes even more challenging if the text is rendered at very low resolution. This paper describes a multi-font, low resolution, and open ...
Offline handwritten word recognition in Hindi
DAR '12: Proceeding of the workshop on Document Analysis and Recognition

This paper discusses the Hindi offline handwritten word recognizer (HWR) that we are developing. For the purpose of training and testing the offline HWR, we have created a Hindi handwritten word and character database from 100 writers. In our HWR we use ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings

Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10-15, 2021, Proceedings, Part VII

Jan 2021

695 pages

ISBN:978-3-030-68786-1

DOI:10.1007/978-3-030-68787-8

Editors:
Alberto Del Bimbo
Dipartimento di Ingegneria dell’Informazione, University of Firenze, Firenze, Italy
,
Rita Cucchiara
Dipartimento di Ingegneria “Enzo Ferrari”, Università di Modena e Reggio Emilia, Modena, Italy
,
Stan Sclaroff
Department of Computer Science, Boston University, Boston, MA, USA
,
Giovanni Maria Farinella
Dipartimento di Matematica e Informatica, University of Catania, Catania, Italy
,
Tao Mei
Cloud & AI, JD.COM, Beijing, China
,
Marco Bertini
Dipartimento di Ingegneria dell’Informazione, University of Firenze, Firenze, Italy
,
Hugo Jair Escalante
Computational Sciences Department, National Institute of Astrophysics, Optics and Electronics (INAOE), Tonantzintla, Puebla, Mexico
,
Roberto Vezzani
Dipartimento di Ingegneria “Enzo Ferrari”, Università di Modena e Reggio Emilia, Modena, Italy

© Springer Nature Switzerland AG 2021.

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 10 January 2021

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 31 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Retsinas GNikolaidou KSfikas G(2024)Enhancing CRNN HTR Architectures with Transformer BlocksDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70546-5_25(425-440)Online publication date: 30-Aug-2024
https://dl.acm.org/doi/10.1007/978-3-031-70546-5_25
Sfikas GRetsinas GDimitrakopoulos PGatos BNikou C(2023)Shared-Operation Hypercomplex Networks for Handwritten Text RecognitionDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41685-9_13(200-216)Online publication date: 21-Aug-2023
https://dl.acm.org/doi/10.1007/978-3-031-41685-9_13
Mohammed HMalik JAl-Maadeed SKiranyaz S(2022)2D Self-organized ONN model for Handwritten Text RecognitionApplied Soft Computing10.1016/j.asoc.2022.109311127:COnline publication date: 1-Sep-2022
https://dl.acm.org/doi/10.1016/j.asoc.2022.109311
Retsinas GSfikas GGatos BNikou C(2022)Best Practices for a Handwritten Text Recognition SystemDocument Analysis Systems10.1007/978-3-031-06555-2_17(247-259)Online publication date: 22-May-2022
https://dl.acm.org/doi/10.1007/978-3-031-06555-2_17
Sudarsan DSankar D(undefined)An Ensemble Neural Network Model For Malayalam Character Recognition From Palm Leaf ManuscriptsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/3686311
https://dl.acm.org/doi/10.1145/3686311

View Options

View options

Figures

Tables

Media

View Table of Conten