[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1007/978-3-030-68787-8_18guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A Convolutional Recurrent Neural Network for the Handwritten Text Recognition of Historical Greek Manuscripts

Published: 10 January 2021 Publication History

Abstract

In this paper, a Convolutional Recurrent Neural Network architecture for offline handwriting recognition is proposed. Specifically, a Convolutional Neural Network is used as an encoder for the input which is a textline image, while a Bidirectional Long Short-Term Memory (BLSTM) network followed by a fully connected neural network acts as the decoder for the prediction of a sequence of characters. This work was motivated by the need to transcribe historical Greek manuscripts that entail several challenges which have been extensively analysed. The proposed architecture has been tested for standard datasets, namely the IAM and RIMES, as well as for a newly created dataset, namely EPARCHOS, which contains historical Greek manuscripts and has been made publicly available for research purposes. Our experimental work relies upon a detailed ablation study which shows that the proposed architecture outperforms state-of-the-art approaches.

References

[1]
Chen, Z., Wu, Y., Yin, F., Liu, C.: Simultaneous script identification and handwriting recognition via multi-task learning of recurrent neural networks. In: 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017, Kyoto, Japan, 9–15 November 2017, pp. 525–530. IEEE (2017)
[2]
Dutta, K., Krishnan, P., Mathew, M., Jawahar, C.V.: Improving CNN-RNN hybrid networks for handwriting recognition. In: 16th International Conference on Frontiers in Handwriting Recognition, ICFHR 2018, Niagara Falls, NY, USA, 5–8 August 2018, pp. 80–85. IEEE Computer Society (2018)
[3]
Graves, A., Fernández, S., Gomez, F.J., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: Cohen, W.W., Moore, A.W. (eds.) Machine Learning, Proceedings of the Twenty-Third International Conference (ICML 2006), Pittsburgh, Pennsylvania, USA, 25–29 June 2006. ACM International Conference Proceeding Series, vol. 148, pp. 369–376. ACM (2006)
[4]
Grosicki, E., Carré, M., Brodin, J., Geoffrois, E.: Results of the RIMES evaluation campaign for handwritten mail processing. In: 10th International Conference on Document Analysis and Recognition, ICDAR 2009, Barcelona, Spain, 26–29 July 2009, pp. 941–945. IEEE Computer Society (2009)
[5]
Ingle, R.R., Fujii, Y., Deselaers, T., Baccash, J., Popat, A.C.: A scalable handwritten text recognition system. In: 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia, 20–25 September 2019, pp. 17–24. IEEE (2019)
[6]
Krishnan, P., Dutta, K., Jawahar, C.V.: Word spotting and recognition using deep embedding. In: 13th IAPR International Workshop on Document Analysis Systems, DAS 2018, Vienna, Austria, 24–27 April 2018, pp. 1–6. IEEE Computer Society (2018)
[7]
Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30, p. 3 (2013)
[8]
Marti U and Bunke H The iam-database: an english sentence database for offline handwriting recognition IJDAR 2002 5 1 39-46
[9]
Papazoglou, A., Pratikakis, I., Markou, K., Tsochatzidis, L.: Eparchos - historical Greek handwritten document dataset (version 1.0) [data set] (2020).
[10]
Pham, V., Bluche, T., Kermorvant, C., Louradour, J.: Dropout improves recurrent neural networks for handwriting recognition. In: 14th International Conference on Frontiers in Handwriting Recognition, ICFHR 2014, Crete, Greece, 1–4 September 2014, pp. 285–290. IEEE Computer Society (2014)
[11]
Puigcerver, J.: Are multidimensional recurrent layers really necessary for handwritten text recognition? In: 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017, Kyoto, Japan, 9–15 November 2017, pp. 67–72. IEEE (2017)
[12]
Puigcerver, J.: PyLaia Toolkit (2017). https://github.com/jpuigcerver/PyLaia. Accessed 7 Apr 2020
[13]
Ruder, S.: An overview of gradient descent optimization algorithms. CoRR abs/1609.04747 (2016)
[14]
Sainath, T.N., Vinyals, O., Senior, A.W., Sak, H.: Convolutional, long short-term memory, fully connected deep neural networks. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, 19–24 April 2015. pp. 4580–4584. IEEE (2015)
[15]
Shi B, Bai X, and Yao C An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition IEEE Trans. Pattern Anal. Mach. Intell. 2017 39 11 2298-2304
[16]
Voigtlaender, P., Doetsch, P., Ney, H.: Handwriting recognition with large multidimensional long short-term memory recurrent neural networks. In: 15th International Conference on Frontiers in Handwriting Recognition, ICFHR 2016, Shenzhen, China, 23–26 October 2016, pp. 228–233. IEEE Computer Society (2016)
[17]
Yu Y, Si X, Hu C, and Zhang J A review of recurrent neural networks: LSTM cells and network architectures Neural Comput. 2019 31 7 1235-1270

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10-15, 2021, Proceedings, Part VII
Jan 2021
695 pages
ISBN:978-3-030-68786-1
DOI:10.1007/978-3-030-68787-8
  • Editors:
  • Alberto Del Bimbo,
  • Rita Cucchiara,
  • Stan Sclaroff,
  • Giovanni Maria Farinella,
  • Tao Mei,
  • Marco Bertini,
  • Hugo Jair Escalante,
  • Roberto Vezzani

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 10 January 2021

Author Tags

  1. Offline handwriting recognition
  2. Recurrent Neural Networks
  3. Historical documents

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Enhancing CRNN HTR Architectures with Transformer BlocksDocument Analysis and Recognition - ICDAR 202410.1007/978-3-031-70546-5_25(425-440)Online publication date: 30-Aug-2024
  • (2023)Shared-Operation Hypercomplex Networks for Handwritten Text RecognitionDocument Analysis and Recognition - ICDAR 202310.1007/978-3-031-41685-9_13(200-216)Online publication date: 21-Aug-2023
  • (2022)2D Self-organized ONN model for Handwritten Text RecognitionApplied Soft Computing10.1016/j.asoc.2022.109311127:COnline publication date: 1-Sep-2022
  • (2022)Best Practices for a Handwritten Text Recognition SystemDocument Analysis Systems10.1007/978-3-031-06555-2_17(247-259)Online publication date: 22-May-2022
  • (undefined)An Ensemble Neural Network Model For Malayalam Character Recognition From Palm Leaf ManuscriptsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/3686311

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media