Abstract
In-air handwriting is a rapidly emerging human–machine interactive paradigm that helps users to write and communicate naturally and intuitively in free space. In this paper, we develop a hybrid one-dimensional convolutional recurrent attention framework model for in-air handwritten Assamese word recognition (IAHAWR) which associates an encoder and a decoder framework for efficiently recognizing air-written words. The encoder is an assimilation of 1D convolutional neural network and bidirectional gated recurrent unit neural network for input trajectory feature sequence learning, while the decoder is an attention-based gated recurrent unit for predicting the target words. In contrast to conventional pen-based handwriting, in-air handwriting is intricate in the sense that the handwriting is finished in a single continuous stroke giving rise to many irrelevant motions called ligatures in between adjacent character strokes. So, we have imbibed a salient stroke extraction and a critical point detection scheme into the proposed system, which helps in removal of insignificant ligatures thus enhancing the recognition performance. Further, air-writing trajectories contain intermittent jitters and suffer wide variations in writing patterns due to unrestricted writing in free space. So, we incorporate a multistage word normalization methodology which generalizes the air-written patterns and aids in efficient recognition. We have assessed the performance of our proposed system on an air-written Assamese word dataset as well as some air-written Latin words. Experimental evaluation connotes that our proposed IAHAWR system can effectively procure characteristic information from air-writing sequences and provides comparable recognition accuracy and computational performance with that of other state-of-the-art recognition frameworks.
Similar content being viewed by others
References
Gan, J., Wang, W.: In-air handwritten English word recognition using attention recurrent translator. Neural Comput. Appl. 31, 1–18 (2017)
Choudhury, A., Sarma, K.K.: A CNN-LSTM based ensemble framework for in-air handwritten Assamese character recognition. Multimedia Tools Appl. 80(28), 35649–35684 (2021)
Chen, M., AlRegib, G., Juang, B.H.: Air-writing recognition—part I: modeling and recognition of characters, words, and connecting motions. IEEE Trans. Hum. Mach. Syst. 46(3), 403–413 (2015)
Ren, H., Wang, W., Lu, K., Zhou, J., Yuan, Q.: An end-to-end recognizer for in-air handwritten Chinese characters based on a new recurrent neural networks. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), 10–14 July 2017, Hong Kong, China, pp. 841–846 (2017)
Kumar, P., Saini, R., Roy, P.P., Pal, U.: A lexicon-free approach for 3D handwriting recognition using classifier combination. Pattern Recognit. Lett. 103, 1–7 (2018)
Gan, J., Wang, W., Lu, K.: A unified CNN-RNN approach for in-air handwritten English word recognition. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), 23–27 July 2018, San Diego, CA, USA, pp. 1–6 (2018)
Amma, C., Georgi, M., Schultz, T.: Airwriting: hands-free mobile text input by spotting and continuous recognition of 3D-space handwriting with inertial sensors. In: Proceedings of IEEE 16th International Symposium on Wearable Computers, pp. 52–59 (2012)
Chen, M., AlRegib, G., Juang, B.H.: Air-writing recognition-part II: detection and recognition of writing activity in continuous stream of motion data. IEEE Trans. Hum. Mach. Syst. 46(3), 436–444 (2016)
Kumar, P., Saini, R., Roy, P.P., Dogra, D.P.: 3D text segmentation and recognition using leap motion. Multimedia Tools Appl. 76(15), 16491–16510 (2017)
Vikram, S., Li, L., Russell, S.: Handwriting and gestures in the air, recognizing on the fly. In: Proceedings of the CHI, vol. 13, pp. 1179–1184 (2013)
Chiang, C.C., Wang, R.H., Chen, B.R.: Recognizing arbitrarily connected and superimposed handwritten numerals in intangible writing interfaces. Pattern Recognit. 61, 15–28 (2017)
Graves, A., Liwicki, M., Fernández, S., Bertolami, R., Bunke, H., Schmidhuber, J.: A novel connectionist system for unconstrained handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(5), 855–868 (2008)
Choudhury, A., Sarma, K.K.: A vision-based framework for spotting and segmentation of gesture-based Assamese characters written in the air. J. Inf. Technol. Res. JITR 14(1), 70–91 (2021)
Luong, M. T., Pham, H., Manning, C. D.: Effective approaches to attention-based neural machine translation. In: Proceedings of Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, pp. 1412–1421 (2015)
Yang, C., Ku, B., Han, D.K., Ko, H.: Alpha-numeric hand gesture recognition based on fusion of spatial feature modelling and temporal feature modelling. Electron. Lett. 52(20), 1679–1681 (2016)
Roy, P., Ghosh, S., Pal, U.: A CNN based framework for unistroke numeral recognition in air-writing. In: Proceedings of 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), 05–08 August 2018, Niagara Falls, NY, USA, pp. 404–409 (2018)
Rahman, A., Roy, P., Pal, U.: Continuous motion numeral recognition using RNN architecture in air-writing environment. In: Proceedings of Asian Conference on Pattern Recognition, pp. 76–90. Springer (2019)
Gan, J., Wang, W., Lu, K.: A new perspective: recognizing online handwritten Chinese characters via 1-dimensional CNN. Inf. Sci. 478, 375–390 (2019)
Agarwal, C., Dogra, D. P., Saini, R., Roy, P. P.: Segmentation and recognition of text written in 3d using leap motion interface. In: Proceedings of IEEE IAPR Asian Conference on Pattern Recognition (ACPR), 03–06 November 2015, Kuala Lumpur, Malaysia, pp. 539–543 (2015)
Wang, Z.R., Du, J., Wang, W.C., Zhai, J.F., Hu, J.S.: A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition. Int. J. Doc. Anal. Recognit. IJDAR 21(4), 241–251 (2018)
Wang, Z.R., Du, J., Wang, J.M.: Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition. Pattern Recognit. 100, 107102 (2020)
Xie, Z., Sun, Z., Jin, L., Feng, Z., Zhang, S.: Fully convolutional recurrent network for handwritten Chinese text recognition. In: Proceedings of IEEE 23rd International Conference on Pattern Recognition (ICPR), pp. 4011–4016 (2016)
Ahmed, S.B., Naz, S., Razzak, M.I., Rashid, S.F., Afzal, M.Z., Breuel, T.M.: Evaluation of cursive and non-cursive scripts using recurrent neural networks. Neural Comput. Appl. 27(3), 603–613 (2016)
Kumar, P., Saini, R., Roy, P.P., Dogra, D.P.: Study of text segmentation and recognition using leap motion sensor. IEEE Sens. J. 17(5), 1293–1301 (2017)
Phung, S.L., Bouzerdoum, A., Chai, D.: Skin segmentation using color pixel classification: analysis and comparison. IEEE Trans. Pattern Anal. Mach. Intell. 27(1), 148–154 (2005)
Choudhury, A., Talukdar, A.K., Sarma, K.K., Bhuyan, M.K.: An adaptive thresholding-based movement epenthesis detection technique using hybrid feature set for continuous fingerspelling recognition. SN Comput. Sci. 2(2), 1–21 (2021)
Wilson, J.N., Ritter, G.X.: Handbook of Computer Vision Algorithms in Image Algebra. CRC Press, Boca Raton (2000)
Gose, E., Johnsonbaugh, R., Jost, S.: Pattern Recognition and Image Analysis. Prentice-Hall, Englewood Cliffs (1996)
Petrick, N., Chan, H.P., Sahiner, B., Helvie, M.A.: Combined adaptive enhancement and region-growing segmentation of breast masses on digitized mammograms. Med. Phys. 26(8), 1642–1654 (1999)
Gallagher, N.B.: Savitzky–Golay Smoothing and Differentiation Filter. https://eigenvector.com/wp-content/uploads/2020/01/SavitzkyGolay.pdf
Zhang, X.Y., Yin, F., Zhang, Y.M., Liu, C.L., Bengio, Y.: Drawing and recognizing Chinese characters with recurrent neural network. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 849–862 (2017)
Liu, Y.K., Žalik, B., Wang, P.J., Podgorelec, D.: Directional difference chain codes with quasi-lossless compression and run-length encoding. Signal Process. Image Commun. 27(9), 973–984 (2012)
Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A convolutional neural network for modelling sentences. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Baltimore, Maryland, pp. 655–665. Association for Computational Linguistics (2014)
Chollet, F.: Deep Learning with Python. Simon and Schuster, New York (2021)
Chung, J., Gulcehre, C., Cho, K., Bengio, Y.: Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–286 (1989)
Acknowledgements
The authors would like to thank the Ministry of Electronics and Information Technology (MeitY), Government of India for providing the Visvesvaraya PhD Fellowship Scheme for conducting the research. The authors also wish to thank all the editors and anonymous reviewers for their constructive advice.
Author information
Authors and Affiliations
Contributions
AC formulated the methodology and algorithms incorporated in the work, designed and implemented the proposed framework, performed experimental evaluations and wrote the manuscript. KKS defined and supervised the research, assessed the framework’s functionality and provided suggestions for revision of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Choudhury, A., Sarma, K.K. Trajectory-based recognition of in-air handwritten Assamese words using a hybrid classifier network. IJDAR 26, 375–400 (2023). https://doi.org/10.1007/s10032-022-00426-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-022-00426-3