[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Development of a Benchmark Odia Handwritten Character Database for an Efficient Offline Handwritten Character Recognition with a Chronological Survey

Published: 17 June 2023 Publication History

Abstract

A good benchmark dataset is a primary requirement in the offline handwritten character recognition (HCR) process. Only three handwritten numerals and alphabet datasets from Odia are publicly accessible for study, although many writers have used several datasets in their experiments. In this article, two tasks are done to address this issue. Those are the following: First, an extensive survey focused on various datasets is provided with the methodologies used in chronological order. The second factor is a solution to the lack of publicly available handwritten characters and numeral datasets. A new dataset of handwritten Odia characters with numerals has been developed. Anyone can access this dataset by sending an email to the authors of the article. This dataset was created with the help of 150 volunteers of various age groups, races, and qualifications. Some homogeneous experiments are conducted using deep learning models to evaluate the consistency of the dataset. One heterogeneous trial has also been performed to estimate the complexities of the characters present in the dataset by comparing them with the existing benchmark datasets.

References

[1]
Shailesh Acharya, Ashok Kumar Pant, and Prashnna Kumar Gyawali. 2015. Deep learning based large scale handwritten Devanagari character recognition. In 9th International Conference on Software, Knowledge, Information Management and Applications (SKIMA). IEEE, 1–6.
[2]
Ujjwal Bhattacharya and B. B. Chaudhuri. 2005. Databases for research on recognition of handwritten characters of Indian scripts. In 8th International Conference on Document Analysis and Recognition (ICDAR’05). IEEE, 789–793.
[3]
Ujjwal Bhattacharya, Malayappan Shridhar, Swapan K. Parui, P. K. Sen, and B. B. Chaudhuri. 2012. Offline recognition of handwritten Bangla characters: An efficient two-stage approach. Pattern Anal. Applic. 15, 4 (2012), 445–458.
[4]
Tapan K. Bhowmik, Swapan K. Parui, Ujjwal Bhattacharya, and Bikash Shaw. 2006. An HMM based recognition scheme for handwritten Oriya numerals. In 9th International Conference on Information Technology (ICIT’06). IEEE, 105–110.
[5]
Zhong Cao, Jiang Lu, Sen Cui, and Changshui Zhang. 2020. Zero-shot handwritten Chinese character recognition with hierarchical decomposition embedding. Pattern Recog. 107 (2020), 107488.
[6]
Sukalpa Chanda, Katrin Franke, and Umapada Pal. 2012. Text independent writer identification for Oriya script. In 10th IAPR International Workshop on Document Analysis Systems. IEEE, 369–373.
[7]
Dibyasundar Das, Ratnakar Dash, and Banshidhar Majhi. 2018. Optimization based feature generation for handwritten Odia-numeral recognition. In 14th International Conference on Information Processing (ICINPRO). IEEE, 1–5.
[8]
Dibyasundar Das, Deepak Ranjan Nayak, Ratnakar Dash, and Banshidahar Majhi. 2019. An empirical evaluation of extreme learning machine: Application to handwritten character recognition. Multim. Tools Applic. 78, 14 (2019), 19495–19523.
[9]
Dibyasundar Das, Deepak Ranjan Nayak, Ratnakar Dash, and Banshidhar Majhi. 2020. MJCN: Multi-objective Jaya convolutional network for handwritten optical character recognition. Multim. Tools Applic. 79, 43 (2020), 33023–33042.
[10]
Bhabani Dash, Shibashis Pradhan, and Debaraj Rana. 2016. Odia offline character recognition using DWT features. IOSR J. Electron. Commun. Eng. (2016), 31–37.
[11]
Kalyan S. Dash, N. B. Puhan, and Ganapati Panda. 2014. A hybrid feature and discriminant classifier for high accuracy handwritten Odia numeral recognition. In IEEE Region 10 Symposium. IEEE, 531–535.
[12]
Kalyan Sourav Dash, Niladri B. Puhan, and Ganapati Panda. 2015. Handwritten numeral recognition using non-redundant Stockwell transform and bio-inspired optimal zoning. IET Image Process. 9, 10 (2015), 874–882.
[13]
Kalyan S. Dash, Niladri B. Puhan, and Ganapati Panda. 2015. On extraction of features for handwritten Odia numeral recognition in transformed domain. In 8th International Conference on Advances in Pattern Recognition (ICAPR). IEEE, 1–6.
[14]
Kalyan S. Dash, Niladri B. Puhan, and Ganapati Panda. 2017. Odia character recognition: A directional review. Artif. Intell. Rev. 48, 4 (2017), 473–497.
[15]
Kalyan S. Dash, Niladri B. Puhan, and Ganapati Panda. 2020. Sparse concept coded tetrolet transform for unconstrained Odia character recognition. arXiv preprint arXiv:2004.01551.
[16]
Teófilo Emídio De Campos, Bodla Rakesh Babu, Manik Varma et al. 2009. Character recognition in natural images. VISAPP 7, 2 (2009).
[17]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 248–255.
[18]
Raghunath Dey and Rakesh Chandra Balabantaray. 2019. A novel sliding window approach for offline handwritten character recognition. In International Conference on Information Technology (ICIT). IEEE, 178–183.
[19]
Raghunath Dey, Rakesh Chandra Balabantaray, and Sanghamitra Mohanty. 2021. Sliding window based off-line handwritten text recognition using edit distance. Multim. Tools Applic. 81, 16 (2021), 1–28.
[20]
Raghunath Dey, Rakesh Chandra Balabantaray, and Jayashree Piri. 2020. A robust handwritten digit recognition system based on sliding window with edit distance. In IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT). IEEE, 1–6.
[21]
Raghunath Dey, Rakesh Chandra Balabantaray, Jayashree Piri, and Debabrata Singh. 2021. Offline natural scene character recognition using VGG16 neural networks. In 3rd International Conference on Inventive Research in Computing Applications (ICIRCA). IEEE, 946–951.
[22]
Riya Guha, Nibaran Das, Mahantapas Kundu, Mita Nasipuri, and K. C. Santosh. 2020. DevNet: An efficient CNN architecture for handwritten Devanagari character recognition. Int. J. Pattern Recog. Artif. Intell. 34, 12 (2020), 2052009.
[23]
Tarun Jindal and Ujjwal Bhattacharya. 2013. Recognition of offline handwritten numerals using an ensemble of MLPs combined by AdaBoost. In 4th International Workshop on Multilingual OCR. ACM, 1–5.
[24]
B. I. Justusson. 1981. Median filtering: Statistical properties. In Two-dimensional Digital Signal Processing II. Springer, 161–196.
[25]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2017. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 6 (2017), 84–90.
[26]
Zhiyuan Li, Qi Wu, Yi Xiao, Min Jin, and Huaxiang Lu. 2020. Deep matching network for handwritten Chinese character recognition. Pattern Recog. 107 (2020), 107471.
[27]
Manoj Kumar Mahto, Archana Kumari, and S. Panigrahi. 2011. Asystem for Oriya handwritten numeral recognization for Indian postal automation. Int. J. Appl. Sci. Technol. Res. Excell. 1, 1 (2011), 17–23.
[28]
Babita Majhi and Puspalata Pujari. 2018. On development and performance evaluation of novel Odia handwritten digit recognition methods. Arab. J. Sci. Eng. 43, 8 (2018), 3887–3901.
[29]
Sukadev Meher and Debasish Basa. 2011. An intelligent scanner with handwritten Odia character recognition capability. In 5th International Conference on Sensing Technology. IEEE, 53–59.
[30]
Tusar Kanti Mishra, Banshidhar Majhi, and Sandeep Panda. 2013. A comparative analysis of image transformations for handwritten Odia numeral recognition. In International Conference on Advances in Computing, Communications and Informatics (ICACCI). IEEE, 790–793.
[31]
Tusar Kanti Mishra, Banshidhar Majhi, Pankaj K. Sa, and Sandeep Panda. 2014. Model based Odia numeral recognition using fuzzy aggregated features. Front. Comput. Sci. 8, 6 (2014), 916–922.
[32]
Ramesh Kumar Mohapatra, Banshidhar Majhi, and Sanjay Kumar Jena. 2015. Classification of handwritten Odia basic character using Stockwell transform. Int. J. Appl. Pattern Recog. 2, 3 (2015), 235–254.
[33]
Ramesh Kumar Mohapatra, Tusar Kanti Mishra, Sandeep Panda, and Banshidhar Majhi. 2015. OHCS: A database for handwritten atomic Odia Character Recognition. In 5th National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG). IEEE, 1–4.
[34]
Mamata Nayak and Ajit Kumar Nayak. 2017. Odia character recognition using backpropagation network with binary features. Int. J. Computat. Vis. Robot. 7, 5 (2017), 588–604.
[35]
Kishorjit Nongmeikapam, Kanan Wahengbam, Oinam Nickson Meetei, and Themrichon Tuithung. 2019. Handwritten Manipuri Meetei-Mayek classification using convolutional neural network. ACM Trans. Asian Low-resour. Lang. Inf. Process. 18, 4 (2019), 1–23.
[36]
Nobuyuki Otsu. 1979. A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man, Cybern. 9, 1 (1979), 62–66.
[37]
Debananda Padhi and Debabrata Senapati. 2013. Zone centroid distance and standard deviation based feature matrix for Odia handwritten character recognition. In International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA). Springer, 649–658.
[38]
Umapada Pal, Ramachandran Jayadevan, and Nabin Sharma. 2012. Handwriting recognition in Indian regional scripts: A survey of offline techniques. ACM Trans. Asian Lang. Inf. Process. 11, 1 (2012), 1–35.
[39]
Umapada Pal, Nabin Sharma, and Fumitaka Kimura. 2007. Oriya off-line handwritten character recognition. In Advances in Pattern Recognition. World Scientific, 123–128.
[40]
Umapada Pal, Nabin Sharma, Tetsushi Wakabayashi, and Fumitaka Kimura. 2007. Handwritten numeral recognition of six popular Indian scripts. In 9th International Conference on Document Analysis and Recognition (ICDAR’07), Vol. 2. IEEE, 749–753.
[41]
Umapada Pal, Tetsushi Wakabayashi, and Fumitaka Kimura. 2007. A system for off-line Oriya handwritten character recognition using curvature feature. In 10th International Conference on Information Technology (ICIT’07). IEEE, 227–229.
[42]
Smruti Rekha Panda and Jogeswar Tripathy. 2015. Odia offline typewritten character recognition using template matching with unicode mapping. In International Symposium on Advanced Computing and Communication (ISACC). IEEE, 109–115.
[43]
Kaushik Roy, Tandra Pal, Umapada Pal, and Fumitaka Kimura. 2005. Oriya handwritten numeral recognition system. In 8th International Conference on Document Analysis and Recognition (ICDAR’05). IEEE, 770–774.
[44]
Indugu Rushiraj, Souvik Kundu, and Baidyanath Ray. 2016. Handwritten character recognition of Odia script. In International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES). IEEE, 764–767.
[45]
Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. 2018. MobileNetV2: Inverted residuals and linear bottlenecks. In IEEE Conference on Computer Vision and Pattern Recognition. 4510–4520.
[46]
Pradeepta K. Sarangi and Parvez Ahmed. 2013. Recognition of handwritten Odia numerals using artificial intelligence techniques. Int. J. Comput. Sci. 2, 02 (2013), 35–38.
[47]
Pradeepta K. Sarangi, P. Ahmed, and Kiran K. Ravulakollu. 2014. Naïve Bayes classifier with Lu factorization for recognition of handwritten Odia numerals. Ind. J. Sci. Technol. 7, 1 (2014), 35–38.
[48]
Pradeepta K. Sarangi, Ashok K. Sahoo, and P. Ahmed. 2012. Recognition of isolated handwritten oriya numerals using Hopfield neural network. Int. J. Comput. Applic. 40, 8 (2012), 36–42.
[49]
Subhashree Satpathy, Ajit Kumar Nayak, Mamata Nayak, and Siksha O. Anusandhan. 2018. Use of stacked autoencoder in Odia numeral image recognition. Int. J. Pure Appl. Math. 118, 24 (2018).
[50]
Abhisek Sethy and Prashanta Kumar Patra. 2016. Off-line Odia handwritten numeral recognition using neural network: A comparative analysis. In International Conference on Computing, Communication and Automation (ICCCA). IEEE, 1099–1103.
[51]
Abhisek Sethy and Prashanta Kumar Patra. 2019. Off-line Odia handwritten character recognition: An axis constellation model based research. Int. J. Innov. Technol. Explor. Eng. 8, 9S2 (2019), 788–793.
[52]
Abhisek Sethy and Prashanta Kumar Patra. 2020. R-HOG feature-based off-line Odia handwritten character recognition. In Examining Fractal Image Processing and Analysis. IGI Global, 196–210.
[53]
Abhisek Sethy, Prashanta Kumar Patra, and Deepak Ranjan Nayak. 2018. Off-line handwritten Odia character recognition using DWT and PCA. In Progress in Advanced Computing and Intelligent Engineering. Springer, 187–195.
[54]
Abhisek Sethy, Prashanta K. Patra, and Deepak Ranjan Nayak. 2019. Gray-level co-occurrence matrix and random forest based off-line Odia handwritten character recognition. Recent Patents Eng. 13, 2 (2019), 136–141.
[55]
Abhisek Sethy, Prashanta Kumar Patra, SoumyaRanjan Nayak, and Pyari Mohan Jena. 2017. Symmetric axis based off-line Odia handwritten character and numeral recognition. In 3rd International Conference on Computational Intelligence and Networks (CINE). IEEE, 83–87.
[56]
Abhisek Sethy, Prashanta Kumar Patra, and Soumya Ranjan Nayak. 2020. Offline handwritten numeral recognition using convolution neural network. Mach. Vis. Inspect. Syst.: Image Process., Concepts, Methodol. Applic. 1 (2020), 197–212.
[57]
Pawan Kumar Singh, Iman Chatterjee, Ram Sarkar, Elisa Barney Smith, and Mita Nasipuri. 2021. A new feature extraction approach for script invariant handwritten numeral recognition. Expert Syst. 38, 6 (2021), e12699.
[58]
Sunny Singh, Pradeepta Kumar Sarangi, Chaitanya Singla, and Ashok Kumar Sahoo. 2021. Odia character recognition system: A study on feature extraction and classification techniques. Mater. Today: Proc. 34 (2021), 742–747.
[59]
Dhivya Subburaman and Usha Devi Gandhi. 2020. Study on automated approach to generate character recognition for handwritten and historical documents. ACM Trans. Asian Low-resour. Lang. Inf (2020).
[60]
Nilamadhaba Tripathy and Umapada Pal. 2003. System for Oriya handwritten numeral recognition. In Document Recognition and Retrieval XI, Vol. 5296. International Society for Optics and Photonics, IEEE, 174–181.
[61]
Nilamadhaba Tripathy and Umapada Pal. 2006. Handwriting segmentation of unconstrained Oriya text. Sadhana 31, 6 (2006), 755–769.

Cited By

View all
  • (2024)VashaNet: An automated system for recognizing handwritten Bangla basic characters using deep convolutional neural networkMachine Learning with Applications10.1016/j.mlwa.2024.10056817(100568)Online publication date: Sep-2024
  • (2023)A time efficient offline handwritten character recognition using convolutional extreme learning machineThe Imaging Science Journal10.1080/13682199.2023.222301172:6(736-748)Online publication date: 24-Jun-2023

Index Terms

  1. Development of a Benchmark Odia Handwritten Character Database for an Efficient Offline Handwritten Character Recognition with a Chronological Survey

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Asian and Low-Resource Language Information Processing
    ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 6
    June 2023
    635 pages
    ISSN:2375-4699
    EISSN:2375-4702
    DOI:10.1145/3604597
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 June 2023
    Online AM: 21 February 2023
    Accepted: 03 February 2023
    Revised: 31 October 2022
    Received: 14 April 2021
    Published in TALLIP Volume 22, Issue 6

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Bounding box
    2. median blur filtering
    3. threshold
    4. convolutional neural network (CNN)
    5. benchmark evaluation
    6. pre-trained deep learning networks
    7. “matras” and “juktas
    8. ” Odia scripts
    9. preprocessing

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)96
    • Downloads (Last 6 weeks)13
    Reflects downloads up to 12 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)VashaNet: An automated system for recognizing handwritten Bangla basic characters using deep convolutional neural networkMachine Learning with Applications10.1016/j.mlwa.2024.10056817(100568)Online publication date: Sep-2024
    • (2023)A time efficient offline handwritten character recognition using convolutional extreme learning machineThe Imaging Science Journal10.1080/13682199.2023.222301172:6(736-748)Online publication date: 24-Jun-2023

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media