[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Developing an Offline and Real-Time Indian Sign Language Recognition System with Machine Learning and Deep Learning

Published: 20 February 2024 Publication History

Abstract

Sign language is a powerful form of communication for humans, and advancements in computer vision systems are driving significant progress in sign language recognition. In the context of Indian sign language (ISL), early research focused on differentiating a limited set of distinct hand signs, often relying on specialized hardware such as sensors and gloves, also most of the works were experimented on the dataset captured under controlled environments. This research aims to enhance communication for the speech and hearing impaired community by recognizing static images of ISL digits and alphabets in both offline and real-time scenarios. To achieve this, two publicly available datasets were used, containing a total of 42,000 sign images and 36,000 static signs, respectively. The dataset1 consists of sign images that were taken under controlled environments, whereas the dataset2 consists of sign images that were taken in different environments with varying backgrounds and lighting conditions. Dataset1 was experimented with and without using preprocessing techniques, while dataset2 underwent similar testing. We employed both machine learning and deep learning with CNN to categorize the ISL alphabets and numbers. In the machine learning approach, image preprocessing techniques such as HSV conversion, skin mask generation, and skin portion extraction and Gabor filtering were used to segment the region of interest, which was then fed to five ML models for sign prediction. In contrast, the DL approach used CNN model. In addition, probability ensemble testing was performed on both datasets to compare the accuracies. Real-time recognition was also conducted using a custom dataset, employing the YOLO-NAS-S model. This study contributes to the advancement of ISL recognition by conducting a comparative analysis of ML algorithms and CNNs, examining their performance with and without preprocessing techniques.

References

[1]
Deora D, Bajaj N. Indian sign language recognition, 2012 1st International Conference on Emerging Technology Trends in Electronics, Communication and Networking, IEEE 2012-978–1-4673-1627-9/12.
[2]
Nair AV and Bindu V A review on indian sign language recognition Int J Comput Appl 2013 73 22 33-38
[3]
Badenas J, Miguel Sanchiz J, and Filiberto P Motion-based segmentation and region tracking in image sequences Pattern Recognit 2001 34 661-670
[4]
Liao P-S, Chen T-S, and Chung P-C A fast algorithm for multilevel thresholding J Inf Sci Eng 2001 17 713-727
[5]
McIvor A, Zang B, Klette R. The background subtraction problem for video surveillance systems. In: International workshop robot vision 2001, Auckland, New Zealand, February 2001. Springer lecture notes in computer science. 1998. pp 176–83.
[6]
Sultana A and Rajapushpa T Vision based gesture recognition for alphabetical hand gestures using the SVM classifier Int J Comput Sci Eng Technol 2012 3 7 218-223
[7]
Nanivadekar PA, Kulkarni V. Indian sign language recognition: database creation, hand tracking and segmentation, International Conference on Circuits, Systems, Communication and Information Technology Applications, IEEE 2014, 978-1-4799-2494-3/14.
[8]
Raheja JL, Mishra A, and Chaudhary A Indian Sign language recognition using SVM Pattern Recognit Image Anal 2016 26 2 434-441
[9]
Loke P, Paranjpe J, Bhabal S, Kanere K. Indian sign language converter system using an android app, International Conference on Electronics, Communication and Aerospace Technology, 2017 IEEE, 978-105090-5686-6/17.
[10]
Beena MV and Agnisarman Namboodiri MN ASL numerals recognition from depth maps using artificial neural networks Middle-East J Sci Res 2017 25 7 1407-1413
[11]
Rodriguez KO, Chavez GC. Finger spelling recognition from RGB-D information using kernel descriptor, 2013 XXVI Conference on Graphics, Patterns and Images.
[12]
Molchanov P, Yang X, Gupta S, Kim K, Tyree S, Kautz J. Online detection and classification of dynamic hand gestures with recurrent 3D convolutional neural networks, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13]
Karpathy A, Toderici G, Shetty S, Leung T, Sukthankar R, Fei-Fei L. Large-scale video classification with convolutional neural networks. In CVPR, 2014.
[14]
Kpkl O, Gunduz A, Kose N, Rigoll G. Real-time hand gesture detection and classification using convolutional neural networks, paper accepted to IEEE International Conference on Automatic Face and Gesture Recognition (FG 2019).
[15]
Chen X, Gao K. Dense image network: Video spatial-temporal evolution encoding and understanding, paper submitted to ArXiv on 19 May 2018.
[16]
Kumar Makwana M. Sign language recognition, M. Tech thesis submitted to Indian Institute of Science, Bengaluru, June 2017.
[17]
Buehler P, Zisserman A, Everingham M. Learning sign language by watching tv (using weakly aligned subtitles). In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp 2961–2968. IEEE, 2009.
[18]
Cooper H, Ong E-J, Pugeault N, and Bowden R Sign language recognition using sub-units J Mach Learn Res 2012 13 1 2205-2231
[19]
Tharwat A, Gaber T, Ella Hassanien A, Shahin MK, and Refaat B Sift-based arabic sign language recognition system Afro-European conference for industrial advancement 2015 Cham Springer 359-370
[20]
Quan Yang. Chinese sign language recognition based on video sequence appearance modeling. In 2010 5th IEEE Conference on Industrial Electronics and Applications, pages 1537–1542. IEEE, 2010.
[21]
Al-Rousan M, Assaleh K, and Tala’a A Video-based signer-independent Arabic sign language recognition using hidden markov models Appl Soft Comput 2009 9 3 990-999
[22]
Badhe PC, Kulkarni V. Indian sign language translator using gesture recognition algorithm. In: 2015 IEEE International Conference on Computer Graphics, Vision and Information Security (CGVIS), pp 195–200. IEEE, 2015.
[23]
Starner T, Weaver J, and Pentland A Real-time American sign language recognition using desk and wearable computer based video IEEE Trans Pattern Anal Mach Intell 1998 20 12 1371-1375
[24]
Evangelidis GD, Singh G, and Horaud R Continuous gesture recognition from articulated poses European conference on computer vision 2014 Cham Springer 595-607
[25]
Zhang J, Zhou W, Xie C, Pu J, Li H. Chinese sign language recognition with adaptive hmm. In: 2016 IEEE International Conference on Multimedia and Expo (ICME), IEEE, pp 1–6. IEEE, 2016.
[26]
Wang SB, Quattoni A, Morency LP, Demirdjian D, Darrell T. Hidden conditional random fields for gesture recognition. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), vol. 2, pp 1521–1527. IEEE, 2006.
[27]
Sakoe H and Chiba S Dynamic programming algorithm optimization for spoken word recognition IEEE Trans Acoust Speech Signal Process 1978 26 1 43-49
[28]
Lichtenauer JF, Hendriks EA, and Reinders MJT Sign language recognition by combining statistical dtw and independent classification IEEE Trans Pattern Anal Mach Intell 2008 30 11 2040-2046
[29]
Nagarajan S and Subashini TS Static hand gesture recognition for sign language alphabets using edge oriented histogram and multi class svm Int J Comput Appl 2013 82 4 28-35
[30]
Zafrulla Z, Brashear H, Starner T, Hamilton H, Presti P. American sign language recognition with the kinect. In: Proceedings of the 13th International Conference on Multimodal Interfaces, pp 279–286, 2011.
[31]
Ming Lim K, Tan AWC, and Chiang Tan S Block-based histogram of optical flow for isolated sign language recognition J Vis Commun Image Represent 2016 40 538-545
[32]
Kulkarni VS and Lokhande SD Appearance based recognition of american sign language using gesture segmentation Int J Comput Sci Eng 2010 2 3 560-565
[33]
Donahue J, Anne Hendricks L, Guadarrama S, Rohrbach M, Venugopalan S, Saenko K, Darrell T. Long-term recurrent convolutional networks for visual recognition and description. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2625–2634, 2015.
[34]
Feichtenhofer C, Pinz A, Zisserman A. Convolutional two-stream network fusion for video action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1933–1941, 2016.
[35]
Camgoz NC, Hadfield S, Koller O, Bowden R. Using convolutional 3d neural networks for user-independent continuous gesture recognition. In: 2016 23rd International Conference on Pattern Recognition (ICPR), pp 49–54. IEEE, 20156.
[36]
Rokade YI and Jadav PM Indian sign language recognition system Int J Eng Technol 2017 9 3 189-196
[37]
Katoch S, Singh V, and Shanker Tiwary U Indian sign language recognition system using SURF with SVM and CNN Array 2022 14 100141
[38]
Shenoy K, Dastane T, Rao V, Vyavaharkar D. Real-time Indian sign language (ISL) recognition. In: 2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pp 1–9. IEEE, 2018.

Cited By

View all
  • (2024)Algorithm for Automatic Layout of Graphic Language and Its Application in Graphic DesignInternational Journal of Information Systems and Supply Chain Management10.4018/IJISSCM.34539617:1(1-18)Online publication date: 21-Jun-2024

Index Terms

  1. Developing an Offline and Real-Time Indian Sign Language Recognition System with Machine Learning and Deep Learning
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image SN Computer Science
          SN Computer Science  Volume 5, Issue 3
          Mar 2024
          750 pages

          Publisher

          Springer-Verlag

          Berlin, Heidelberg

          Publication History

          Published: 20 February 2024
          Accepted: 07 November 2023
          Received: 25 October 2023

          Author Tags

          1. Indian sign language
          2. Machine learning
          3. Deep learning
          4. CNN
          5. HSV
          6. Skin mask
          7. Gabor filters
          8. YOLO-NAS-S

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 21 Dec 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)Algorithm for Automatic Layout of Graphic Language and Its Application in Graphic DesignInternational Journal of Information Systems and Supply Chain Management10.4018/IJISSCM.34539617:1(1-18)Online publication date: 21-Jun-2024

          View Options

          View options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media