Multimodal Deep Learning Model for Cylindrical Grasp Prediction Using Surface Electromyography and Contextual Data During Reaching
Figure 1. Seven zones for surface EMG placement from [24].
Figure 2. CDF with the proposed cut-off threshold (700 samples), along with the percentage of rejected samples (7.77%).
Figure 3. Data distributions by class and contextual data: (a) Weight. (b) Task height.
Figure 4. Comparison of SPAN distributions by class: (a) Span 1, main span of the product. (b) Span 2, secondary span of the product.
Figure 5. (a) CNN for EMG signals; (b) FC neural network for contextual data.
Figure 6. Hybrid model architecture (M_HYBRID).
Figure 7. Training results of the models: (a) EMG, (b) contextual, (c) hybrid.
Figure 8. Confusion matrices of the models: (a) EMG, (b) contextual, (c) hybrid.
Figure 9. Comparison of accuracy and loss for hybrid models. The dashed lines indicate the best values achieved.
1. Introduction
2. Materials and Methods
2.1. Dataset
2.2. Data Analysis and Preparation
2.2.1. Class Labels
2.2.2. EMG Data
2.2.3. Context Data
2.2.4. Data Partitions
2.3. Classification Models
2.3.1. Unimodal Models
2.3.2. Multimodal Model
2.3.3. Study of the Relevance of Contextual Variables
2.3.4. Hyperparameter Setup
- Optimizer: ADAM
- Loss function: binary cross-entropy
- Activation function of hidden layers: ReLU
- Activation function of output layers: softmax
- First convolutional layer: number of filters = 16, kernel size = 5 (these filters operate over the 7 EMG channels)
- Second convolutional layer: number of filters = 32, kernel size = 5
- Learning rate scheduler: cosine decay (alpha = 0.01, initial learning rate = 0.001)
- Epochs: 300
- Batch size: 68
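The learning-rate schedule listed above can be illustrated with a short sketch. It assumes the widely used cosine-decay definition in which alpha is the final learning rate expressed as a fraction of the initial one. The number of decay steps is not given in the source, so the sketch assumes that the schedule spans the whole run (300 epochs over the 2720 training recordings reported in the data-partition table, in batches of 68).

```python
import math

# Values taken from the hyperparameter list.
INITIAL_LR = 0.001
ALPHA = 0.01
EPOCHS = 300
BATCH_SIZE = 68

# Training-set size from the data-partition table.
TRAIN_RECORDINGS = 2720

# Assumption: the decay covers every optimizer step of the training run.
STEPS_PER_EPOCH = math.ceil(TRAIN_RECORDINGS / BATCH_SIZE)
DECAY_STEPS = EPOCHS * STEPS_PER_EPOCH


def cosine_decay(step, initial_lr=INITIAL_LR, alpha=ALPHA, decay_steps=DECAY_STEPS):
    """Return the learning rate at a given optimizer step."""
    step = min(step, decay_steps)  # hold the floor value once the decay ends
    cosine = 0.5 * (1.0 + math.cos(math.pi * step / decay_steps))
    return initial_lr * ((1.0 - alpha) * cosine + alpha)


if __name__ == "__main__":
    for epoch in (0, 75, 150, 225, 300):
        lr = cosine_decay(epoch * STEPS_PER_EPOCH)
        print(f"epoch {epoch:3d}: learning rate = {lr:.6f}")
```

Under these assumptions the rate starts at 0.001 and settles at alpha times that value (0.00001) by the last epoch.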
2.3.5. Classification Metrics
3. Results
3.1. Unimodal Versus Multimodal Models
3.1.1. Training and Validation Results
3.1.2. Test Results
3.1.3. Comparison Overview
3.2. Study of the Relevance of Contextual Variables
4. Discussion
4.1. Unimodal Versus Multimodal Models
4.2. Study of the Relevance of Contextual Variables
5. Conclusions and Future Directions
Author Contributions
Data Availability Statement
Conflicts of Interest
EMG | Electromyography |
HEs | Hand exoskeletons |
CNN | Convolutional Neural Network |
FE | Feature Extraction |
ADL | Activities of Daily Life |
MVC | Maximum Voluntary Contraction |
T | Task |
ET | Elementary Task |
CDF | Cumulative Distribution Function |
FC | Fully Connected |
M_EMG | Model based on EMG data input |
M_CONTEXT | Model based on contextual data input |
M_HYBRID | Multimodal model |
MH_NO_SPAN | Multimodal model without span variables |
MH_NO_SPAN1 | Multimodal model without span 1 variable |
MH_NO_SPAN2 | Multimodal model without span 2 variable |
MH_NO_THEIGHT | Multimodal model without task height variable |
MH_NO_WEIGHT | Multimodal model without weight variable |
- Corbetta, D.; Santello, M. (Eds.) Reach-to-Grasp Behavior: Brain, Behavior, and Modelling Across the Life Span, 1st ed.; Routledge: New York, NY, USA, 2018.
- Seegelke, C.; Hughes, C.M.; Knoblauch, A.; Schack, T. Grasp posture planning during multi-segment object manipulation tasks — Interaction between cognitive and biomechanical factors. Acta Psychol. 2013, 144, 513–521.
- Feix, T.; Bullock, I.M.; Dollar, A.M. Analysis of Human Grasping Behavior: Object Characteristics and Grasp Type. IEEE Trans. Haptics 2014, 7, 311–323.
- Guo, L.; Lu, Z.; Yao, L. Human-Machine Interaction Sensing Technology Based on Hand Gesture Recognition: A Review. IEEE Trans. Hum.-Mach. Syst. 2021, 51, 300–309.
- Pérez-Reynoso, F.; Farrera-Vazquez, N.; Capetillo, C.; Méndez-Lozano, N.; González-Gutiérrez, C.; López-Neri, E. Pattern Recognition of EMG Signals by Machine Learning for the Control of a Manipulator Robot. Sensors 2022, 22, 3424.
- Li, W.; Shi, P.; Yu, H. Gesture Recognition Using Surface Electromyography and Deep Learning for Prostheses Hand: State-of-the-Art, Challenges, and Future. Front. Neurosci. 2021, 15, 621885.
- Xiong, D.; Zhang, D.; Zhao, X.; Zhao, Y. Deep Learning for EMG-based Human-Machine Interaction: A Review. IEEE/CAA J. Autom. Sin. 2021, 8, 512–533.
- Noronha, B.; Accoto, D. Exoskeletal Devices for Hand Assistance and Rehabilitation: A Comprehensive Analysis of State-of-the-Art Technologies. IEEE Trans. Med. Robot. Bionics 2021, 3, 525–538.
- Sultana, A.; Ahmed, F.; Alam, M.S. A systematic review on surface electromyography-based classification system for identifying hand and finger movements. Healthc. Anal. 2023, 3, 100126.
- Chowdhury, R.H.; Reaz, M.B.I.; Ali, M.A.B.M.; Bakar, A.A.A.; Chellappan, K.; Chang, T.G. Surface Electromyography Signal Processing and Classification Techniques. Sensors 2013, 13, 12431–12466.
- Hargrove, L.; Englehart, K.; Hudgins, B. A training strategy to reduce classification degradation due to electrode displacements in pattern recognition based myoelectric control. Biomed. Signal Process. Control 2008, 3, 175–180.
- Simar, C.; Colot, M.; Cebolla, A.M.; Petieau, M.; Cheron, G.; Bontempi, G. Machine learning for hand pose classification from phasic and tonic EMG signals during bimanual activities in virtual reality. Front. Neurosci. 2024, 18, 1329411.
- Siu, H.C.; Shah, J.A.; Stirling, L.A. Classification of Anticipatory Signals for Grasp and Release from Surface Electromyography. Sensors 2016, 16, 1782.
- Batzianoulis, I.; El-Khoury, S.; Pirondini, E.; Coscia, M.; Micera, S.; Billard, A. EMG-based decoding of grasp gestures in reaching-to-grasping motions. Robot. Auton. Syst. 2017, 91, 59–70.
- Batzianoulis, I.; Krausz, N.E.; Simon, A.M.; Hargrove, L.; Billard, A. Decoding the grasping intention from electromyography during reaching motions. J. NeuroEng. Rehabil. 2018, 15, 57.
- Wang, Z.; Xiong, C.; Zhang, Q. Enhancing the online estimation of finger kinematics from sEMG using LSTM with attention mechanisms. Biomed. Signal Process. Control 2024, 92, 105971.
- Gandolla, M.; Ferrante, S.; Ferrigno, G.; Baldassini, D.; Molteni, F.; Guanziroli, E.; Cottini, M.C.; Seneci, C.; Pedrocchi, A. Artificial neural network EMG classifier for functional hand grasp movements prediction. J. Int. Med. Res. 2017, 45, 1831–1847.
- Zandigohar, M.; Han, M.; Erdoğmuş, D.; Schirner, G. Towards Creating a Deployable Grasp Type Probability Estimator for a Prosthetic Hand. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Cham, Switzerland, 2020; pp. 44–58.
- Jaramillo-Yánez, A.; Benalcázar, M.E.; Mena-Maldonado, E. Real-Time Hand Gesture Recognition Using Surface Electromyography and Machine Learning: A Systematic Literature Review. Sensors 2020, 20, 2467.
- Coskun, M.; Yildirim, O.; Demir, Y.; Acharya, U.R. Efficient deep neural network model for classification of grasp types using sEMG signals. J. Ambient Intell. Humaniz. Comput. 2022, 13, 4437–4450.
- Atzori, M.; Gijsberts, A.; Castellini, C.; Caputo, B.; Hager, A.G.M.; Elsig, S.; Giatsidis, G.; Bassetto, F.; Müller, H. Electromyography data for non-invasive naturally-controlled robotic hand prostheses. Sci. Data 2014, 1, 140053.
- Pizzolato, S.; Tagliapietra, L.; Cognolato, M.; Reggiani, M.; Müller, H.; Atzori, M. Comparison of six electromyography acquisition setups on hand movement classification tasks. PLoS ONE 2017, 12, e0186132.
- Mora, M.C.; García-Ortiz, J.V.; Cerdá-Boluda, J. sEMG-Based Robust Recognition of Grasping Postures with a Machine Learning Approach for Low-Cost Hand Control. Sensors 2024, 24, 2063.
- Roda-Sales, A.; Jarque-Bou, N.J.; Bayarri-Porcar, V.; Gracia-Ibáñez, V.; Sancho-Bru, J.L.; Vergara, M. MOVMUS-UJI Dataset & ERGOMOVMUS: EMG and kinematics data of the hand in activities of daily living with special interest for ergonomics. Zenodo 2023.
- Jarque-Bou, N.J.; Vergara, M.; Sancho-Bru, J.L.; Roda-Sales, A.; Gracia-Ibáñez, V. Identification of forearm skin zones with similar muscle activation patterns during activities of daily living. J. NeuroEng. Rehabil. 2018, 15, 91.
- Hermens, H.J.; Freriks, B.; Disselhorst-Klug, C.; Rau, G. Development of recommendations for SEMG sensors and sensor placement procedures. J. Electromyogr. Kinesiol. 2000, 10, 361–374.
- Konrad, P. The ABC of EMG: A Practical Introduction to Kinesiological Electromyography, version 1.0 ed.; Noraxon Inc.: Scottsdale, AZ, USA, 2005.
- Vergara, M.; Sancho-Bru, J.L.; Gracia-Ibáñez, V.; Pérez-González, A. An introductory study of common grasps used by adults during performance of activities of daily living. J. Hand Ther. 2014, 27, 225–234.
- Fitts, P.M. The information capacity of the human motor system in controlling the amplitude of movement. J. Exp. Psychol. 1954, 47, 381–391.
- Shen, S.; Gu, K.; Chen, X.R.; Yang, M.; Wang, R.C. Movements Classification of Multi-Channel sEMG Based on CNN and Stacking Ensemble Learning. IEEE Access 2019, 7, 137489–137500.
- Terven, J.; Córdova-Esparza, D.M.; Romero-González, J.A. A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS. Mach. Learn. Knowl. Extr. 2023, 5, 1680–1716.
- Radford, A.; Kim, J.W.; Hallacy, C.; Ramesh, A.; Goh, G.; Agarwal, S.; Sastry, G.; Askell, A.; Mishkin, P.; Clark, J.; et al. Learning Transferable Visual Models From Natural Language Supervision. In Proceedings of the 38th International Conference on Machine Learning, Virtual, 18–24 July 2021.
- He, Y.; Shima, R.; Fukuda, O.; Bu, N.; Yamaguchi, N.; Okumura, H. Development of distributed control system for vision-based myoelectric prosthetic hand. IEEE Access 2019, 7, 54542–54549.
- Shi, C.; Yang, D.; Zhao, J.; Liu, H. Computer Vision-Based Grasp Pattern Recognition with Application to Myoelectric Control of Dexterous Hand Prosthesis. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 2090–2099.
- Gong, W. Lightweight Object Detection: A Study Based on YOLOv7 Integrated with ShuffleNetv2 and Vision Transformer. arXiv 2024, arXiv:2403.01736.
- Roda-Sales, A.; Jarque-Bou, N.J.; Bayarri-Porcar, V.; Gracia-Ibáñez, V.; Sancho-Bru, J.L.; Vergara, M. Electromyography and kinematics data of the hand in activities of daily living with special interest for ergonomics. Sci. Data 2023, 10, 814.
Task | Elementary Task | Action |
Transporting a plastic bottle of water | Move closer to take the object from the top shelf | Reaching |
Transporting a plastic bottle of water | Take the object from the top shelf. Leave it on the kitchen top | Transportation |
Transporting a plastic bottle of water | Release the object and return to P1 | Release |
Transporting a plastic bottle of water | Move closer to take the object from the kitchen top | Reaching |
Transporting a plastic bottle of water | Take the object from the kitchen top. Leave it on the bottom shelf | Transportation |
Transporting a plastic bottle of water | Release the object and return to P1 | Release |
Plugging a flat electric plug | Push to put the plug in | Pushing |
Plugging a flat electric plug | Pull to take the plug out | Pulling |
Plugging a flat electric plug | Hold for 2 seconds | Holding |
Variable | Description | Original Range | New Range |
ID | Unique identifier for each recording used for training the models. | – | – |
PARTICIPANT | Numeric identifier for the participant performing the task. | Values: 1–30 | – |
T (Task) | Combination of movements that each participant performed. The same task can appear several times for one participant, as each task comprises several elementary tasks. | Values: 102–180, 202–234, 302–350 | – |
ET (Elementary Task) | Each movement that the participant performed in each task. | Values: 1–614 | – |
datasize | Number of EMG frames for each recording. | 41–6337 | 1000 ¹ |
PRODUCT_DH | Identifier of the product grasped with the dominant hand. | Values: 1–105 | – |
WEIGHT ² | Weight of the product. | In grams | [0, 1] ¹ |
SPAN_1/SPAN_2 ² | Width of the part of the object where it is grasped. If the object has two possible grasping parts (e.g., a water bottle), both spans are specified; otherwise SPAN_2 = 0. | In mm | [0, 1] ¹ |
GRASP_DH | LABEL TO PREDICT. Grasp type performed with the dominant hand. | 1 = Cylindrical, 2 = Lateral pinch, 3 = Lumbrical, 4 = Oblique, 5 = Special pinch, 6 = Hook, 7 = Intermediate, 8 = Five-finger pinch, 9 = Free | 0 = Non-cylindrical (classes 2–8), 1 = Cylindrical ¹ |
ACTION_DH | Action performed with the dominant hand. | 1 = Reaching, 2 = Releasing, 3 = Transporting, 4 = Holding, 5 = Pouring, 6 = Pulling, 7 = Pushing, 8 = Twisting (clockwise), 9 = Twisting (anticlockwise), 10 = Other | Only using 1 (reaching) |
TASK_HEIGHT ² | Height of task performance. | 1 = High–Median, 2 = Median, 3 = Median–Low | One-hot encoded |
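The "New Range" column of the variable table can be sketched in code as follows. This is only an illustration: the table gives the target ranges but not the exact procedure, so min-max scaling is an assumption here, as is excluding class 9 (Free) from both labels because the table maps only classes 2–8 to the non-cylindrical class. All function names and the sample weights are hypothetical.

```python
def min_max_scale(values):
    """Rescale a list of numbers to [0, 1] (assumed normalisation method)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def one_hot_task_height(level):
    """Encode TASK_HEIGHT (1 = High-Median, 2 = Median, 3 = Median-Low)."""
    if level not in (1, 2, 3):
        raise ValueError(f"unexpected TASK_HEIGHT value: {level}")
    return [1 if level == k else 0 for k in (1, 2, 3)]


def binarize_grasp(grasp_dh):
    """Map GRASP_DH to 1 (cylindrical), 0 (classes 2-8), or None (excluded)."""
    if grasp_dh == 1:
        return 1
    if 2 <= grasp_dh <= 8:
        return 0
    return None  # assumption: 9 = Free belongs to neither class


if __name__ == "__main__":
    weights_g = [100, 300, 500]  # hypothetical product weights in grams
    print(min_max_scale(weights_g))
    print(one_hot_task_height(2))
    print([binarize_grasp(g) for g in (1, 5, 9)])
```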
Group | Number of Subjects | Recordings (% of Data) |
Training | 16 (53.3%) | 2720 (70%) |
Validation | 4 (13.3%) | 502 (13%) |
Testing | 10 (33.3%) | 661 (17%) |
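Because the table reports the partitions in terms of subjects, the split appears to be made per participant, so that no participant contributes recordings to more than one group. The sketch below shows one way to build such a split. The random assignment and the seed are assumptions, since the source does not state how participants were allocated, and the function names are hypothetical.

```python
import random


def split_participants(participant_ids, n_train=16, n_val=4, seed=0):
    """Assign whole participants to train/validation/test groups."""
    ids = sorted(set(participant_ids))
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(ids)
    train = set(ids[:n_train])
    val = set(ids[n_train:n_train + n_val])
    test = set(ids[n_train + n_val:])
    return train, val, test


def group_of(participant, train, val, test):
    """Return the partition name for a recording's PARTICIPANT value."""
    if participant in train:
        return "training"
    if participant in val:
        return "validation"
    if participant in test:
        return "testing"
    raise KeyError(f"unknown participant: {participant}")


if __name__ == "__main__":
    # PARTICIPANT takes the values 1-30 according to the variable table.
    train_ids, val_ids, test_ids = split_participants(range(1, 31))
    print(len(train_ids), len(val_ids), len(test_ids))
```

Splitting by participant rather than by recording means the test metrics reflect performance on people the model has never seen.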
Metrics | M_EMG Tr | M_EMG Val | M_EMG Test | M_CONTEXT Tr | M_CONTEXT Val | M_CONTEXT Test | M_HYBRID Tr | M_HYBRID Val | M_HYBRID Test |
Accuracy | 0.90 | 0.78 | 0.80 | 0.84 | 0.86 | 0.87 | 0.98 | 0.93 | 0.94 |
Precision | 0.92 | 0.75 | 0.81 | 0.78 | 0.86 | 0.91 | 0.98 | 0.92 | 0.96 |
Recall | 0.81 | 0.79 | 0.81 | 0.83 | 0.85 | 0.84 | 0.97 | 0.93 | 0.92 |
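For reference, the three metrics in the table follow the standard definitions for a binary classifier. The sketch below computes them from confusion-matrix counts, with the cylindrical grasp taken as the positive class. The counts are hypothetical and are not taken from the paper, and the paper's averaging convention is not stated here.

```python
def binary_metrics(tp, fp, fn, tn):
    """Return accuracy, precision and recall from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard against empty denominators when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    acc, prec, rec = binary_metrics(tp=40, fp=10, fn=10, tn=40)
    print(f"accuracy={acc:.2f} precision={prec:.2f} recall={rec:.2f}")
```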
Model | Loss | Accuracy | Precision | Recall |
M_EMG | 0.49 | 0.80 | 0.81 | 0.81 |
M_CONTEXT | 0.34 | 0.87 | 0.91 | 0.84 |
M_HYBRID | 0.19 | 0.94 | 0.96 | 0.92 |
MH_NO_SPAN | 0.43 | 0.83 | 0.84 | 0.84 |
MH_NO_SPAN1 | 0.36 | 0.86 | 0.89 | 0.83 |
MH_NO_SPAN2 | 0.20 | 0.94 | 0.94 | 0.94 |
MH_NO_WEIGHT | 0.30 | 0.90 | 0.92 | 0.88 |
MH_NO_THEIGHT | 0.22 | 0.93 | 0.96 | 0.92 |
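One way to read the ablation rows is to compute how much test accuracy each reduced model loses relative to M_HYBRID. The sketch below does this using only the accuracy values from the table above. Ranking the variables by this drop is an illustrative reading, not a procedure described in the source.

```python
# Test accuracy values copied from the table above.
TEST_ACCURACY = {
    "M_HYBRID": 0.94,
    "MH_NO_SPAN": 0.83,
    "MH_NO_SPAN1": 0.86,
    "MH_NO_SPAN2": 0.94,
    "MH_NO_WEIGHT": 0.90,
    "MH_NO_THEIGHT": 0.93,
}


def accuracy_drops(accuracy, baseline="M_HYBRID"):
    """Return the accuracy lost by each ablated model versus the baseline."""
    base = accuracy[baseline]
    return {
        name: round(base - value, 2)  # rounding removes floating-point noise
        for name, value in accuracy.items()
        if name != baseline
    }


if __name__ == "__main__":
    drops = accuracy_drops(TEST_ACCURACY)
    for name in sorted(drops, key=drops.get, reverse=True):
        print(f"{name}: -{drops[name]:.2f}")
```

On these figures, removing both span variables costs the most accuracy (0.11), while removing only SPAN_2 leaves accuracy unchanged.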
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (
Lázaro, R.; Vergara, M.; Morales, A.; Mollineda, R.A. Multimodal Deep Learning Model for Cylindrical Grasp Prediction Using Surface Electromyography and Contextual Data During Reaching. Biomimetics 2025, 10, 145.