1. Introduction
The 21st century can be considered an era of mechatronics, including robotics, as robots are increasingly integrated into human society. Among these advancements, myoelectric prostheses have emerged as transformative devices that replace parts of the human body. Unlike body-powered prostheses, myoelectric prostheses leverage EMG signals as control inputs, enabling a range of natural movements, such as hand opening and closing. According to Ottobock, these prostheses provide strong grip force with minimal effort and offer flexible hand movements regardless of arm posture, significantly enhancing the user’s ability to perform everyday tasks in various environments [
1,
2,
3].
Research on myoelectric prostheses has primarily focused on improving hand gesture recognition and movement prediction using machine learning techniques such as neural networks (NNs), convolutional neural networks (CNNs), and multi-scale convolutional neural networks (MSCNNs) [
4,
5]. While these methods show high accuracy in recognizing hand patterns, achieving real-time performance and responsiveness remains a challenge, particularly as the complexity of movements and the amount of EMG data increase.
One critical challenge in human–machine interfaces, including myoelectric prostheses, is the delay between user intent and the system’s response. Delays exceeding 300 ms have been shown to negatively impact user performance in remote operations [
6]. To address this issue, researchers have explored Electro-Mechanical Delay (EMD), which refers to the time lag (typically 30–100 ms) between the generation of EMG signals and the onset of movement. By leveraging EMD, myoelectric systems can predict user intent in advance, thereby reducing the overall response time [
7]. Recent studies, such as Tigrini et al. (2023), have demonstrated the effectiveness of transient EMG data in predicting movement intent within a 150 ms window centered on movement onset, highlighting the applicability of EMD in anticipatory control systems [
8].
Hayashi et al. also proposed utilizing EMD to improve the responsiveness of the system. Additionally, they proposed a Single Input–Single Output (SISO)-NARX model for wrist angle estimation to predict wrist movements and control a single-link manipulator. In their approach, the input signal was synthesized by combining the EMG signals obtained from a four-channel EMG sensor into a single signal. To enhance the manipulator’s responsiveness and accuracy, a controller combining feedforward and feedback control, specifically a ZPETC+PID controller, was employed. Hayashi et al. set the practical operation of devices such as myoelectric prostheses as a future research goal [
7].
Based on the foundational research by Hayashi et al., this study aims to address the challenges encountered during practical implementation. Hayashi et al.’s approach combined four EMG signals into a single input for a SISO-NARX model, which failed to fully utilize the unique characteristics of each EMG channel. This inherently limited the system’s scalability, particularly in handling complex multi-degrees-of-freedom (Multi-DOF) movements, such as finger articulation. Additionally, Hayashi et al. utilized a four-channel wired sensor to acquire EMG signals, which required precise identification of muscle positions and adjustments during attachment. This process could cause inconvenience for users during operation and adjustment, potentially reducing user comfort and usability [
9]. However, utilizing a band-type wireless sensor offers the advantage of providing a simple interface that users can easily wear, enabling signal acquisition without the need for complex equipment [
10,
11].
To address the scalability issues related to recognizing and estimating finger movements in addition to wrist motions, we propose replacing the four-channel wired sensor with an eight-channel band-type wireless sensor. This approach allows for acquiring EMG signals from eight channels instead of the original four channels, thereby enabling the system to estimate complex movements, such as finger motions, and ensuring scalability. However, switching from a four-channel wired sensor to an eight-channel wireless sensor is expected to introduce communication delays. While Hayashi et al.’s research considered only the motor dynamics using wired sensors, switching to a wireless sensor is expected to introduce communication delay issues. We will verify this communication delay and investigate the applicability of Electro-Mechanical Delay (EMD) in such conditions. As the angle estimation model, we propose using a MISO-NARX model with ridge regression [
12], which is expected to prevent overfitting and improve generalization performance. Although it is generally known in the research domain of the system identification that multi-input models tend to have reduced identification accuracy due to increased complexity compared to the SISO-NARX model, we anticipate that ridge regression will enhance the performance of the MISO-NARX model, making it superior to the SISO-NARX model. The performance of ZPETC+PID control has been validated in prior studies. However, there are differences in the control targets between previous studies and this research. Therefore, we will verify whether the same control method applied to the servo motor, which is the control target in this study, produces results consistent with prior findings and confirm that there are no issues during practical implementation.
A systematic literature review was conducted using major academic databases, including IEEE Xplore, PubMed, and Google Scholar, with keywords such as “MISO NARX model”, “ridge regression EMG control”, and “multi-channel EMG analysis”. Despite extensive searches within the past five years, no studies were found that expanded upon the work of Hayashi et al. to integrate a MISO-NARX model or an eight-channel EMG configuration. This absence of prior work highlights the novelty of this research, which pioneers the application of advanced multi-channel configurations in wrist angle estimation systems.
By developing a scalable and responsive interface, this study paves the way for myoelectric prostheses that can perform complex, user-defined motions. In the future, this approach could enable prosthetic devices to execute versatile and intuitive movements, transforming them from tools of functional replacement to systems that mimic natural human motion.
2. Materials and Methods
2.1. Latency Assessment for Robotic Hand Motion
To conduct research on myoelectric prostheses, we utilized a hand-shaped robotic system, as shown in
Figure 1. This robot features five motors for finger movement and two motors for wrist movement, simulating the joints of a hand. As an initial step toward implementing full hand movements, we are currently focusing on wrist movements only. The motors used for hand movement control are servo motors, as depicted in
Figure 2.
The servo motor in
Figure 2 is a GWS Micro 2BBMG—micro servo with the following specifications [
13]:
Torque: 5.40 kg-cm at 4.8 V, 6.41 kg-cm at 6.0 V Speed: 0.17 s/60° at 4.8 V, 0.14 s/60° at 6.0 V.
The GWS Micro 2BBMG—micro servo is an analog servo motor and does not have a built-in PID control algorithm.
Before configuring the system, we measured the time taken by the robot hand in
Figure 1 to reach the target angle after applying a step input in order to verify the delay time. To verify the time it takes for the servo motor to move to the target angle, we used the AS5601 magnetic encoder along with an Arduino Mega. The AS5601 features a 12-bit resolution, offering a high-resolution output of 4096 PPR (Pulses Per Revolution). By utilizing the magnetic encoder, we measured the time and angle required for the servo motor to move from 0 degrees to the target angle of 60 degrees. Also, the transfer function of the control plant is derived using the measured step response.
2.2. System Design
This study builds upon the work of Masamichi Hayashi, Hiroshi Kogure, Kazuhide Ura, Masami Iwase, Teruyoshi Sadahiro, Shosiro Hatakeyama, and Eita Sawaguchi in their paper titled “Development of Zero-Phase-Tracking Man-Machine Interface with Electro-Mechanical Delay of Electromyogram.” The authors explored the application of Electro-Mechanical Delay (EMD) in EMG signals to synchronize human movements with machine responses. They implemented a Zero-Phase Error Tracking Control (ZPETC) system combined with a Nonlinear Delayed Auto-Regressive eXogenous (NDARX) model to predict wrist movements from EMG signals, reducing phase lag and improving system responsiveness.
Building on this prior work, our study combines PID control with ZPETC to further enhance the stability and response time of the myoelectric prosthesis control system. By integrating the EMD-based compensation approach from earlier research, we aim to minimize delay and develop a more responsive control system that enhances real-time performance. Additionally, we aim to validate the system’s effectiveness through real-world implementation, addressing the challenges identified in previous studies.
Figure 3 provides a block diagram of the control system used for the robotic hand. The system integrates Electro-Mechanical Delay (EMD) compensation into its processes to synchronize the user’s intent with the robotic hand’s response. First, EMG signals generated by wrist movements are captured and passed through a low-pass filter to remove noise and extract relevant features. These filtered signals are then processed by a PC-based NARX model, which estimates the wrist angle.
The estimated angle serves as the input for two control strategies: the feedforward control (ZPETC) and the feedback control (PID controller). ZPETC generates a target trajectory by leveraging the EMD time and ensures that the system tracks this trajectory without phase delay. Meanwhile, the PID controller minimizes the error between the motor’s output angle and the desired wrist angle by correcting for any deviations. The combined outputs of these controllers are sent to the robotic hand, ensuring precise and responsive control.
Finally, the system’s effectiveness is validated by comparing the motor angle with the actual wrist angle, assessing how accurately the robotic hand replicates the intended movement. The combination of ZPETC and the PID controller ensures fast, accurate, and stable operation by addressing both phase delay and real-time error correction.
2.3. EMG Signal Measurement
Measurement of the EMG signal uses the Myo armband in
Figure 4a. The Myo armband is a wearable device equipped with eight EMG electrodes, a nine-axis inertial measurement device, and a transmission module. The sampling frequency is 200 Hz, and the data are transmitted to the outside using Bluetooth low energy (BLE) technology [
14]. In this study, the four channels of the Myo armband are placed on the extensor carpi radialis longus, as shown in
Figure 4b, and worn to acquire and use the EMG signal during the Palmar flexion and Dorsi flexion operation of the wrist.
This study was conducted using data from a single healthy 27-year-old male participant. The ultimate goal of this study is to develop a control system that is applicable to myoelectric prostheses. Since myoelectric prostheses are customized for individuals, the control system is designed to be calibrated by the user after development. Therefore, this study was conducted based on data from a single participant. To ensure reliability, data collection was performed under repeated conditions for the same participant. Also, this study utilized anonymized EMG signals that cannot identify specific individuals and collected data using non-invasive methods. The purpose of the data usage was limited to technical research, such as the development of an algorithm for estimating wrist angles. It has been confirmed that this study is outside the scope of ethical approval requirements under the Tokyo Denki University guidelines. Furthermore, this study adhered to ethical guidelines and ensured the protection of participants’ rights and privacy during data usage and processing.
As shown in
Figure 4b, the participant wore the Myo armband on the extensor carpi radialis longus of the right hand. During the experiment, the participant was seated comfortably in a chair with the arm positioned at approximately 90° relative to the torso, as illustrated in
Figure 5. The participant alternated between Dorsi flexion (
Figure 6a) and Palmar flexion (
Figure 6b) movements, performing each movement for 5 s. For NARX model training data, wrist movements were performed at the participant’s maximum flexion angle (0° to −80° for
Figure 6a and 0° to 80° for
Figure 6b) with a controlled speed, completing 1–2 repetitions per movement within 5 s. Test data were collected by performing wrist movements with randomized angles, sequences, and speeds to evaluate the model’s generalization performance. The EMG signal contains high-frequency noise components. Therefore, high-frequency noise is removed using a low-pass filter (LPF). Specifically, muscle motor units associated with hand movements predominantly generate activation signals in the low-frequency band [
15]. Additionally, a study utilizing Multivariate Variational Mode Decomposition (MVMD) [
16] has demonstrated that the low-frequency components contain important motor information, and pattern recognition performance improves when extracting the low-frequency band. Based on this, the current study applied a low-pass filter with a cutoff frequency set at 5 Hz. The transfer function of the low-pass filter is represented by Equation (
1).
represents the angular frequency of the filter.
2.4. Wrist Angle Measurement
The SG65 goniometer from Biometrics Ltd., shown in
Figure 7a, was used to measure wrist angles. The SG65 is a wired, twin-axis electronic goniometer with a full measurement range of
. This device offers an accuracy of
and a repeatability of
. Considering human movement, an angular error of 2 degrees is generally negligible [
17].
To measure wrist angles using the SG65, the joint of the SG65 was positioned on the wrist joint as shown in
Figure 7b. The angle measured by the SG65 can be checked using the K800 Amplifier from the same company, as shown in
Figure 8. The K800 amplifier has a sampling frequency of approximately 5 kHz [
18,
19]. The SG65 goniometer, when used with the K800 amplifier, specifies that
corresponds to
,
to
, and
to
. Using this information, the sensitivity of the sensor is calculated, as shown in Equations (
2)–(
6). The calculation result indicates that the sensitivity is approximately
.
The wrist joint angle measured by the SG65 is converted into voltage and transmitted to the PC. Therefore, it is necessary to confirm the correlation between voltage and joint angle. We measured the voltage and angle, increasing the angle from −90 degrees to 90 degrees in steps of 30 degrees. Additionally, when measuring the EMG signals and wrist angles simultaneously, the measurement was conducted by wearing both the Myo armband and the SG65 goniometer simultaneously, as shown in
Figure 7a,b using the same method described in the EMG Signal Measurement section. Due to the difference in sampling frequencies between the two sensors, the SG65 data were configured to be collected only at the time points when EMG signals were recorded. This approach synchronized the sampling frequencies, making the analysis more convenient.
where:
S: Sensitivity (V/degree)
: Change in output voltage from 0 to 4 (V)
: Change in input angle from −180 to 180 (degrees)
Sensitivity is calculated as:
Therefore, the sensitivity of the SG65 goniometer is approximately:
2.5. NARX Model (Nonlinear AutoRegressive eXogenous Model)
To control a servo motor using EMG signals, it is essential to understand the correlation between wrist angle and EMG signals. Furthermore, since EMG signals exhibit nonlinearity, a model with coefficients that vary depending on the output is required. To model this correlation, we propose the Nonlinear AutoRegressive eXogenous (NARX) model. The NARX model is well-suited for capturing the nonlinear characteristics of EMG signals. Additionally, NARX models are ideal for time-series prediction, as the output depends on past output values (autoregressive property) and exogenous inputs. Therefore, the NARX model is appropriate for deriving estimated angles from EMG in real-time to be used as the reference trajectory for ZPETC.
To understand the NARX model, it is necessary to first comprehend the ARX (AutoRegressive with eXogenous) model, its foundational form. The ARX model represents the output of a dynamic system as a linear combination of past output values and external input values. The basic equation of the ARX model is defined as follows:
: Output at time k, e.g., wrist angle (V).
: Input at time k, e.g., EMG signal amplitude (V).
: Coefficients of the autoregressive (output) terms.
: Coefficients of the exogenous (input) terms.
: Number of past output data points (output lag order).
: Number of past input data points (input lag order).
In Equation (
7),
and
are set to 2. The order selection for the ARX and NARX models will be described in detail in the subsequent Determination of the NARX Model Orders
,
, and Interval
m section. Additionally, since the input for the EMG signals in this experiment consists of eight channels,
is replaced with
(
) and
with
(
). The modified equation can be expressed as follows:
Equation (
8) represents the fundamental ARX model equation used in this study. The NARX model extends this equation to accommodate nonlinear signals.
The NARX model divides the dataset of wrist angles (output) and electromyography (EMG) signals (input) into m segments based on the output values and applies a separate ARX model to each segment to capture nonlinearity. Here, m represents the number of divisions within the output range, enabling a detailed analysis of the data characteristics within each segment.
In this study,
m is set to 4, and the rationale for this choice will be discussed in detail in the Determination of the NARX Model Orders
,
, and Interval
m section.
The coefficients for each segment
m in (
9) can be reorganized as coefficient functions
, resulting in the following MISO-NARX model equation:
Using the constructed MISO-NARX model (
10), wrist joint angles can be estimated from electromyography (EMG) signals. For the estimation, coefficients such as
in (
8) need to be determined. These coefficients are represented as a coefficient vector
. This can be expressed as Equation (
11). The corresponding calculation is described using the least squares method in Equation (
12).
The inputs and outputs used in Equation (
12) are represented by Equations (
13) and (
14), respectively. Here,
represents the input data in segment
m, and
represents the output data in segment
m.
Using the same method, the coefficient functions
can be determined, and the MISO-NARX model (
10) can be constructed.
2.6. Ridge Regression
When constructing the MISO-NARX model, increasing the complexity of the model raises the likelihood of overfitting to the training data. In this study, we use EMG signals from eight channels, each of which is related to wrist movements. Since the signals from each channel are highly correlated with each other, the risk of overfitting increases even more. Highly correlated data can significantly increase the variance of the model coefficients, causing the model to become overly sensitive to specific data points.
To address this issue, we propose applying ridge regression. Ridge regression works by incorporating L2 regularization, which limits the magnitude of the model’s coefficients, thereby adjusting the influence of each variable. This prevents the model from assigning excessively large values to the coefficients, which helps improve the generalization performance of the model and prevents overfitting [
12]. In this study, we set the L2 parameter
to 0.01. Let the coefficient vector of ridge regression be denoted as
, and the calculation follows the equation below.
2.7. Zero Phase Error Tracking Control (ZPETC)
Zero Phase Error Tracking Control (ZPETC) is a control algorithm used in control systems to enhance tracking performance in discrete-time systems. ZPETC minimizes discrepancies between input and output by eliminating the system’s phase error. To achieve phase error elimination, the inverse transfer function of the system is calculated and applied to the control input, allowing the output to accurately follow the reference signal.
In our work, we use the wrist angle estimated by the NARX model as the reference signal, and ZPETC is employed to control the input for the servo motor. By compensating for the phase error in the servo motor, this approach enables the motor to reach the target angle more quickly and accurately.
Assume that the control plant is of the form as shown in (
16), and the plant is either stable or stabilized. Here,
is the delay operator, the numerator polynomial is denoted as
, and the denominator polynomial is denoted as
. When
has unstable zeros, factorizing it yields the expression shown in (
19). Here,
is a monic polynomial of degree
s that includes both unstable and marginally stable zeros, and
is a polynomial of degree
that includes only stable zeros. A monic polynomial is a polynomial where the highest degree term has a coefficient of 1. In this case, the equation for ZPETC is expressed as shown in (
20). Here,
represents the steady-state gain of
. The inverse polynomial used to compensate for the unstable zeros is shown in (
21).
z−1: Delay operator representing a one-step time delay, e.g., z−1y[k] = y[k − 1].
B[z−1]: Numerator polynomial of the transfer function, defined as
B[z−1] = b0 + b1z−1 + ⋯ + bmz−m, representing the system’s response to input.
A[z−1]: Denominator polynomial of the transfer function, defined as
A[z−1] = 1 + a1z−1 + ⋯ + anz−n, governing the system stability.
B−[z−1]: Unstable or marginally stable zeros of B[z−1], requiring compensation in ZPETC.
B+[z−1]: Stable zeros of B[z−1], not requiring compensation in ZPETC.
c1: Ratio b0/b1, representing the steady-state gain, used for compensating B−[z−1].
Equation (
22) can be expressed in the form of (
16) as follows.
By substituting Equations (
23)–(
26) into (
20), the zero phase error tracking controller (ZPETC) can be obtained as follows.
4. Discussion
4.1. Summary and Interpretation of Results
This study builds upon the research by Hayashi et al., aiming to resolve challenges encountered during practical implementation. For this purpose, the following hypotheses were formulated and verified.
The first hypothesis posits that switching from a four-channel wired EMG sensor to an eight-channel wireless EMG sensor may introduce communication delays, and EMD (Electro-Mechanical Delay) could still be effectively utilized in such an environment. Studies using the same EMG sensors do not mention communication delays when using wireless sensors [
10,
11]. While communication delay may not be critical in studies focusing on achieving accurate motions, it is considered an important verification in this study, as it is necessary to improve both the accuracy of motion and the system’s responsiveness. In
Section 2.3,
Bluetooth latency of Myo armband, the communication delay was measured, showing a minimum delay of 0.000272 s, a maximum delay of 0.034995 s, and an average delay of 0.001078 s. Additionally, in Verification of EMD Section,
Verification of EMD, an EMD of approximately 0.05841 s was confirmed by analyzing the delay between EMG signal generation and wrist angle change. Relating these results to the observed communication delay, it was calculated that EMD could still be utilized effectively, with values ranging from a minimum of 0.023415 s to a maximum of 0.058138 s, and an average of 0.057332 s. These findings suggest that even with a wireless EMG sensor, EMD can be leveraged to enhance system responsiveness.
The second hypothesis states that the eight-channel MISO-NARX model combined with ridge regression will outperform the four-channel SISO-NARX model in angle estimation accuracy. This was validated in
Section 2.6,
Estimation of wrist angle by NARX. Ridge regression is used to address the issue of unstable and high-variance estimates in the ordinary least squares (OLS) method caused by strong correlations among independent variables. It helps prevent overfitting and improves predictive performance [
12]. In this study, the conventional SISO-NARX model was modified to a MISO-NARX model, and the number of channels was increased to eight. As shown in
Figure 11, all EMG signals are output, regardless of the direction in which the wrist moves, with multiple channels responding to the same motion. This implies strong correlations among variables. Therefore, ridge regression is expected to enhance the performance of the MISO-NARX model. First, the 4-channel SISO-NARX model was constructed using EMG signals synthesized into a single input, resulting in an RMSE of 32.70 degrees. This was used as a baseline error based on prior research. Next, a four-channel MISO-NARX model was built to compare the performance of single-input and multi-input configurations. The RMSE of the four-channel MISO-NARX model was 29.15 degrees, indicating a slight improvement of 3.55 degrees. However, as shown near the 7 s mark in
Figure 16b, the model exhibited significant errors, likely due to reduced system identification accuracy with increased input complexity. Subsequently, an eight-channel MISO-NARX model was analyzed, showing improvement over the four-channel MISO-NARX model. Nevertheless, significant errors were observed in the 0–5 s range in
Figure 17. After applying ridge regression to the eight-channel MISO-NARX model, the RMSE improved to 7.80 degrees. These results demonstrate that while the complexity of the eight-channel model introduced additional challenges, ridge regression effectively mitigated overfitting and improved accuracy. This confirms that increasing the number of inputs combined with ridge regression can enhance system performance.
Finally, the practical implementation of the ZPETC+PID controller was evaluated to ensure no issues arise during real-world application. In Hayashi et al.’s study, the delay was reduced from 0.030 s to approximately 0.010 s. In this study, the delay for the servo motor was measured at approximately 0.423 s. Simulations demonstrated that the delay could be reduced by 0.376 s, resulting in a delay of 0.047 s. When applied to the actual servo motor, the delay without the controller was approximately 0.428 s, while the ZPETC+PID controller reduced this to 0.188 s; a reduction of 0.240 s. As mentioned in the introduction, delays exceeding 300 ms can negatively impact user performance in remote operations [
6]. With the ZPETC+PID controller implemented, the delay was measured at 0.188 s, further validating the effectiveness of the controller in enhancing responsiveness.
4.2. Future Directions
The limitations of the current study are as follows. In this study, EMG and angle data were extracted and used from a single participant. This is because the focus of this research was on validating the effectiveness of the system for controlling a myoelectric prosthesis. In other words, it is assumed that individual differences in EMG are reflected in the NARX model through calibration, based on the idea that prosthetic hands are individual. In future studies, we plan to verify the results when calibrating the same control system with data from other individuals.
The following are three research directions planned for future studies. First, directly controlling the motor using EMG signals, along with verifying the resulting angle and response speed, could further optimize motor responsiveness, allowing for real-time feedback and enhancing user convenience. Second, expanding the system to estimate finger movements would make it possible for the prosthetic hand to perform finer, more precise tasks according to the user’s intent. This would facilitate more natural interactions in daily activities, increasing the practical utility of the prosthesis. Third, it is essential to develop a classifier that can identify which finger is in motion. By identifying the active finger from a single EMG signal and then applying the NARX model, a classifier would enable a more accurate control system and is therefore anticipated to be a critical element in the prosthetic’s effectiveness. These research extensions hold the potential to make prosthetic hands more natural and useful in daily life, significantly enhancing the user experience.