Article

Blood Glucose Level Regression for Smartphone PPG Signals Using Machine Learning

Department of Electrical & Computer Engineering, North South University, Dhaka 1229, Bangladesh
*
Author to whom correspondence should be addressed.
Appl. Sci. 2021, 11(2), 618; https://doi.org/10.3390/app11020618
Submission received: 14 November 2020 / Revised: 11 December 2020 / Accepted: 28 December 2020 / Published: 10 January 2021
Figure 1. System diagram of noninvasive glucose detection using smartphone camera (top) compared with the conventional glucometer-based technique (bottom).
Figure 2. (A) Normalized Photoplethysmography (PPG) signal acquired using (a) Xiaomi Redmi Note 5 Pro, (b) Samsung Galaxy Note 8, and (c) OnePlus 6T phones. (B) Performance evaluation on PPG signal acquisition by the three smartphones.
Figure 3. Representative PPG signal extracted from the (Red) channel, (Green) channel, and (Blue) channel of two video frames of varying quality. The PPG signal extracted on the left side was observed to be more reliable than the one on the right side.
Figure 4. Normalized PPG signals with different qualities: a signal with both baseline variations and high-frequency noise (top), a signal with baseline variations (middle), and a good one with no high-frequency noise and little baseline variation (bottom).
Figure 5. Proposed model for the quantitative analysis of glucose level from smartphone video.
Figure 6. Raw PPG signal (top) was first preprocessed with ALS (middle) and then cleaned with a Gaussian filter (bottom). Both high-frequency noise and baseline variations had been removed.
Figure 7. (a) A representative PPG single cycle waveform with systolic and diastolic peaks and DelT; (b) is the first derivative of that cycle, and (c) is the second derivative of the cycle.
Figure 8. Two representative PPG signals, (left) a relatively prominent PPG signal and (right) a signal with high baseline issues and noise, processed through different steps. (a) Raw PPG signal; (b) PPG signal preprocessed with ALS and Gaussian filter; (c) first derivative of preprocessed signal, and (d) extracted features of first derivative signal.
Figure 9. Standard Error of Predictions (SEPs) against a different number of principal components (PCs) achieved for Principal Component Regression (PCR) model built with different preprocessing approaches.
Figure 10. SEP against a different number of principal components (PCs) achieved for Partial Least Square Regression (PLS) model built with different preprocessing approaches.
Figure 11. SEP of different Support Vector Regression (SVR) models against the ε values.
Figure 12. SEP of different Random Forest Regression (RFR) models against the number of estimator trees.
Figure 13. Comparison of performances of different regression techniques.

Abstract

Diabetes is a chronic illness that affects millions of people worldwide and requires regular monitoring of a patient’s blood glucose level. Currently, blood glucose is monitored by a minimally invasive process in which a small droplet of blood is extracted and passed to a glucometer; however, this process is uncomfortable for the patient. In this paper, a smartphone video-based noninvasive technique is proposed for the quantitative estimation of glucose levels in the blood. The videos are collected steadily from the tip of the subject’s finger using smartphone cameras and subsequently converted into a Photoplethysmography (PPG) signal. A Gaussian filter is applied on top of the Asymmetric Least Squares (ALS) method to remove high-frequency noise, optical noise, and motion interference from the raw PPG signal. These preprocessed signals are then used for extracting signal features such as systolic and diastolic peaks, the time differences between consecutive peaks (DelT), first derivative peaks, and second derivative peaks. Finally, the features are fed into Principal Component Regression (PCR), Partial Least Squares Regression (PLS), Support Vector Regression (SVR) and Random Forest Regression (RFR) models for the prediction of glucose level. Out of the four statistical learning techniques used, the PLS model, when applied to an unbiased dataset, has the lowest standard error of prediction (SEP) at 17.02 mg/dL.

1. Introduction

Diabetes is an incurable chronic disease that occurs either when the pancreas is no longer able to produce insulin, or when the body is unable to utilize insulin properly [1,2]. This results in poor regulation of blood glucose level, which can lead to severe health complications such as chronic heart and kidney disease if blood glucose levels are not monitored carefully. According to the IDF (International Diabetes Federation) Diabetes Atlas [3] published in 2019, 463 million people aged 20–79 have diabetes, and this number is projected to reach 700 million by 2045. More than 4.2 million people died due to diabetes-related disorders in 2019.
The conventional approach to glucose level monitoring requires several apparatuses such as a glucometer, a one-time test strip, and a single-use lancet or lancing device to draw blood [4]. It also requires alcohol pads, gloves, and a band-aid to reduce the risk of infection for the patient. It is an uncomfortable process and one that people with diabetes need to use to monitor their blood glucose level regularly.
There is a demand for wearable, noninvasive, and smart health monitoring systems [5,6]. Previously, there have been studies conducted on noninvasive [7] or minimally invasive wearable devices based on different types of technologies for blood glucose level monitoring [8]. Some of the researchers used sensors built using carbon nanotubes [9,10], fluorescence [11], and plasmonic materials [12] to determine the glucose level. These techniques had some issues such as long calibration processes, size, and susceptibility to autofluorescence.
Several electrical techniques have been proposed for blood glucose level monitoring. Shaker, G. et al. experimented with a millimeter-wave radar system to track glucose level changes [13]. Similarly, Hanna, J. et al. experimented with glucose monitoring using electromagnetic devices in a noninvasive manner [14]. Other researchers utilized changes in bioimpedance [15] and ultrasound signal [16,17] to determine a subject’s blood glucose level. In general, electrical techniques have room for improvement due to poor selectivity, temperature sensitivity, and skin irritation. A handful of previous studies have explored the potential of optical techniques in tracking changes in blood glucose level. Phan, Quoc-Hung [18] studied a polarimetric optical measurement method for noninvasive glucose determination. Optical Coherence Tomography (OCT) [19], near-infrared spectroscopy [20], and mid-infrared (MIR) spectroscopy [21] have also been utilized under laboratory conditions to gauge changes in blood glucose level in diabetic subjects. Unfortunately, optical techniques suffer from a significant lag time of up to 30 min as well as high temperature sensitivity. It is noteworthy that these proposed techniques require intricately organized and expensive laboratory equipment, making it unlikely for them to be used for daily measurement of blood glucose by millions of diabetics without significant reduction in price and complexity. Sensor-based noninvasive techniques have also been proposed for this purpose. Studies have shown the possibility of using GSR (Galvanic Skin Response) sensors [22], saliva [23,24], tear-based sensors [25], and sweat-based sensors [26] in order to measure blood glucose level. These techniques are generally inexpensive and require biological fluid samples other than blood to be collected, thereby reducing the invasiveness of the procedure and the subject’s discomfort.
Photoplethysmography (PPG) is a low-cost, noninvasive technique that measures the volumetric change of blood in the arteries. Previously, researchers have investigated the estimation of heart rate using PPG signals acquired by various methods such as endocavitary sensors [27] and pressure sensors [28]. Some researchers determined the glucose level using smartphone-based colorimetric and electrochemical methods [29]; this required an external apparatus to collect the sample before it was analyzed using a smartphone. We also previously demonstrated that PPG could be a useful tool for blood glucose estimation [30].
A preliminary study conducted by Zhang, G. et al. proposed a subspace KNN (K-Nearest Neighbors) based method for classifying patients as diabetic or nondiabetic using a PPG signal acquired from a single smartphone [31]. Using a binary classifier, this study constructed two classes in the (70 mg/dL–130 mg/dL) blood glucose range. The reported approach achieved a classification accuracy of 86.2% and used a Butterworth filter to remove motion artefacts.
This study outlines the initial development steps towards a new reliable technique for blood glucose level estimation using a smartphone camera. A smartphone camera is used to record multiple videos from the same position on the fingertips of fifty-two subjects. A Gaussian filter is applied to denoise the raw PPG waveform generated, and the Asymmetric Least Squares (ALS) method is used to remove baseline wander. Attributes such as systolic and diastolic peaks, the peak-to-peak time interval (DelT), first derivative peaks, and second derivative peaks are extracted from the denoised signal. These extracted features are subsequently fed into four statistical learning approaches, namely Principal Component Regression (PCR), Partial Least Squares Regression (PLS), Support Vector Regression (SVR), and Random Forest Regression (RFR), for the prediction of glucose level in blood. The paper focuses on data acquisition techniques, noise removal (a major challenge in smartphone-based data acquisition systems), and the development of algorithms that offer greater acquisition capability, comfort, and flexibility in measurement than existing systems.

2. Experiment Details

The complete experimental procedure of noninvasive sensing and a comparison with the conventional method of glucose level prediction are depicted in Figure 1. With the proposed noninvasive technique, a commercial smartphone camera is used to record a short video (20 s–50 s) of the subject’s fingertip, which is then converted into frames containing RGB channel information at different wavelengths. Red light has a longer wavelength than green or blue light, which allows it to penetrate deeper into tissue. Consequently, data from the red channel can be used to generate the PPG signal from the video frames. Regression models can then be trained with the acquired smartphone PPG signals and the corresponding reference glucose levels measured with a commercially available glucometer.
Fifty-two subjects aged between 17 and 61 years, with a male to female ratio of 75:25, and with blood glucose levels ranging between 68 and 211 mg/dL participated in this experiment. The participants were adequately briefed about the procedure before it began. A minimum of three trials were taken from each subject, resulting in one hundred and ninety-one trials altogether. Each video recorded during the trials was 60 s long. All the subjects provided their informed consent, and proper hygiene was maintained during the collection of reference values using a commercially approved glucometer (Accu-Chek Active), which meets the accuracy requirements of DIN EN ISO 15197:2013 and has a measuring interval of 10–600 mg/dL. The subjects’ fingers and the smartphone camera lenses were also cleaned using alcohol wipes before video was acquired.

2.1. Data Acquisition with Smartphone

The camera quality of commercially available smartphones has improved a lot over the last decade and offers excellent potential utility as a biomedical apparatus for data acquisition [32]. Nearly all smartphones can be used for the capture of PPG signals. However, the quality of the acquired signal can vary widely depending on the resolution and number of frames recorded by the camera per second. It is possible to extract PPG signals from a subject’s fingertips or earlobe. In our study, the fingertip was chosen for data acquisition due to the convenience and comfort in the process for the subject. Olive et al. have previously shown earlobe pulse oximetry to be less reliable than fingertip pulse oximetry [33]. During the recording, the subject was asked to remain relatively still and comfortably place their index finger on the smartphone camera. Video data were subsequently collected for 60 s, with the participant’s right index finger placed on the horizontal camera array and the LED (Light Emitting Diode) flash. The duration of 60 s allowed for a greater likelihood of obtaining acceptable PPG data, and allowed enough time in the beginning and the end of the segment for motion artifacts that are common with biomedical signal measurements. A maximum of five trials were collected from each participant. Each video was captured using the default camera application of the smartphone at 30 frames per second (fps) with a resolution of 1280 × 720 pixels.
To enhance the generalizability of the trained regression models, and to test the effectiveness of our denoising protocol, we initially shortlisted four smartphones to collect data in this study. Three of these phones were on the Android platform, whereas the fourth was on iOS. The four phones ran different camera software, and their built-in cameras and optical setups offered a wide range of specifications for data collection. After a preliminary study on the performance of each phone, the data collection was narrowed down to two phones: one on iOS and one on Android.

2.1.1. Acquisition with iPhone

In the iOS category, an iPhone 7 Plus (Made by Apple Inc, San Francisco, CA, USA) phone was used for noninvasive data collection. The imaging unit in the device consists of a WLED (White LED) as the illumination source next to a 12-megapixel camera at a center-to-center separation of around 5 mm. The phone supports color video recording at about 30 fps up to a resolution of 3840 × 2160 pixels. For this study, the fingertip was recorded at 30 fps at a resolution of 720 pixels. This did not reduce signal quality as compared to that of 1080 pixels or 4K but took up significantly less phone memory and data transfer time.

2.1.2. Acquisition with Android Phone

For the Android platform, the choice of smartphone is more complex. Android devices come in a wide range of hardware specifications regarding the built-in camera and the optical setup. Consequently, the quality of acquired data can vary significantly between models from different manufacturers. We therefore conducted a preliminary study (Supplementary Material Figure S1) to evaluate the performance of data acquisition on three smartphones with varying price ranges and camera specifications.
For the preliminary study, three trials were collected from five subjects using a Xiaomi Redmi Note 5 Pro (Xiaomi, Beijing, China), Samsung Galaxy Note 8 (Made by Samsung, San Jose, CA, USA), and OnePlus 6T (Made by OnePlus, Guangdong, China). The camera sensors on the Redmi Note 5 Pro and the Galaxy Note 8 are manufactured by Samsung, whereas the OnePlus 6T has a Sony IMX camera sensor. All of them have a dual camera setup with an illumination source in the form of a WLED flash placed next to the cameras. For this portion of the study, the video data were recorded at 30 fps at a resolution of 1920 × 1080 pixels. Detailed specifications for each of the phones are provided in Table 1. The Aperture column of Table 1 denotes the focal length of the smartphone camera.
The quality of the PPG signal acquired from the same subject using three different smartphones is shown in Figure 2. In the first two signals, considerably more baseline variation can be observed than in the third signal. To further evaluate the performance of these phones, we attempted to extract features from the acquired signals. For this, the signals were first preprocessed (Supplementary Material Figure S2) to remove high-frequency noise and baseline drift. Then, peak analysis was performed on the preprocessed PPG (Supplementary Material Figure S3) and on the first and second derivatives of the signals (Supplementary Material Figure S4). From Supplementary Material Table S1, it can be seen that the model was able to detect the second derivative peak with very few misses and false positives across all the subjects. We performed peak detection and observed the results as shown in Figure 2. With five different subjects and multiple trials with the three candidate smartphones, we could see that the Xiaomi was outperformed by the others in terms of accuracy.
Although the acquisition quality of the Samsung Galaxy Note 8 and the OnePlus 6T is comparable, the Samsung Galaxy Note 8 is more expensive despite very similar specifications. Consequently, the OnePlus 6T was chosen as the Android platform for data acquisition in the study of glucose level detection. Further details on this study are available in the Supplementary Material.

2.1.3. Channel Selection

The video recorded from the subjects was stored in the phone’s memory in MPEG-4 (Moving Picture Experts Group) format. As discussed earlier, acquired smartphone videos were first converted into RGB frames containing information from three color channels (red, green, and blue) with varying wavelengths. Channel selection for obtaining a good quality PPG has been a challenge for researchers and has therefore been studied extensively in the past [34]. Blood perfusion variations depend on the wavelength of light, since radiation of different wavelengths penetrates the skin layers and reaches the vascular bed at varying depths. Red light, which has a wavelength of 620 nm, reaches deeper (several centimeters) blood vessels in contrast to blue light (432 nm), which penetrates less than 1 mm deep. The red channel was often a default choice [35,36]. On the other hand, the authors of [34,37] claimed that the green color signal provided the best signal amplitude values for smartphones and considered it more suitable than red and blue. The authors of [34,38] therefore conducted several experiments and compared the PPG signal strength in the three color bands, with and without a flashlight, to obtain a good quality PPG. Grimaldi [39] reported that the distribution of the pixels in the green channel is not uniform across smartphone models. Bolkhovsky [40] suggested different channels for different phones. All these experiments indicate that the selection of channel may vary based on several factors, including the phone model (due to camera characteristics), the acquisition technique (with or without LED), and the region of interest.
In the proposed system, we initially extracted a PPG signal from all three channels. In most cases, PPG extracted from the red channel was more prominent and less noisy than that of the green and blue channels, as seen in Figure 3.

2.1.4. Conversion into PPG Waveform

In a given frame, each pixel has slightly varying values. We averaged the red channel values of an entire frame to obtain one sample point of the respective PPG waveform. After extracting the sampling points from all the frames of recorded videos and ordering them in an array, complete PPG waveforms could be generated. The computation was performed using a custom-written MATLAB program. Regions of Interest (ROIs), where applicable, were removed from the frames before conversion.
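The frame-to-sample conversion described above can be sketched in a few lines. The study used a custom MATLAB program that is not reproduced in the text, so the Python/NumPy sketch below is only an illustration: the function name `frames_to_ppg`, the RGB channel order, and the synthetic frames are assumptions, and decoding a real video into frames is left out.

```python
import numpy as np

def frames_to_ppg(frames, channel=0):
    """Average one color channel over each frame to get one PPG sample per frame.

    frames: array of shape (n_frames, height, width, 3), assumed to be in RGB order.
    channel: 0 selects the red channel under that assumption.
    """
    frames = np.asarray(frames, dtype=float)
    # Averaging over the two spatial axes leaves a single value per frame.
    return frames[..., channel].mean(axis=(1, 2))

# Synthetic stand-in for a fingertip video: four tiny frames with known red levels.
red_levels = [200.0, 210.0, 205.0, 195.0]
demo_frames = np.zeros((4, 2, 2, 3))
for i, level in enumerate(red_levels):
    demo_frames[i, :, :, 0] = level

ppg = frames_to_ppg(demo_frames)  # one sample per frame, in frame order
```

At 30 fps, a 60 s recording would yield 1800 samples in this arrangement.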

2.1.5. Signal Quality

It was identified through experimental investigations that the best way to collect video data with little motion is to let the subject hold the phone in their hand and then place their fingertip on the camera sensor. This introduces less motion interference, as people naturally hold a phone in this way. Other approaches were also attempted, but they caused considerable finger movement during data collection. Even a little distortion may result in scattered plots, and the acquired signals may suffer from severe baseline variations. An example of such an event is shown in Figure 4. Videos were also captured with the flash on and off. PPG signals obtained with the flash were much cleaner than those obtained without it, and the red channel yielded the best overall output (Supplementary Material Figure S5).
Even after careful consideration of the type of camera to be used and the orientation of the fingertip placement, the acquired signals varied in quality. This is evident in Figure 4, which provides three sample signals collected from different subjects. One of them is quite clean, another has some baseline variations, and the last one contains both high-frequency noise and baseline variations. The accuracy of a PPG signal is strongly influenced by motion artifacts, which lead to inaccurate calculation of vital features. Motion artifacts, baseline wander, and high-frequency noise arise from unavoidable disturbances in the contact between the smartphone camera and the finger, and from voluntary or involuntary subject movement, including respiration and extrusion between the finger and the smartphone camera. Therefore, it is crucial to develop a reliable series of preprocessing steps to enhance the uniformity and reliability of PPG signal acquisition.

2.2. Invasive Data Acquisition Methodology

During the study, actual blood sugar data were simultaneously collected as a reference from the subject invasively using a commercially available medical standard glucometer. The meter can measure the sugar level from a small drop (1–2 µL) of blood collected from the fingertip of the subject. It works in a range of 10–600 mg/dL. This technique also requires a one-time test strip and single-use safety lancets in addition to the meter. First, the subject’s hand was cleaned with single-use hand sanitizer, and an alcohol solution was applied to the finger from which the blood sample was taken. Then, a test strip was inserted into the glucometer to prepare the device for measuring the glucose level. The blood sample was collected by pricking the side of any finger using a lancet. The pricked finger was then touched to the edge of the test strip and held there to transfer the blood drop to the strip. The pricked finger was treated with a band-aid. The glucometer usually takes 5–8 s to process the sample and displays the glucose level in mg/dL on the screen. At least one trial was collected from each subject to acquire the reference value.

3. Methodology

The proposed approach for the noninvasive estimation of blood glucose level using a smartphone video has been explained through a flowchart in Figure 5. Video data were first obtained from the subjects’ fingertips which were then converted into the respective PPG waveform. This signal contains noise and motion artefact, which were cleaned in the preprocessing steps. Relevant features were extracted from the preprocessed signals. These features were then fed into a regression model. The regression model was trained using extracted features and reference glucose values, collected invasively using a commercially available glucometer. Then, the trained model was used to estimate the glucose level of an unknown sample. The unknown samples used for the testing purpose followed the same preprocessing and feature extraction techniques as the training phase. The performance of the model was evaluated by calculating the Standard Error of Prediction (SEP), measured in mg/dL.
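The section does not state the formula used for the Standard Error of Prediction. As a hedged illustration only, the sketch below assumes the common root-mean-square form, SEP = sqrt(sum((predicted - reference)^2) / n); other definitions (for example, bias-corrected with n - 1) exist, and the glucose values shown are made up for the example.

```python
import math

def standard_error_of_prediction(reference, predicted):
    """Root-mean-square difference between predictions and references.

    This is an assumed form of SEP; the text does not give its exact definition.
    """
    if len(reference) != len(predicted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - r) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical glucose values in mg/dL, for illustration only.
reference_mg_dl = [100.0, 120.0, 140.0]
predicted_mg_dl = [110.0, 120.0, 130.0]
sep = standard_error_of_prediction(reference_mg_dl, predicted_mg_dl)
```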

3.1. PPG Signal Preprocessing

3.1.1. Gaussian Filter

Signal cleaning can be performed using a variety of filters. In our work, a Gaussian filter was used for high-frequency noise removal. A Gaussian filter [41] is a nonuniform lowpass filter and is consequently a popular general-purpose choice for removing the high-frequency components of a PPG signal. It also performs well as a bandpass filter in the frequency domain when tuned through its center frequency and standard deviation. The filter used here is given in Equation (1) below:
$$G(x) = e^{-\frac{(x - c)^{2}}{2w^{2}}} \tag{1}$$
In Equation (1), c is the center frequency and w is the standard deviation, which can be adjusted to set the degree of smoothing. The function performed best for smoothing the PPG signal when the center frequency was set to c = 0.068 and the standard deviation was set to w = 0.0543. A sample PPG signal with high-frequency noise that was cleaned by applying the Gaussian filter is given in Supplementary Material Figure S6.
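One possible reading of Equation (1), G(x) = exp(-(x - c)^2 / (2w^2)), is a Gaussian weighting applied to the signal spectrum. The sketch below follows that reading and assumes that x is frequency in cycles per sample; the text does not state the units of c and w, so this is an assumption, as are the function name and the synthetic test signal. Under this assumption and at 30 fps, c = 0.068 corresponds to roughly 2 Hz.

```python
import numpy as np

def gaussian_frequency_filter(signal, center=0.068, width=0.0543):
    """Weight the spectrum of `signal` by G(x) = exp(-(x - c)^2 / (2 w^2)).

    Assumption: x is frequency in cycles per sample (0 to 0.5).
    """
    signal = np.asarray(signal, dtype=float)
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(signal.size)  # cycles per sample
    gain = np.exp(-((freqs - center) ** 2) / (2.0 * width ** 2))
    return np.fft.irfft(spectrum * gain, n=signal.size)

# Synthetic 60 s recording at 30 fps: a 2 Hz pulse plus 10 Hz noise.
fps = 30.0
t = np.arange(1800) / fps
pulse = np.sin(2 * np.pi * 2.0 * t)
noise = 0.5 * np.sin(2 * np.pi * 10.0 * t)
filtered = gaussian_frequency_filter(pulse + noise)

# Amplitudes of the two components after filtering (bin k is k/60 Hz here).
amplitude = np.abs(np.fft.rfft(filtered)) / (t.size / 2)
pulse_amp, noise_amp = amplitude[120], amplitude[600]
```

In this synthetic case the 2 Hz component passes almost unchanged while the 10 Hz component is strongly attenuated.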

3.1.2. Asymmetric Least Squares

ALS is a useful baseline correction technique [42]. Motion interference was removed with the help of ALS, which is well suited to correcting data with relatively narrow peaks. ALS also makes the systolic and diastolic peaks more prominent, which are crucial attributes for feature extraction. The baseline correction through ALS improved the overall signal quality, as the fluctuations between peaks were decreased.
However, some high-frequency noise could still be observed in signals processed through ALS, as seen in Supplementary Material Figure S7. ALS was therefore followed by smoothing with the Gaussian filter to remove the remaining high-frequency noise. A preprocessing model with ALS as the first stage and the Gaussian filter as the second stage significantly improves the quality of the acquired PPG signals. This can be seen in Figure 6, which shows a raw PPG signal corrected using ALS and then cleaned with the Gaussian filter. All the signals were normalized to the range 0 to 1.
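The text does not give the ALS parameters or implementation. The sketch below uses a commonly used formulation of asymmetric least squares (iteratively reweighted penalized least squares with a second-difference smoothness penalty), followed by min-max normalization to the range 0 to 1. The smoothness `lam`, asymmetry `p`, iteration count, and the synthetic signal are all illustrative assumptions, and dense matrices are used only to keep the example short.

```python
import numpy as np

def als_baseline(y, lam=1e5, p=0.01, n_iter=10):
    """Estimate a smooth baseline under the peaks of y (assumed parameters)."""
    y = np.asarray(y, dtype=float)
    n = y.size
    # Second-order difference matrix of shape (n - 2, n).
    D = np.diff(np.eye(n), n=2, axis=0)
    penalty = lam * (D.T @ D)
    w = np.ones(n)
    for _ in range(n_iter):
        # Weighted least squares with a smoothness penalty on the baseline z.
        z = np.linalg.solve(np.diag(w) + penalty, w * y)
        # Points above the baseline (peaks) get the small weight p.
        w = np.where(y > z, p, 1.0 - p)
    return z

def normalize_01(x):
    """Min-max normalization to the range 0 to 1."""
    x = np.asarray(x, dtype=float)
    return (x - x.min()) / (x.max() - x.min())

# Synthetic signal: narrow peaks riding on a linear baseline drift.
idx = np.arange(200)
drift = 0.01 * idx
peaks = sum(np.exp(-((idx - c) ** 2) / (2 * 2.0 ** 2)) for c in (20, 60, 100, 140, 180))
raw = drift + peaks
baseline = als_baseline(raw)
corrected = normalize_01(raw - baseline)
```

For full-length recordings a sparse solver would be the more practical choice than the dense solve shown here.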

3.2. Feature Extraction

After applying the preprocessing steps to the PPG signals, certain features were extracted from them. Multiple feature vectors were created through different combinations of these features, which were then used in training the regression models. A feature could be any point in the signal, or multiple pieces of information can be combined to make one feature. For this study, we extracted the following features from the PPG signal: systolic peaks, diastolic peaks, the peak-to-peak time interval (DelT), first derivative peaks, and second derivative peaks.

3.2.1. Systolic and Diastolic Features

The direct pressure wave travelling from the left ventricle to the periphery of the body represents the systolic peak. On the other hand, the reflections of the pressure wave by arteries of the lower body represent the diastolic peak. Each cycle of the PPG signal contains one systolic and one diastolic peak as seen in Figure 7a. These peak locations were extracted as features of the PPG signal.
DelT refers to the time separation between systolic and diastolic peaks. It was computed from each systolic–diastolic peak pair in every cycle of the preprocessed PPG signal, and the resulting values were stored in an array. The systolic, diastolic and DelT features are presented in a sample PPG signal in Figure 7a.
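As a minimal sketch of how these three features might be read from one cycle, the code below finds local maxima, takes the tallest as the systolic peak and the tallest later maximum as the diastolic peak, and converts their separation to seconds. The peak-picking rule, the function names, and the synthetic two-bump cycle are assumptions for illustration; the text does not describe the detector actually used.

```python
import numpy as np

def local_maxima(x):
    """Indices of interior samples that are higher than their neighbors."""
    x = np.asarray(x, dtype=float)
    is_peak = (x[1:-1] > x[:-2]) & (x[1:-1] >= x[2:])
    return np.flatnonzero(is_peak) + 1

def systolic_diastolic_delt(cycle, fps=30.0):
    """Return (systolic index, diastolic index, DelT in seconds) for one cycle."""
    cycle = np.asarray(cycle, dtype=float)
    peaks = local_maxima(cycle)
    if peaks.size < 2:
        raise ValueError("need at least two peaks in the cycle")
    systolic = peaks[np.argmax(cycle[peaks])]
    later = peaks[peaks > systolic]
    if later.size == 0:
        raise ValueError("no diastolic peak found after the systolic peak")
    diastolic = later[np.argmax(cycle[later])]
    return int(systolic), int(diastolic), (diastolic - systolic) / fps

# Synthetic one-second cycle at 30 fps: a tall bump followed by a smaller one.
n = np.arange(30)
cycle = np.exp(-((n - 8) ** 2) / (2 * 2.0 ** 2)) + 0.5 * np.exp(-((n - 18) ** 2) / (2 * 2.5 ** 2))
sys_idx, dia_idx, delt = systolic_diastolic_delt(cycle)
```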

3.2.2. First Derivative Features

The first derivative gives the instantaneous rate of change of a function. It indicates whether the function is increasing or decreasing and how fast it does so. Taking the first derivative increased the number of peaks (features) available in the PPG signal, since PPG is a periodic function. To determine the first derivative of the PPG signal, a two-point central difference method was used, as shown in Equation (2) below:
$$d(j) = \frac{1}{2}\left(a(j+1) - a(j-1)\right) \tag{2}$$
Here, a(j) is the j-th sample of the existing preprocessed array, whereas d(j) is stored in a new array. Since the first derivative signals also had negative peaks, the signal was processed further before extracting features from the derivative signal. This made the peaks more prominent, so that features could be extracted easily. The derivative of a sample PPG signal is shown in Figure 7b, where the extracted features have been marked.

3.2.3. 2nd Derivative Features

The second derivative is the rate of change of the slope given by the first derivative. It increased the number of peaks (features) in the signal even further than the first derivative did. A three-point central difference method was used to calculate the second derivative, as shown in Equation (3):
$$d(j) = a(j+1) - 2a(j) + a(j-1) \tag{3}$$
Here, as before, a(j) represents the points in the existing preprocessed array and d(j) is the newly calculated second derivative of that array. The second derivative of a sample PPG signal is shown in Figure 7c, where the extracted features have also been marked. The figure shows how the first and second derivatives enhance the peaks of a PPG signal.
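The two central-difference formulas, d(j) = (a(j+1) - a(j-1))/2 for the first derivative and d(j) = a(j+1) - 2a(j) + a(j-1) for the second, can be checked on a simple test signal. The sketch below assumes unit sample spacing and drops the two end samples, where the formulas are undefined; the handling of the ends in the original study is not stated.

```python
import numpy as np

def first_derivative(a):
    """Two-point central difference: d(j) = (a(j+1) - a(j-1)) / 2."""
    a = np.asarray(a, dtype=float)
    return 0.5 * (a[2:] - a[:-2])

def second_derivative(a):
    """Three-point central difference: d(j) = a(j+1) - 2 a(j) + a(j-1)."""
    a = np.asarray(a, dtype=float)
    return a[2:] - 2.0 * a[1:-1] + a[:-2]

# Test signal a(j) = j^2, whose exact derivatives are 2j and 2.
samples = np.arange(6, dtype=float) ** 2
d1 = first_derivative(samples)   # values for j = 1..4
d2 = second_derivative(samples)  # values for j = 1..4
```

For a quadratic signal both formulas are exact, which makes it a convenient sanity check.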

3.3. Regression Analysis

To estimate the glucose level, regression analysis was performed on the PPG signals acquired through a smartphone camera and their reference glucose values. Separate regression models were trained on raw signals, preprocessed signals, and the signal features extracted from the preprocessed signals, in order to evaluate and compare their performance in noninvasive quantitative estimation.
Regression techniques such as CLS (Classical Least Squares), PCA (Principal Component Analysis), PCR and PLS have been studied in the past for quantitative analysis. Both PLS and PCR have been widely used in chemometrics as multivariate calibration methods, and these methods can be applied when the dataset has correlated predictor variables [40]. Both regression methods create new predictor variables (components) as linear combinations of the original predictor variables. PLS creates these components while considering the observed response values. PCR, on the other hand, creates components without considering the response values at all. Both regression methods have reliable predictive power. PCR was chosen for this work due to its reported ability to predict glucose levels from NIR (Near Infrared) spectra with 94–95% accuracy [20].
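To make the PCR side of this contrast concrete, the sketch below builds components from the predictors alone (via a singular value decomposition of the centered data) and only then regresses the response on the component scores. It is a generic NumPy illustration with hypothetical function names and random demo data, not the models trained in this study; a PLS model would differ in that the response is used while the components are constructed.

```python
import numpy as np

def pcr_fit(X, y, n_components):
    """Principal Component Regression: PCA on X, then least squares on the scores."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc = X - x_mean
    # Rows of vt are the principal directions; y plays no role in this step.
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    loadings = vt[:n_components].T
    scores = Xc @ loadings
    gamma, *_ = np.linalg.lstsq(scores, y - y_mean, rcond=None)
    # Map the score-space coefficients back to the original predictors.
    return x_mean, y_mean, loadings @ gamma

def pcr_predict(model, X):
    x_mean, y_mean, beta = model
    return (np.asarray(X, dtype=float) - x_mean) @ beta + y_mean

# Random demo data with an exactly linear response (illustration only).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(40, 3))
y_demo = X_demo @ np.array([1.0, -2.0, 0.5]) + 3.0
model = pcr_fit(X_demo, y_demo, n_components=3)
y_hat = pcr_predict(model, X_demo)
```

With all components retained, PCR coincides with ordinary least squares, so the demo response is recovered; using fewer components trades some fit for robustness to correlated predictors.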
Additionally, models were trained using SVR and RFR. SVR is a modification to the commonly used classification algorithm called Support Vector Machine (SVM). While SVM is primarily used in classification problems with discrete finite labels, SVR can be used to predict continuous variables such as blood glucose level. In SVR, instead of trying to minimize the error outright as with traditional regression techniques, hyperplanes are constructed to fit the error observed within a small threshold of ε (epsilon). Previous studies have used support vector-based techniques on PPG data to predict other continuous biological variables such as blood pressure estimation [43].
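The role of ε can be illustrated with the ε-insensitive loss that underlies standard SVR formulations: prediction errors smaller than ε cost nothing, and larger errors are penalized only by the amount they exceed ε. The values below are made up for the example and are not taken from the study.

```python
import numpy as np

def epsilon_insensitive_loss(y_true, y_pred, epsilon):
    """Per-sample loss max(0, |y_true - y_pred| - epsilon)."""
    residual = np.abs(np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float))
    return np.maximum(0.0, residual - epsilon)

# Hypothetical glucose values in mg/dL with a tolerance of 5 mg/dL.
loss = epsilon_insensitive_loss(
    [100.0, 120.0, 140.0],
    [104.0, 120.0, 125.0],
    epsilon=5.0,
)
```

Here the first two predictions fall inside the tolerance and incur no loss, while the third misses by 15 mg/dL and is charged the 10 mg/dL beyond ε.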
Finally, ensemble learning was used in the form of RFR—a technique based on using several decision trees to predict a continuous value. Classical Decision Tree Regression (DTR) involves gradually developing a decision tree from smaller subsets of the training data and can work reasonably well in mapping nonlinear functions. Outputs from several decision trees are averaged together to provide an ensemble estimate of the predicted value. Other studies have demonstrated the efficacy of RFR in the prediction of biological variables from ECG or PPG data, often outperforming other regression techniques [44].
The following sections discuss the experiment performed on each of these learning methods in detail. For the purpose of clarity in discussion, we refer to PLS and PCR as statistical learning, and refer to SVR and RFR as machine learning, in order to reflect the commonly used terminology for each technique.

4. Results and Discussion

4.1. Experiment with Random Test Cases

A train-test split of 75/25 was used in the first experiment. The models that achieved high accuracy were then tested with fully unbiased subject wise test cases, which are elaborated in the subsequent sections. However, a large amount of variability was present in individual smartphone PPG signal instances, even when acquired from the same subject. Among these trials, there were PPG signals of different quality, as shown in Figure 8. The preprocessing steps were held constant throughout the trials: denoising with a Gaussian filter and baseline correction using ALS. The first derivative of the preprocessed signal was then computed, and only its positive and negative peak values were extracted as features. Figure 8 shows two representative PPG signals, their transformation after preprocessing, and the corresponding extracted features. It can be observed that first derivative features were extracted from both signals without any issue. The proposed signal processing model offers reliability in handling PPG signals of various signal qualities, and may therefore reduce the burden during data acquisition, as an acceptable PPG signal may be acquired in spite of baseline drift and high-frequency noise.
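The derivative-and-peak step described above can be sketched as follows. This is a minimal illustration on a synthetic sinusoid: the ALS baseline correction is omitted, and the sampling rate, the smoothing width `sigma`, and all function names are assumptions rather than values from the study.

```python
import numpy as np

def gaussian_smooth(signal, sigma):
    """Denoise by convolving with a normalized Gaussian kernel (sigma in samples)."""
    radius = int(3 * sigma)
    t = np.arange(-radius, radius + 1)
    kernel = np.exp(-0.5 * (t / sigma) ** 2)
    kernel /= kernel.sum()
    return np.convolve(signal, kernel, mode="same")

def local_extrema(signal):
    """Indices of strict local maxima and minima."""
    d = np.diff(signal)
    maxima = np.where((d[:-1] > 0) & (d[1:] < 0))[0] + 1
    minima = np.where((d[:-1] < 0) & (d[1:] > 0))[0] + 1
    return maxima, minima

def first_derivative_features(signal, fs, sigma=2.0):
    """Positive and negative peak values of the first derivative."""
    smooth = gaussian_smooth(signal, sigma)
    deriv = np.gradient(smooth) * fs  # amplitude change per second
    maxima, minima = local_extrema(deriv)
    pos = deriv[maxima][deriv[maxima] > 0]
    neg = deriv[minima][deriv[minima] < 0]
    return pos, neg

# Synthetic pulse-like signal: 1.2 Hz sinusoid sampled at an assumed 30 Hz.
fs = 30
t = np.arange(0, 10, 1 / fs)
ppg = np.sin(2 * np.pi * 1.2 * t)
pos, neg = first_derivative_features(ppg, fs)
print(len(pos), len(neg))
```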
In this study, each recorded signal contains 60 s of data. However, 5 s was removed from both the beginning and the end of each segment, because large motion artifacts are naturally prevalent at the start and the conclusion of a recording. Several PCR, PLS, SVR and RFR models were developed with the remaining 50 s of raw PPG signal in every instance, using a variety of different features. These include models based on combinations of the PPG signal preprocessed via ALS and a Gaussian filter, and first derivative and second derivative features extracted from the preprocessed PPG signal. The following sections outline the observed results from the various regression techniques used to predict blood glucose level and draw comparisons between the methods shown.
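The trimming step amounts to simple index arithmetic. The sketch below assumes a frame rate of 30 samples per second, which is not stated in this section; the function name is hypothetical.

```python
def trim_edges(samples, fs, trim_seconds=5.0):
    """Drop trim_seconds of data from both ends of a recording."""
    n = int(round(trim_seconds * fs))
    if 2 * n >= len(samples):
        raise ValueError("recording is too short to trim")
    return samples[n:len(samples) - n]

fs = 30                           # assumed sampling rate (samples per second)
recording = list(range(60 * fs))  # placeholder for a 60 s signal
trimmed = trim_edges(recording, fs)
print(len(trimmed) / fs)          # duration in seconds after trimming
```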

4.1.1. PCR

Figure 9 shows the estimated Standard Error of Prediction (SEP) against the number of principal components (PCs) for PCR models built with different approaches. For the PCR model built using second derivative characteristic features, the SEP was markedly greater than for the other approaches. In addition, the SEP increased with the number of principal components, from 22.95 mg/dL on the second component to 26.54 mg/dL with 11 PCs.
A significant reduction in SEP could be observed in the other approaches. Using a model built with the DelT features obtained from the signal preprocessed through Gaussian and ALS filtering, the SEP was observed to be 21.3 mg/dL with the first two PCs, whereas in the models that included first derivative and second derivative features, the lowest SEP was observed with five components at 18.33 mg/dL. The SEP was lower still (17.09 mg/dL) for the PCR model built with first derivative characteristic points extracted from the preprocessed PPG using 11 PCs. The preprocessing and feature extraction approach significantly improved the prediction ability of the PCR model. Here, it is essential to note that systolic and diastolic features alone did not improve the accuracy; likewise, the performance with the second derivative features was not noteworthy. The comparison of performances among different PCR models, in terms of the lowest SEP achieved and the minimum number of PCs required, is shown in Table 2.
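A sweep of SEP against the number of principal components can be sketched as below. Two assumptions are made loudly: SEP is computed here as the root-mean-square of the prediction errors (this section does not give the formula, and other definitions subtract the bias), and the data are synthetic. Function names are hypothetical.

```python
import numpy as np

def sep(y_true, y_pred):
    """Assumed definition: root-mean-square of the prediction errors."""
    e = np.asarray(y_true) - np.asarray(y_pred)
    return float(np.sqrt(np.mean(e ** 2)))

def pcr_fit_predict(X_train, y_train, X_test, n_components):
    """Principal component regression: PCA on X, then least squares on the scores."""
    mu, y_mu = X_train.mean(axis=0), y_train.mean()
    _, _, vt = np.linalg.svd(X_train - mu, full_matrices=False)
    V = vt[:n_components].T                   # loadings (features x components)
    T = (X_train - mu) @ V                    # training scores
    coef, *_ = np.linalg.lstsq(T, y_train - y_mu, rcond=None)
    return (X_test - mu) @ V @ coef + y_mu

def sweep_components(X_train, y_train, X_test, y_test, max_components):
    """SEP on the test set for 1..max_components principal components."""
    return {k: sep(y_test, pcr_fit_predict(X_train, y_train, X_test, k))
            for k in range(1, max_components + 1)}

# Synthetic, noise-free data (not from the study), split 75/25.
rng = np.random.default_rng(0)
X = rng.normal(size=(80, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + 100.0
n_train = int(0.75 * len(X))
results = sweep_components(X[:n_train], y[:n_train], X[n_train:], y[n_train:], 5)
best_k = min(results, key=results.get)
print(best_k, results)
```

On real data the minimum is usually reached before all components are used, which is the behavior a plot such as Figure 9 is meant to reveal.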

4.1.2. PLS

In this study, regression models were also built using PLS. Figure 10 shows the estimated SEP against the number of principal components (PCs) for PLS models built with different approaches. In general, there was a linear relationship between the number of PCs used and the corresponding SEP. It could be observed that the SEP increased with the number of principal components for the PLS model with the first derivative signal. SEP on the second component was 22.09 mg/dL, which increased to 50.59 mg/dL for 11 PCs. For the model built on the second derivative, the lowest SEP was recorded as 21.37 mg/dL on the second component. The SEP was lower still when the PLS model was built with first derivative characteristic points extracted from the preprocessed PPG, at 17.02 mg/dL for the second component.
The comparison of performances among different PLS models, in terms of the lowest SEP achieved and the minimum number of PCs required, can be found in Table 3. It is important to note that with PLS-based models, all the lowest SEPs were achieved with just two principal components.

4.1.3. SVR

The SVR models were built with a Radial Basis Function (RBF) kernel. This kernel function implicitly maps an n-dimensional input to an m-dimensional feature space, where m is much higher than n, and computes dot products in that higher-dimensional space without constructing the mapping explicitly. The main idea behind using a kernel is to fit a linear classifier or regression curve in higher dimensions, which consequently produces a nonlinear classifier or regression curve in lower dimensions. In our study, the size of the kernel cache was set to 400 for training and the constant regularization parameter C was set to 1.0. The threshold ε (epsilon) was tuned from 0 to 0.6 for each of the SVR models to reduce SEP. Figure 11 illustrates the estimated SEP of SVR models built with different features against epsilon tuned within this range. With the exception of the first derivative SVR model, a decreasing trend in SEP could be observed for the SVR models with an increase in epsilon from 0 to 0.5. The lowest SEP recorded for the different models is shown in Table 4. The SVR model with second derivative characteristic features achieved the lowest SEP of 18.52 mg/dL, even though the other models also had comparable margins of error.
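The RBF kernel itself is a short computation. The sketch below evaluates it for a few toy points; the `gamma` value is an assumption (the section does not report it) and the function name is hypothetical.

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    """K[i, j] = exp(-gamma * ||A[i] - B[j]||^2)."""
    sq_dist = ((A ** 2).sum(axis=1)[:, None]
               + (B ** 2).sum(axis=1)[None, :]
               - 2.0 * A @ B.T)
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))  # clip tiny negative round-off

# Toy points (not data from the study); gamma is an assumed value.
X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]])
K = rbf_kernel(X, X, gamma=0.5)
print(np.round(K, 4))
```

Each entry of `K` plays the role of a dot product in the implicit feature space, so identical points give 1 and distant points approach 0.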

4.1.4. RFR

Finally, similar models were built with Random Forest Regression (RFR) for the estimation of glucose level, and their performances were evaluated using the same metric. RFR models are built with estimator trees/nodes, and in this study the total number of trees/nodes was increased up to 70 for all the models to identify the optimum number of trees required for prediction with the lowest error. Some sample leaves were also added for improvement, and the total number was set to six. The SEP of RFR models built with different features against the number of estimator trees/nodes can be found in Figure 12. The prediction error for all the RFR models decreased with the increase in the number of trees/nodes. This reduction in SEP was comparatively sharp for the model built with first derivative features, particularly for the first 15 trees. The lowest SEP recorded for the different models, along with the number of trees required, is provided in Table 5. Notably, the RFR model built with first derivative features had the lowest SEP of 21.88 mg/dL, requiring 25 trees. The model built with second derivative characteristic features had a similar prediction error of 22.11 mg/dL, though it required nearly double the number of estimators.

4.2. Subject Wise Glucose Level Prediction

In this experiment, PPG signals of ten subjects with glucose levels ranging from 73 to 153 mg/dL were initially separated from the training set in order to ensure that the test set was independent of the training set. It is important to mention that some of the model building parameters, such as the number of principal components, were retained from the previous experiment in this section of our study.
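A subject wise split can be sketched by assigning every recording of a held-out subject to the test set. The subject identifiers below are hypothetical placeholders, as is the function name.

```python
def subject_wise_split(subject_ids, test_subjects):
    """Return (train_idx, test_idx) so that no subject appears in both sets."""
    held_out = set(test_subjects)
    train_idx = [i for i, s in enumerate(subject_ids) if s not in held_out]
    test_idx = [i for i, s in enumerate(subject_ids) if s in held_out]
    return train_idx, test_idx

# Hypothetical IDs: three recordings per subject.
subject_ids = ["S01", "S01", "S01", "S02", "S02", "S02", "S03", "S03", "S03"]
train_idx, test_idx = subject_wise_split(subject_ids, test_subjects=["S02"])
print(train_idx, test_idx)
```

Unlike a random 75/25 split, this prevents recordings from the same person from appearing on both sides of the split.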

4.2.1. Statistical Approach

In subject wise tests, the performance of both PLS- and PCR-based models degraded slightly, as can be seen in Table 6. PCR models produced slightly better results than PLS. Models built with DelT features had the highest SEP for both PLS and PCR, at 27.78 and 28.57 mg/dL, respectively. The lowest SEP of 25.50 mg/dL was recorded with the PCR model built with first derivative features, which is 1.92 mg/dL lower than that of the best performing PLS-based model. It is important to note that the same model had the lowest SEP in the previous experiments too.

4.2.2. Machine Learning Approach

Subject wise tests were also carried out for SVR- and RFR-based models to compare their SEPs with the previous results. Notably, the performance degraded here as well. The summary of the prediction error for both of these machine learning techniques has been provided in Table 7. The SEPs of the different SVR models were very close to each other, differing only by ±0.1 mg/dL. The same is true for the RFR models. However, SVR models have overall produced slightly better results than RFR. The lowest SEP was recorded to be 26.56 mg/dL for the SVR model built with second derivative characteristic points, whereas RFR had the lowest SEP of 29.6 mg/dL for the first derivative model.
We have also performed k-fold cross-validation on the proposed models. The summary of the experiments and the results are provided in the Supplementary Material Tables S2 and S3.
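For readers unfamiliar with the procedure, k-fold cross-validation partitions the samples into k folds and uses each fold once as the test set. The sketch below generates the index sets without shuffling; the value of k is an assumption (this section does not state it) and the function name is hypothetical.

```python
def k_fold_indices(n_samples, k):
    """Return a list of (train_idx, test_idx) pairs for k contiguous folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # Distribute the remainder over the first folds so sizes differ by at most one.
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds

folds = k_fold_indices(n_samples=10, k=3)  # k=3 is an illustrative choice
for train, test in folds:
    print(test)
```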

5. Discussion

In this study, both statistical learning (PCR and PLS) and machine learning (SVR and RFR) models were built using various approaches to observe and compare their performances in terms of SEP. At first, the dataset was split in a 75:25 ratio into a model building set and an independent test set. A minimum of three PPG signals were acquired from each human subject. The data were split into training and test sets randomly without any prior stratification. This experiment was useful for identifying the dominating features and some of the key model parameters for predicting glucose with the lowest error. Statistical learning methods performed well overall, and first derivative and second derivative features dominated in models built with both statistical and machine learning methods.
The SEP increased by around 8.5 mg/dL in the rigorous subject wise testing. The key observation here is that the models that performed well in the earlier experiment maintained good performance in subject wise testing too. PCR and PLS models built with first derivative characteristic points performed well in both types of experiments. The SEPs achieved with these models were comparatively lower than the others. Furthermore, the lowest SEPs for these models were recorded for the same number of principal components. This shows the possibility of using the first experiment for the selection of dominating features, the optimum model, and key model parameters, which can then be used for real tests.
In this study, PCR- and PLS-based models generally outperformed the models based on SVR and RFR, as seen in Figure 13. This could be attributed to the limited dataset used in the study. Zhang, G. et al. used PPG signals acquired from smartphones for the classification of the glucose level. Their model, with an accuracy of over 80%, can predict one of three diabetic statuses of a subject: normal, borderline, or warning. However, knowing the quantitative information of the actual glucose level is more important than knowing the diabetic status. We have demonstrated that our model can solve that problem by predicting the actual glucose level with an error of less than 20 mg/dL. First derivative and second derivative characteristic points were the dominating features with the machine learning models. SVR-based models performed comparatively well in prediction in both experiments, whereas RFR models performed poorly in both experiments.
Since most of the glucose levels in the dataset were in the range of 70–150 mg/dL, the efficacy of the proposed model is limited to the prediction of glucose levels within that range. We anticipate an improvement in the models with an increase in the amount and variety of the data. Robustness tests, platform independence tests, and user side app development for instant analysis will be performed in the near future.

6. Conclusions

Here, we have investigated a noninvasive blood glucose estimation technique using only fingertip video acquired through a commercially available smartphone camera. Commercially available smartphones were used to acquire video data, which were converted into PPG waveforms. A computational model has also been presented, comprising signal processing techniques for cleaning the data and extracting features, and regression models trained on those features for the quantitative estimation of blood glucose. Smartphone-based data acquisition and disease diagnosis bring significant challenges with motion artifacts and noise. This work, therefore, pays special attention to the data acquisition techniques and the development of algorithms to clean these artifacts. Numerous experiments were conducted with different acquisition techniques to identify a suitable method and duration of data acquisition. The models have been optimized by applying four regression algorithms along with different features. A PLS-based model could predict glucose with a SEP as low as 17.71 mg/dL, which is quite comparable to the overall lowest SEP achieved by PLS, 17.02 mg/dL. The experimental results confirm the usefulness of these first development steps towards a novel smartphone-based noninvasive estimation of blood glucose. Additional experiments are planned for testing robustness and platform independence and, finally, for implementing the technique in a user side app for instant measurement. We believe that the proposed noninvasive technique is important to the community, as the system uses a much more natural and comfortable method of data acquisition compared to the traditional methods, can predict glucose value with high accuracy, and has the potential to acquire and deliver glucose level information through only a smartphone, which is now available to most people.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-3417/11/2/618/s1, Figure S1: (Left) proposed model flow diagram and (Right) data acquisition procedure using smartphone camera; Figure S2: (Left) Corrected baseline PPG Signal of 5 different subjects from SP3 (a) Sub-1 (b) Sub-2 (c) Sub-3 (d) Sub-4 (e) Sub-5 (Right) complete procedure from raw PPG signal to feature extraction. (a) raw PPG; (b) corrected baseline ALS and Savitzky-Golay filter; (c) Gaussian smoothing applied, and (d) feature analysis (Peak-Detection); Figure S3: Feature analysis (Peak-Detection) (a) SP1 signal peaks; (b) SP2 signal peaks, and (c) SP3 signal peaks; Figure S4: Extracting features (peak-detection) after being preprocessed with 1st and 2nd order derivatives; Figure S5: (Top Left) Intentional finger movement at 20, 30 and 40 seconds, (Top Right) PPG signal acquired while flash on, (Bottom Left) PPG signal acquired while the flash is off, (Bottom Right) PPG signal acquired and extracted from Red Channel; Figure S6: (a) A sample Raw PPG signal with high-frequency noises and (b) its filtered version through Gaussian Filter, baseline issues are still evident; Figure S7: (a) A sample Raw PPG signal with high baseline variations and (b) its ALS corrected version no baseline problems, high-frequency noises are still evident, Table S1: 2nd Derivative Features Extracted from PPG signals acquired One Plus 6T, Table S2: K-Fold CV of different approaches for PCR and PLS, Table S3: K-Fold CV of different approaches for SVR and RFR.

Author Contributions

All authors listed contributed equally to the conception, design, analysis, drafting, and revision of this work. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by ICT Ministry & NSU CTRG: 38.

Institutional Review Board Statement

The research protocol was approved by the NSU Institutional Review Board/Ethical Review Committee (IRB/ERC). Ethics Research Review Code: #2019/OR=NSU/IRB-No.0905.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data collection for this project is still ongoing. In order to maintain subject confidentiality, the raw dataset cannot be released at this time.

Acknowledgments

This work was funded by ICT Ministry and NSU CTRG.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. DeFronzo, R.A.; Bonadonna, R.C.; Ferrannini, E. Pathogenesis of NIDDM: A Balanced Overview. Diabetes Care 1992, 15, 318–368.
  2. Bandeiras, C. Advancing Technology for a Healthier Humanity. IEEE Potentials 2020, 39, 6–8.
  3. International Diabetes Federation. Facts & Figures. Available online: https://www.idf.org/aboutdiabetes/what-is-diabetes/facts-figures.html (accessed on 23 October 2020).
  4. Vashist, S.K.; Zheng, D.; Al-Rubeaan, K.; Luong, J.H.; Sheu, F.-S. Technology behind commercial devices for blood glucose monitoring in diabetes management: A review. Anal. Chim. Acta 2011, 703, 124–136.
  5. Akbulut, F.P.; Ikitimur, B.; Akan, A. Wearable sensor-based evaluation of psychosocial stress in patients with metabolic syndrome. Artif. Intell. Med. 2020, 104, 101824.
  6. Nasiri, N. Introductory Chapter: Wearable Technologies for Healthcare Monitoring. In Wearable Devices—Big Wave Innovation; Macquarie University: Sydney, NSW, Australia, 2019.
  7. Chung, J.W.; So, C.-F.; Choi, K.-S.; Wong, T.K. Recent advances in noninvasive glucose monitoring. Med. Devices 2012, 5, 45–52.
  8. Gonzales, W.V.; Mobashsher, A.T.; Abbosh, A.M. The Progress of Glucose Monitoring—A Review of Invasive to Minimally and Non-Invasive Techniques, Devices and Sensors. Sensors 2019, 19, 800.
  9. MIT Technology Review. Carbon Nanotube Sensor Detects Glucose in Saliva. Available online: https://www.technologyreview.com/2013/04/30/253160/carbon-nanotube-sensor-detects-glucose-in-saliva/ (accessed on 23 October 2020).
  10. Eracle, N.A.; Lavinia, R.; Monica, V. A Non-Invasive Glucose Analysis Model with a Carbon Nanotube Sensor. ARS Med. Tomitana 2019, 25, 189–192.
  11. Girigoswami, K.; Akhtar, N. Nanobiosensors and fluorescence based biosensors: An overview. Int. J. Nano Dimens. 2019, 10, 1–17.
  12. Rakhshani, M.R.; Tavousi, A.; Mansouri-Birjandi, M.A. Design of a plasmonic sensor based on a square array of nanorods and two slot cavities with a high figure of merit for glucose concentration monitoring. Appl. Opt. 2018, 57, 7798–7804.
  13. Shaker, G.; Smith, K.; Omer, A.E.; Liu, S.; Csech, C.; Wadhwa, U.; Safavi-Naeini, S.; Hughson, R.L. Non-Invasive Monitoring of Glucose Level Changes Utilizing a mm-Wave Radar System. Int. J. Mob. Hum. Comput. Interact. 2018, 10, 10–29. Available online: https://www.igi-global.com/article/non-invasive-monitoring-of-glucose-level-changes-utilizing-amm-wave-radar-system/207700 (accessed on 23 October 2020).
  14. Hanna, J.; Costantine, J.; Kanj, R.; Eid, A.; Tawk, Y.; Ramadan, A.H. Electromagnetic Based Devices for Non-invasive Glucose Monitoring. In Proceedings of the 2018 IEEE Conference on Antenna Measurements & Applications, Vasteras, Sweden, 3–6 September 2018; pp. 1–4.
  15. Andersen, J.-H.; Bjerke, O.; Blakaj, F.; Flugsrud, V.M.; Jacobsen, F.A.; Jonsson, M.; Kosaka, E.N.; Langstrand, P.A.; Martinsen, Ø.G.; Moen, A.S.; et al. Bioimpedance and NIR for non-invasive assessment of blood glucose. J. Electr. Bioimpedance 2019, 10, 133–138.
  16. Sakaki, H.; Arakawa, M.; Yashiro, S.; Todate, Y.; Ishigaki, Y.; Kanai, H. Ultrasound scattering by aggregated red blood cells in patients with diabetes. J. Med. Ultrason. 2018, 46, 3–14.
  17. Kambayashi, T.; Noguchi, T.; Nojima, A.; Kono, S.; Taniguchi, S.-I.; Ozaki, Y. Glucose Monitoring in Cell Culture with Online Ultrasound-Assisted Near-Infrared Spectroscopy. Anal. Chem. 2020, 92, 2946–2952.
  18. Phan, Q.-H. Optical Polarimetry Technique for Non-Invasive Glucose Concentration Measurement. In Proceedings of the International Society for Optics and Photonics, Novel Optical Systems, Methods, and Applications XXII, San Diego, CA, USA, 9 September 2019; Volume 11105, p. 111050L.
  19. Lan, Y.T.; Kuang, Y.P.; Zhou, L.P.; Wu, G.Y.; Gu, P.C.; Wei, H.J.; Chen, K. Noninvasive monitoring of blood glucose concentration in diabetic patients with optical coherence tomography. Laser Phys. Lett. 2017, 14, 035603.
  20. Jain, P.; Maddila, R.; Joshi, A.M. A precise non-invasive blood glucose measurement system using NIR spectroscopy and Huber’s regression model. Opt. Quantum Electron. 2019, 51, 51.
  21. Jernelv, I.L.; Strøm, K.; Hjelme, D.R.; Aksnes, A. Mid-Infrared Spectroscopy with a Fiber-Coupled Tuneable Quantum Cascade Laser for Glucose Sensing. In Proceedings of the Optical Fibers and Sensors for Medical Diagnostics and Treatment Applications XX, San Diego, CA, USA, 20 February 2020; Volume 11233, p. 1123311.
  22. Saad, W.H.M.; Rahman, N.A.; Karis, M.S.; Chia, S.L.; Karim, S.A.A.; Talib, M.H. Analysis on Continuous Wearable Device for Blood Glucose Detection Using GSR Sensor. Int. J. Nanoelectron. Mater. 2020, 13, 8.
  23. Zhang, W.; Wang, M.L. Saliva Glucose Monitoring System. U.S. Patent 2016/0097734 A1, 7 April 2016.
  24. García-Carmona, L.; Martín, A.; Sempionatto, J.R.; Moreto, J.R.; González, M.C.; Wang, J.; Escarpa, A. Pacifier Biosensor: Toward Noninvasive Saliva Biomarker Monitoring. Anal. Chem. 2019, 91, 13883–13891.
  25. Badugu, R.; Reece, E.A.; Lakowicz, J.R. Glucose-sensitive silicone hydrogel contact lens toward tear glucose monitoring. J. Biomed. Opt. 2018, 23, 1–9.
  26. Hong, Y.S.; Lee, H.; Kim, J.; Lee, M.; Choi, H.J.; Hyeon, T.; Kim, D.-H. Multifunctional Wearable System that Integrates Sweat-Based Sensing and Vital-Sign Monitoring to Estimate Pre-/Post-Exercise Glucose Levels. Adv. Funct. Mater. 2018, 28, 1805754.
  27. Chatterjee, S.; Patel, Z.; Thaha, M.A.; Kyriacou, P. In silico and in vivo investigations using an endocavitary photoplethysmography sensor for tissue viability monitoring. J. Biomed. Opt. 2020, 25, 1–16.
  28. Wang, Q.; Sheng, D.; Zhou, Z.; Liu, Z. Numerical and Experimental Study of the Influence of Device Pressure on PPG Signal Acquisition. In Proceedings of the Optical Interactions with Tissue and Cells XXXI, International Society for Optics and Photonics, San Francisco, CA, USA, 20 February 2020; Volume 1238, p. 1123812.
  29. Zhang, H.; Zhang, W.; Zhou, A. Chapter 4—Smartphone for glucose monitoring. In Smartphone Based Medical Diagnostics; Yoon, J.-Y., Ed.; Academic Press: Cambridge, MA, USA, 2020; pp. 45–65.
  30. Chowdhury, T.T.; Mishma, T.; Osman, S.; Rahman, T. Estimation of Blood Glucose Level of Type-2 Diabetes Patients Using Smartphone Video through PCA-DA. In Proceedings of the 6th International Conference on Networking, Systems and Security, Dhaka, Bangladesh, 17–19 December 2019; pp. 104–108.
  31. Zhang, G.; Mei, Z.; Zhang, Y.; Ma, X.; Lo, B.; Chen, D.; Zhang, Y. A Noninvasive Blood Glucose Monitoring System Based on Smartphone PPG Signal Processing and Machine Learning. IEEE Trans. Ind. Inform. 2020, 16, 7209–7218.
  32. Pires, I.M.; Garcia, N.M.; Garcia, N.M.; Florez, F. From Data Acquisition to Data Fusion: A Comprehensive Review and a Roadmap for the Identification of Activities of Daily Living Using Mobile Devices. Sensors 2016, 16, 184.
  33. Olive, S.; Twentyman, O.; Ramsay, C. Comparison of fingertip and earlobe pulse oximetry with arterial blood gas results. In 1.13 Clinical Problems—Other; European Respiratory Journal: Sheffield, UK, 2016; p. PA3702.
  34. Nam, Y.; Nam, Y.-C. Photoplethysmography Signal Analysis for Optimal Region-of-Interest Determination in Video Imaging on a Built-In Smartphone under Different Conditions. Sensors 2017, 17, 2385.
  35. Rubins, U.; Erts, R.; Nikiforovs, V. The Blood Perfusion Mapping in the Human Skin by Photoplethysmography Imaging. In Proceedings of the XII Mediterranean Conference on Medical and Biological Engineering and Computing 2010, Berlin/Heidelberg, Germany, 27–30 May 2010; pp. 304–306.
  36. Longmore, S.K.; Lui, G.Y.; Naik, G.R.; Breen, P.P.; Jalaludin, B.; Gargiulo, G.D. A Comparison of Reflective Photoplethysmography for Detection of Heart Rate, Blood Oxygen Saturation, and Respiration Rate at Various Anatomical Locations. Sensors 2019, 19, 1874.
  37. Lee, J.; Matsumura, K.; Yamakoshi, K.; Rolfe, P.; Tanaka, S.; Yamakoshi, T. Comparison between Red, Green and Blue Light Reflection Photoplethysmography for Heart Rate Monitoring during Motion. In Proceedings of the 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Osaka, Japan, 3–7 July 2013; pp. 1724–1727.
  38. Po, L.; Xu, X.; Feng, L.; Li, Y.; Cheung, K.; Cheung, C. Frame Adaptive ROI for Photoplethysmography Signal Extraction from Fingertip Video Captured by Smartphone. In Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, Lisbon, Portugal, 24–27 May 2015; pp. 1634–1637.
  39. Grimaldi, D.; Kurylyak, Y.; Lamonaca, F.; Nastro, A. Photoplethysmography Detection by Smartphone’s Videocamera. In Proceedings of the 6th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems, Prague, Czech Republic, 15–17 September 2011; Volume 1, pp. 488–491.
  40. Bolkhovsky, J.B.; Scully, C.G.; Chon, K.H. Statistical Analysis of Heart Rate and Heart Rate Variability Monitoring through the Use of Smart Phone Cameras. In Proceedings of the 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, San Diego, CA, USA, 28 August–1 September 2012; pp. 1610–1613.
  41. Smith, S.W. Digital Signal Processing: A Practical Guide for Engineers and Scientists; Elsevier: Amsterdam, The Netherlands, 2013.
  42. Peng, J.; Peng, S.; Jiang, A.; Wei, J.; Li, C.; Tan, J. Asymmetric least squares for multiple spectra baseline correction. Anal. Chim. Acta 2010, 683, 63–68.
  43. Zhang, Y.; Feng, Z. A SVM Method for Continuous Blood Pressure Estimation from a PPG Signal. In Proceedings of the 9th International Conference on Machine Learning and Computing, New York, NY, USA, 24–26 February 2017; pp. 128–132.
  44. Liu, Z.; Zhou, B.; Li, Y.; Tang, M.; Miao, F. Continuous Blood Pressure Estimation from Electrocardiogram and Photoplethysmogram during Arrhythmias. Front. Physiol. 2020, 11.
Figure 1. System diagram of noninvasive glucose detection using smartphone camera (top) compared with the conventional glucometer-based technique (bottom).
Figure 2. (A) Normalized Photoplethysmography (PPG) signal acquired using (a) Xiaomi Redmi Note 5 Pro, (b) Samsung Galaxy Note 8, and (c) OnePlus 6T phones. (B) Performance evaluation on PPG signal acquisition by the three smartphones.
Figure 3. Representative PPG signal extracted from the (Red) channel, (Green) channel, and (Blue) channel of two video frames of varying quality. The extracted PPG on the left side signal was observed to be more reliable compared to the right side.
Figure 4. Normalized PPG signals with different qualities—a signal with both baseline variations and high-frequency noises (top), a signal with baseline variations (middle), and a good one with no high-frequency noise and little baseline variation (bottom).
Figure 5. Proposed model for the quantitative analysis of glucose level from smartphone video.
Figure 6. Raw PPG signal (top) was first preprocessed with ALS (middle) and then cleaned with a Gaussian filter (bottom). Both high-frequency noise and baseline variations had been removed.
Figure 7. (a) A representative PPG single cycle waveform with systolic and diastolic peaks and DelT; (b) is the first derivative of that cycle, and (c) is the second derivative of the cycle.
Figure 8. Two representative PPG signals—(left): relatively prominent PPG signal and (right): a signal with high baseline issues and noises processed through different steps. (a) Raw PPG Signal; (b) PPG signal preprocessed with ALS and Gaussian filter; (c) first derivative of preprocessed signal, and (d) extracted features of first derivative signal.
Figure 9. Standard Error of Predictions (SEPs) against a different number of principal components (PCs) achieved for Principal Component Regression (PCR) model built with different preprocessing approaches.
Figure 10. SEP against the number of principal components (PCs) for Partial Least Squares Regression (PLS) models built with different preprocessing approaches.
Figure 11. SEP of different Support Vector Regression (SVR) models against the ε values.
Figure 12. SEP of different Random Forest Regression (RFR) models against the number of estimator trees.
Figure 13. Comparison of performances of different regression techniques.
Table 1. Android smartphone camera specifications.

| Smartphone | Megapixel (MP) | Cam Type | Aperture | Sensor (μm) | Stabilization |
|---|---|---|---|---|---|
| MI Note 5 | Cam1: 12 MP | Main cam | f/2.2 | 1.25 | EIS |
| | Cam2: 5 MP | Depth cam | f/2.0 | 1.12 | |
| SM-Note 8 | Cam1: 12 MP | Main cam | f/2.4 | 1.0 | EIS + OIS |
| | Cam2: 12 MP | Telephoto | f/1.7 | 1.4 | |
| OnePlus 6T | Cam1: 16 MP | Main cam | f/1.7 | 1.22 | EIS + OIS |
| | Cam2: 20 MP | Telephoto | f/1.7 | 1.0 | |
Table 2. Comparison of performances of different PCR models.

| Features | SEP (mg/dL) | Number of PCs |
|---|---|---|
| DelT | 21.3 | 2 |
| 1st Derivative | 18.33 | 5 |
| 1st Derivative Characteristic Points | 17.08 | 11 |
| 2nd Derivative | 18.27 | 5 |
| 2nd Derivative Characteristic Points | 22.95 | 2 |
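How a SEP value per number of PCs could be obtained for a PCR model is sketched below. This is only an illustration under stated assumptions: SEP is taken here to be the root-mean-square error of held-out predictions (this section does not give the formula), the k-fold split is a hypothetical validation scheme, and the data are synthetic rather than PPG features.

```python
import numpy as np

def sep(y_true, y_pred):
    """Root-mean-square prediction error (assumed definition of SEP)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def pcr_fit_predict(X_train, y_train, X_test, n_pcs):
    """PCR: project centered X onto the leading PCs, then least squares on the scores."""
    x_mean, y_mean = X_train.mean(axis=0), y_train.mean()
    Xc = X_train - x_mean
    # Rows of Vt are principal directions, ordered by explained variance
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:n_pcs].T
    coef, *_ = np.linalg.lstsq(Xc @ V, y_train - y_mean, rcond=None)
    return (X_test - x_mean) @ V @ coef + y_mean

def sep_by_n_pcs(X, y, max_pcs, n_folds=5, seed=0):
    """Cross-validated SEP for 1..max_pcs principal components."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    result = {}
    for k in range(1, max_pcs + 1):
        pred = np.empty(len(y))
        for test_idx in folds:
            train = np.ones(len(y), dtype=bool)
            train[test_idx] = False
            pred[test_idx] = pcr_fit_predict(X[train], y[train], X[test_idx], k)
        result[k] = sep(y, pred)
    return result

# Synthetic data with three latent factors (hypothetical, not PPG features)
rng = np.random.default_rng(1)
latent = rng.standard_normal((100, 3))
X = latent @ rng.standard_normal((3, 20)) + 0.01 * rng.standard_normal((100, 20))
y = 100 + latent @ np.array([20.0, -10.0, 5.0]) + rng.standard_normal(100)

curve = sep_by_n_pcs(X, y, max_pcs=6)
best_k = min(curve, key=curve.get)  # number of PCs with the lowest SEP
```

Selecting the number of PCs at the minimum of such a curve mirrors how the "Number of PCs" column pairs with each SEP value in the table.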
Table 3. Comparison of performances of different PLS models.

| Features | SEP (mg/dL) | Number of PCs |
|---|---|---|
| DelT | 23.09 | 2 |
| 1st Derivative | 22.09 | 2 |
| 1st Derivative Characteristic Points | 17.02 | 2 |
| 2nd Derivative | 21.37 | 2 |
Table 4. Comparison of performances of different SVR models.

| Features | SEP (mg/dL) |
|---|---|
| 1st Derivative | 18.58 |
| 2nd Derivative | 18.74 |
| 2nd Derivative Characteristic Points | 18.52 |
Table 5. Comparison of performances of different RFR models.

| Features | SEP (mg/dL) | n |
|---|---|---|
| 1st Derivative | 21.88 | 25 |
| 2nd Derivative | 24.19 | 25 |
| 2nd Derivative Characteristic Points | 22.11 | 45 |
Table 6. Subject-wise glucose level prediction error using PCR and PLS models.

| Features | Metric | PCR | PLS |
|---|---|---|---|
| DelT | SEP (mg/dL) | 27.78 | 28.57 |
| | PC | 2 | 2 |
| 1st Derivative | SEP (mg/dL) | 25.50 | 27.42 |
| | PC | 5 | 2 |
| 2nd Derivative | SEP (mg/dL) | 25.9 | 27.37 |
| | PC | 11 | 2 |
| 1st Derivative Characteristic Points | SEP (mg/dL) | 25.6 | 26.44 |
| | PC | 5 | 2 |
Table 7. Subject-wise glucose level prediction error using SVR and RFR models.

| Features | SVR SEP (mg/dL) | RFR SEP (mg/dL) |
|---|---|---|
| 1st Derivative | 26.58 | 29.6 |
| 2nd Derivative | 26.71 | 30.3 |
| 2nd Derivative Characteristic Points | 26.56 | 30.69 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Islam, T.T.; Ahmed, M.S.; Hassanuzzaman, M.; Bin Amir, S.A.; Rahman, T. Blood Glucose Level Regression for Smartphone PPG Signals Using Machine Learning. Appl. Sci. 2021, 11, 618. https://doi.org/10.3390/app11020618

AMA Style

Islam TT, Ahmed MS, Hassanuzzaman M, Bin Amir SA, Rahman T. Blood Glucose Level Regression for Smartphone PPG Signals Using Machine Learning. Applied Sciences. 2021; 11(2):618. https://doi.org/10.3390/app11020618

Chicago/Turabian Style

Islam, Tanvir Tazul, Md Sajid Ahmed, Md Hassanuzzaman, Syed Athar Bin Amir, and Tanzilur Rahman. 2021. "Blood Glucose Level Regression for Smartphone PPG Signals Using Machine Learning" Applied Sciences 11, no. 2: 618. https://doi.org/10.3390/app11020618

APA Style

Islam, T. T., Ahmed, M. S., Hassanuzzaman, M., Bin Amir, S. A., & Rahman, T. (2021). Blood Glucose Level Regression for Smartphone PPG Signals Using Machine Learning. Applied Sciences, 11(2), 618. https://doi.org/10.3390/app11020618

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers.