BagStacking: An Integrated Ensemble Learning Approach for Freezing of Gait Detection in Parkinson’s Disease
<p>BagStacking method overview: <b>D</b>—Bootstrap sampling the training set S, <b>M</b>—Training the base models, <b>P</b>—Apply the base models on the original training set, <b>M’</b>—Train the meta learner on the base models predictions, <b><math display="inline"><semantics> <msub> <mover accent="true"> <mi>y</mi> <mo stretchy="false">^</mo> </mover> <mrow> <mi>b</mi> <mi>a</mi> <mi>g</mi> <mi>s</mi> <mi>t</mi> <mi>a</mi> <mi>c</mi> <mi>k</mi> <mi>i</mi> <mi>n</mi> <mi>g</mi> </mrow> </msub> </semantics></math></b>—Apply base models to new instance; feed outputs to meta-learner for final prediction.</p> "> Figure 2
<p>Raw accelerometer data for the vertical (AccV), mediolateral (AccML), and anteroposterior (AccAP) axes over a 5-s window.</p> "> Figure 3
<p>Examples of feature transformations: time-domain features (mean, standard deviation), frequency-domain features (PSD mean, PSD median), and wavelet-domain features (wavelet coefficient means at levels 0 and 1) for the first five windows.</p> "> Figure 4
<p>AUC comparison of different methods across FOG event classes. BagStacking consistently outperforms other methods in start hesitation, turning, and walking event classes.</p> ">
Abstract
:1. Introduction
- Introduction of BagStacking Method: This study presents BagStacking, a novel ensemble learning approach specifically tailored to detect FOG in Parkinson’s disease patients using accelerometer data. The method combines bagging’s variance reduction and stacking’s blending capability, creating a unique and innovative solution for handling the high variability in FOG data.
- Theoretical Performance Analysis: We establish a theoretical foundation for BagStacking, showing that the method can outperform traditional ensemble techniques such as standalone bagging and stacking.
- Empirical Validation on Real-World Data: Using a robust dataset on FOG episodes, we validate BagStacking’s performance where it achieves higher accuracy (MAP score) and efficiency than established machine learning methods like LightGBM and standard stacking. BagStacking not only increases FOG detection accuracy but also improves runtime, making it suitable for real-time applications in clinical settings.
- Open Source Contribution to the Community: We provide an open-source implementation of BagStacking within the Scikit-learn API framework at https://github.com/SeffiCohen/BagStacking (accessed on 20 November 2024).
2. Related Work
3. Method
- Bootstrap Sampling: Randomly sample with replacement as different subsets of the training data, where .
- Base Models Training: Train the base models on each bootstrap set using cross-validation. Any supervised model such as linear models, SVM, decision trees, GBM, and neural networks can be utilized.
- Base Models Predictions: Apply the base models on the original cross-validated training set to obtain the predicted label vectors as , where .
- Meta Learner Training: Train as a meta-learner; it could be any supervised model trained on the base model outputs optimally. It uses the predicted label vectors P as input and the labels as targets.
- Ensemble Prediction: Apply base models to a new instance. Feed outputs to meta-learner for final prediction.
3.1. BagStacking Theoretical Foundation
3.1.1. Assumptions
- Base models are independently trained on different bootstrapped samples from the dataset S.
- The meta-learner is trained on the outputs of these base models.
- All models are trained to minimize the empirical loss on their respective training sets, subject to regularization constraints.
3.1.2. Step 1: Bagging’s Variance Reduction
3.1.3. Step 2: Stacking’s Model Diversity
3.1.4. Step 3: BagStacking’s Combined Strengths
3.1.5. Step 4: Expected Generalization Performance
4. Experiments
4.1. Dataset
4.2. Preprocessing and Feature Engineering
4.3. Experimental Setup
Selection of Comparison Methods
- Standalone LightGBM Model: LightGBM is a gradient boosting framework that has demonstrated high performance and efficiency in various classification tasks, including those involving large-scale datasets. By comparing BagStacking against a standalone LightGBM model, we aim to assess the added value of our ensemble approach over a powerful single-model baseline.
- Multistrategy Ensemble Learning Method [20]: This method employs diverse ensemble strategies, including bagging, gradient boosting machines (GBM), and random forests, to enhance predictive performance. By incorporating multiple ensemble techniques, the multistrategy ensemble serves as a robust benchmark to evaluate how BagStacking compares to other ensemble methodologies that leverage different mechanisms for improving model accuracy and robustness.
- Classical Stacking Method: Stacking is a widely recognized ensemble technique that combines multiple base models through a meta-learner to achieve superior performance. By including a classical stacking approach with analogous settings, we aim to demonstrate the efficacy of BagStacking’s integrated bagging and stacking strategy against traditional stacking methods that do not incorporate bagging.
4.4. Evaluation Metric
5. Results
5.1. AUC Performance
5.2. Runtime Analysis
5.2.1. Identified Runtime Bottlenecks
5.2.2. Impact of Bagging on Computational Efficiency
5.2.3. Meta-Learner Optimization
5.2.4. Scalability and Parallelization
5.2.5. Comparison with Other Methods
5.2.6. Potential Optimization Strategies
- Dynamic Data Sampling: Adjusting the size of bootstrap samples based on model performance and computational constraints.
- Efficient Meta-Learner Architectures: Employing lightweight meta-learners that require fewer computational resources without compromising performance.
- Incremental Training: Updating base models incrementally as new data arrives, reducing the need for retraining from scratch.
6. Discussion
6.1. Runtime Efficiency and Practical Implications
6.2. Limitations and Future Work
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
FOG | Freezing of gait |
PD | Parkinson’s disease |
MAP | Mean average precision |
PSD | Power spectral density |
FFT | Fast Fourier transform |
FI | Freeze index |
V | Vertical |
ML | Mediolateral |
AP | Anteroposterior |
AUC | Area under the curve |
KNN | K nearest neighbors |
GBM | Gradient boosting machines |
IoT | Internet of Things |
References
- Naghavi, N.; Miller, A.; Wade, E. Towards Real-Time Prediction of Freezing of Gait in Patients with Parkinson’s Disease: Addressing the Class Imbalance Problem. Sensors 2019, 19, 3898. [Google Scholar] [CrossRef] [PubMed]
- Shah, S.Y.; Iqbal, Z.; Rahim, A. Constrained Optimization-Based Extreme Learning Machines with Bagging for Freezing of Gait Detection. Big Data Cogn. Comput. 2018, 2, 31. [Google Scholar] [CrossRef]
- Pintelas, P.; Livieris, I.E. Special Issue on Ensemble Learning and Applications. Algorithms 2020, 13, 140. [Google Scholar] [CrossRef]
- Bose, R.; Dutta, A.; Bhattacharyya, M.; Ghosh, S.; Chakraborty, C.; Chakraborty, C.; Bhattacharyya, P. An ensemble machine learning model based on multiple filtering and supervised attribute clustering algorithm for classifying cancer samples. PeerJ Comput. Sci. 2021, 7, e671. [Google Scholar] [CrossRef] [PubMed]
- Cohen, S.; Katz, O.; Presil, D.; Arbili, O.; Rokach, L. Ensemble Learning For Alcoholism Classification Using EEG Signals. IEEE Sens. J. 2023, 23, 17714–17724. [Google Scholar] [CrossRef]
- Cohen, S.; Lior, E.; Bocher, M.; Rokach, L. Improving severity classification of Hebrew PET-CT pathology reports using test-time augmentation. J. Biomed. Inform. 2024, 149, 104577. [Google Scholar] [CrossRef] [PubMed]
- Hosni, A.; Chawki, Y.; Idri, A.; Bakkoury, Z.; Al Achhab, M.; González, J.R. A systematic mapping study for ensemble classification methods in cardiovascular disease. Artif. Intell. Rev. 2021, 54, 2827–2861. [Google Scholar] [CrossRef]
- Cohen, S.; Dagan, N.; Cohen-Inger, N.; Ofer, D.; Rokach, L. ICU survival prediction incorporating test-time augmentation to improve the accuracy of ensemble-based models. IEEE Access 2021, 9, 91584–91592. [Google Scholar] [CrossRef]
- Moore, S.T.; MacDougall, H.G.; Ondo, W.G. Ambulatory monitoring of freezing of gait in Parkinson’s disease. J. Neurosci. Methods 2008, 167, 340–348. [Google Scholar] [CrossRef]
- Bachlin, M.; Plotnik, M.; Roggen, D.; Inbar, N.; Sagiv, N.; Giladi, N.; Hausdorff, J.M.; Troster, G. Wearable assistant for Parkinson’s disease patients with the freezing of gait syndrome. IEEE Trans. Inf. Technol. Biomed. 2010, 14, 436–446. [Google Scholar] [CrossRef]
- Tahafchi, P.; Molina, R.; Roper, J.A.; Sowalsky, K.; Hass, C.J.; Gunduz, A.; Okun, M.S.; Judy, J.W. Freezing-of-Gait detection using temporal, spatial, and physiological features with a support-vector-machine classifier. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Jeju Island, Republic of Korea, 11–15 July 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 2867–2870. [Google Scholar]
- Abujrida, H.; Agu, E.O.; Pahlavan, K. Machine learning-based motor assessment of Parkinson’s disease using postural sway, gait and lifestyle features on crowdsourced smartphone data. Biomed. Phys. Eng. Express 2020, 6, 035005. [Google Scholar] [CrossRef] [PubMed]
- Shi, B.; Tay, A.; Au, W.L.; Tan, D.M.L.; Chia, N.S.Y.; Yen, S.-C. Detection of freezing of gait using convolutional neural networks and data from lower limb motion sensors. IEEE Trans. Biomed. Eng. 2022, 69, 2256–2267. [Google Scholar] [CrossRef] [PubMed]
- Habib, Z.; Mughal, M.A.; Khan, M.A.; Shabaz, M. WiFOG: Integrating deep learning and hybrid feature selection for accurate freezing of gait detection. Alex. Eng. J. 2024, 86, 481–493. [Google Scholar] [CrossRef]
- Hou, Y.; Ji, J.; Zhu, Y.; Dell, T.; Liu, X. Flexible gel-free multi-modal wireless sensors with edge deep learning for detecting and alerting freezing of gait symptom. IEEE Trans. Biomed. Circuits Syst. 2023, 17, 1010–1021. [Google Scholar] [CrossRef] [PubMed]
- Mo, W.T.; Chan, J.H. Freezing of Gait Prediction Using Deep Learning. In Proceedings of the 13th International Conference on Advances in Information Technology, Bangkok, Thailand, 5–7 July 2023; pp. 1–6. [Google Scholar]
- Chaurasia, V.; Chaurasia, A. Detection of Parkinson’s Disease by Using Machine Learning Stacking and Ensemble Method. Biomed. Mater. Devices 2023, 1, 966–978. [Google Scholar] [CrossRef]
- Mazilu, S.; Calatroni, A.; Gazit, E.; Roggen, D.; Hausdorff, J.M.; Tröster, G. Feature learning for detection and prediction of freezing of gait in Parkinson’s disease. In Proceedings of the 9th International Conference on Machine Learning and Data Mining in Pattern Recognition (MLDM 2013), New York, NY, USA, 19–25 July 2013; Springer: Berlin/Heidelberg, Germany, 2013; pp. 144–158. [Google Scholar]
- Fahira, N.R.; Lawi, A.; Aqsha, M. Early detection model of Parkinson’s Disease using Random Forest Method on voice frequency data. J. Nat. Sci. Math. Res. 2023, 9, 29–37. [Google Scholar] [CrossRef]
- Webb, G.I.; Zheng, Z. Multistrategy ensemble learning: Reducing error by combining ensemble learning techniques. IEEE Trans. Knowl. Data Eng. 2004, 16, 980–991. [Google Scholar] [CrossRef]
- Howard, A.; Salomon, A.; Gazit, E.; Hausdorff, J.; Kirsch, L.; Maggie; Ginis, P.; Holbrook, R.; Karim, Y.F. Parkinson’s Freezing of Gait Prediction. Kaggle. 2023. Available online: https://kaggle.com/competitions/tlvmc-parkinsons-freezing-gait-prediction (accessed on 20 November 2024).
- Salomon, A.; Gazit, E.; Ginis, P.; Urazalinov, B.; Takoi, H.; Yamaguchi, T.; Goda, S.; Lander, D.; Lacombe, J.; Sinha, A.K.; et al. A machine learning contest enhances automated freezing of gait detection and reveals time-of-day effects. Nat. Commun. 2024, 15, 4853. [Google Scholar] [CrossRef] [PubMed]
Training Set | Test Set | |
---|---|---|
Number of Subjects | 920 | 250 |
Total Samples | 20,588,374 | 5,000,000 |
Sampling Rate | 100 Hz/128 Hz | 100 Hz/128 Hz |
Number of FOG Events | 1029 | 270 |
FOG Event Duration (avg) | 3.5 s | 3.6 s |
Non-FOG Samples | 19,558,374 | 4,850,000 |
Method | MAP Score | Runtime (s) |
---|---|---|
BagStacking | 0.306 | 3828 |
Multistrategy Ensemble | 0.249 | 7716 |
LightGBM | 0.234 | 1518 |
Regular Stacking | 0.286 | 8350 |
Method | Start Hesitation AUC | Turning AUC | Walking AUC |
---|---|---|---|
BagStacking | 0.88 | 0.90 | 0.84 |
Multistrategy Ensemble | 0.77 | 0.88 | 0.73 |
Regular Stacking | 0.83 | 0.87 | 0.72 |
LightGBM | 0.80 | 0.87 | 0.70 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cohen, S.; Cohen-Inger, N.; Rokach, L. BagStacking: An Integrated Ensemble Learning Approach for Freezing of Gait Detection in Parkinson’s Disease. Information 2024, 15, 822. https://doi.org/10.3390/info15120822
Cohen S, Cohen-Inger N, Rokach L. BagStacking: An Integrated Ensemble Learning Approach for Freezing of Gait Detection in Parkinson’s Disease. Information. 2024; 15(12):822. https://doi.org/10.3390/info15120822
Chicago/Turabian StyleCohen, Seffi, Nurit Cohen-Inger, and Lior Rokach. 2024. "BagStacking: An Integrated Ensemble Learning Approach for Freezing of Gait Detection in Parkinson’s Disease" Information 15, no. 12: 822. https://doi.org/10.3390/info15120822
APA StyleCohen, S., Cohen-Inger, N., & Rokach, L. (2024). BagStacking: An Integrated Ensemble Learning Approach for Freezing of Gait Detection in Parkinson’s Disease. Information, 15(12), 822. https://doi.org/10.3390/info15120822