[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3647444.3647937acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimmiConference Proceedingsconference-collections
research-article

Risk Factor Prediction for Heart Disease Using Decision Trees

Published: 13 May 2024 Publication History

Abstract

Global mortality rates are high due to heart disease, it is crucial to early analyzing the risk factors of prediction and diagnosis are crucial. Decision trees are a reliable machine-learning method for this problem. Four decision tree algorithms CART (Classification and Regression Trees), C5.0, M5P (Model Tree with Linear Models in Leaves), and XGBoost are tested for heart disease risk factor prediction. An extensive dataset of heart disease risk factors was used to evaluate decision tree algorithms. To assess model performance rigorously, the study used data preprocessing, feature selection, and stratified cross-validation.  The results from the experiments were significant. Despite its simplicity, the CART algorithm predicted heart disease risk factors accurately. The C5.0 algorithm handled missing data well, a common healthcare dataset issue. M5P, which integrates linear models into leaf nodes, improves interpretability without compromising accuracy. XGBoost was the best model, predicting risk factors with 99.32% accuracy. The result shows the algorithm's durability and suitability for complex medical diagnostic tasks.  Decision tree algorithms predict heart disease risk factors effectively and clearly. XGBoost was the best decision tree algorithm due to its accuracy and reliability. This technology's precision suggests it could help doctors make quick, accurate diagnoses, potentially saving lives. This study highlights the importance of using machine learning and decision trees to improve heart disease risk assessment and patient care. Further research could examine the integration of XGBoost and other decision tree models into clinical practice to improve heart disease detection and treatment.

References

[1]
J. Azmi, M. Arif, M. T. Nafis, M. A. Alam, S. Tanweer, and G. Wang, “A systematic review on machine learning approaches for cardiovascular disease prediction using medical big data,” Med. Eng. Phys., vol. 105, no. May, p. 103825, 2022.
[2]
J. Azmi, M. Arif, M. T. Nafis, M. A. Alam, S. Tanweer, and G. Wang, “A systematic review on machine learning approaches for cardiovascular disease prediction using medical big data,” Med. Eng. Phys., vol. 105, no. February, p. 103825, 2022.
[3]
H. B. Kibria and A. Matin, “The severity prediction of the binary and multi-class cardiovascular disease − A machine learning-based fusion approach,” Comput. Biol. Chem., vol. 98, no. November 2021, p. 107672, 2022.
[4]
C. Krittanawong, H. J. Zhang, Z. Wang, M. Aydar, and T. Kitai, “Artificial Intelligence in Precision Cardiovascular Medicine,” J. Am. Coll. Cardiol., vol. 69, no. 21, pp. 2657–2664, 2017.
[5]
S. Uddin, A. Khan, M. E. Hossain, and M. A. Moni, “Comparing different supervised machine learning algorithms for disease prediction,” BMC Med. Inform. Decis. Mak., vol. 19, no. 1, pp. 1–16, 2019.
[6]
Y. Tian, “Short-term exposure to reduced specific-size ambient particulate matter increase the risk of cause-specific cardiovascular disease: A national-wide evidence from hospital admissions,” Ecotoxicol. Environ. Saf., vol. 263, no. August, p. 115327, 2023.
[7]
U. Ahmed, J. C. W. Lin, and G. Srivastava, “Multivariate time-series sensor vital sign forecasting of cardiovascular and chronic respiratory diseases,” Sustain. Comput. Informatics Syst., vol. 38, no. December 2021, 2023.
[8]
V. Chaurasia, “Early Prediction of Heart Diseases Using Data Mining,” Caribb. J. Sci. Technol., vol. 1, pp. 208–217, 2013.
[9]
P. Chinnasamy, S. Arun Kumar, V. Navya, K. Lakshmi Priya, and S. Sruthi Boddu, “Machine learning based cardiovascular disease prediction,” Mater. Today Proc., vol. 64, pp. 459–463, 2022.
[10]
E. Choi, A. Schuetz, W. F. Stewart, and J. Sun, “Using recurrent neural network models for early detection of heart failure onset,” J. Am. Med. Informatics Assoc., vol. 24, no. 2, pp. 361–370, 2017.
[11]
R. Divya, F. D. Shadrach, and S. Padmaja, “Cardiovascular risk detection using Harris Hawks optimization with ensemble learning model on PPG signals,” Signal, Image Video Process., vol. 17, no. 8, pp. 4503–4512, 2023.
[12]
B. P. Doppala, D. Bhattacharyya, M. Chakkravarthy, and T. hoon Kim, “A hybrid machine learning approach to identify coronary diseases using feature selection mechanism on heart disease dataset,” Distrib. Parallel Databases, vol. 41, no. 1–2, pp. 1–20, 2023.
[13]
N. L. Fitriyani, M. Syafrudin, G. Alfian, and J. Rhee, “HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System,” IEEE Access, vol. 8, pp. 133034–133050, 2020.
[14]
U. Haq, J. P. Li, M. H. Memon, S. Nazir, R. Sun, and I. Garciá-Magarinõ, “A hybrid intelligent system framework for the prediction of heart disease using machine learning algorithms,” Mob. Inf. Syst., vol. 2018, 2018.
[15]
M. E. Hossain, S. Uddin, and A. Khan, “Network analytics and machine learning for predictive risk modelling of cardiovascular disease in patients with type 2 diabetes,” Expert Syst. Appl., vol. 164, no. April 2020, p. 113918, 2021.
[16]
S. Kiliçarslan, “PSO + GWO: a hybrid particle swarm optimization and Grey Wolf optimization based Algorithm for fine-tuning hyper-parameters of convolutional neural networks for Cardiovascular Disease Detection,” J. Ambient Intell. Humaniz. Comput., vol. 14, no. 1, pp. 87–97, 2023.
[17]
M. Bhatia and D. Motwani, “Use of Ensemblers Learning for Prediction of Heart Disease,” no. Icoei, pp. 1016–1023, 2020.
[18]
R. Wahab Sait and A. K. Dutta, “Developing a Deep-Learning-Based Coronary Artery Disease Detection Technique Using Computer Tomography Images,” Diagnostics, vol. 13, no. 7, pp. 1–14, 2023.
[19]
IEEE dataport, “Heart Disease Dataset,” IEEE datport. pp. 1–6, 2020, [Online]. Available: https://ieee-dataport.org/open-access/heart-disease-dataset-comprehensive.
[20]
Khetani, Vinit, "Cross-Domain Analysis of ML and DL: Evaluating their Impact in Diverse Domains." International Journal of Intelligent Systems and Applications in Engineering 11.7s (2023): 253-262.
[21]
Saxena, V., Saxena, D., & Singh, U. P. (2022, December). Security Enhancement using Image verification method to Secure Docker Containers. In Proceedings of the 4th International Conference on Information Management & Machine Intelligence (pp. 1-5).
[22]
Singh, U. P., Saxena, V., Kumar, A., Bhari, P., & Saxena, D. (2022, December). Unraveling the Prediction of Fine Particulate Matter over Jaipur, India using Long Short-Term Memory Neural Network. In Proceedings of the 4th International Conference on Information Management & Machine Intelligence (pp. 1-5).
[23]
Mittal, A. K., Singh, U. P., Tiwari, A., Dwivedi, S., Joshi, M. K., & Tripathi, K. C. (2015). Short-term predictions by statistical methods in regions of varying dynamical error growth in a chaotic system. Meteorology and Atmospheric Physics, 127, 457-465.
[24]
Singh, U. P., Mittal, A. K., Dwivedi, S., & Tiwari, A. (2015). Predictability study of forced Lorenz model: an artificial neural network approach. History, 40(181), 27-33.
[25]
Singh, U. P., Mittal, A. K., Dwivedi, S., & Tiwari, A. (2020). Evaluating the predictability of central Indian rainfall on short and long timescales using theory of nonlinear dynamics. Journal of water and Climate Change, 11(4), 1134-1149.
[26]
Singh, U., Pathak, M., Malhotra, R., & Chauhan, M. (2012). Secure communication protocol for ATM using TLS handshake. Journal of Engineering Research and Applications (IJERA), 2(2), 838-948.
[27]
Singh, U. P., & Mittal, A. K. (2021). Testing reliability of the spatial Hurst exponent method for detecting a change point. Journal of Water and Climate Change, 12(8), 3661-3674.
[28]
Tiwari, A., Mittal, A. K., Dwivedi, S., & Singh, U. P. (2015). Nonlinear time series analysis of rainfall over central Indian region using CMIP5 based climate model. Climate Change, 1(4), 411-417.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICIMMI '23: Proceedings of the 5th International Conference on Information Management & Machine Intelligence
November 2023
1215 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. C5.0
  2. CART
  3. Decision Tree
  4. Machine Learning
  5. XGBoost

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICIMMI 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 23
    Total Downloads
  • Downloads (Last 12 months)23
  • Downloads (Last 6 weeks)5
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media