Abstract
With the phenomenal growth of machine learning, privacy protection has become a major concern in medical science. Sensitive medical information, such as chronic kidney disease (CKD) data, is subject to privacy protections and cannot be shared with others without patients’confidentiality and data security. This research study aims to design a secure and effective methodology for developing a predictive model for the early detection of chronic kidney disease. In our research study, we addressed two major problems: accurate diagnosis of chronic kidney disease and data security. The workflow in this research paper is divided into two stages. In the initial phase, to prioritize shorter processing times and improve the model’s performance, we applied two meta-heuristic algorithms, the genetic algorithm and the bat algorithm, as feature selection methods to identify the most relevant features for accurate CKD diagnosis. In the subsequent phase of our study, we proposed a privacy-preserving logistic regression-based chronic kidney disease inference model to perform predictive analysis on encrypted chronic kidney disease data. In this model, we used the Paillier homomorphic cryptosystem that provides strong security assurances, ensuring secure communication and processing of medical data, which maintains patients’confidentiality during diagnosis. Our proposed model, utilizing the genetic algorithm feature selection approach, achieves higher accuracy at 98.75% compared to the original features and bat algorithm selected features. We also observed that this model had less computation time compared to the original features and bat algorithm-selected features. To confirm the efficiency of our proposed model, we compared its performance with those obtained by applying the logistic regression algorithm to plain text. We found that, in all cases, we achieved the same performance as observed with encrypted data.
Similar content being viewed by others
Data availability
This article uses public datasets, which are publicly available at (https://archive.ics.uci.edu/ml).
References
Almasoud M, Ward TE (2019) Detection of chronic kidney disease using machine learning algorithms with least number of predictors. International Journal of Advanced Computer Science and Applications(IJACSA), 10(8). https://doi.org/10.14569/IJACSA.2019.0100813
Ani R, Sasi G, Sankar UR, Deepa O (2016) Decision support system for diagnosis and prediction of chronic renal failure using random subspace classification. In: Paper presented at the 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), IEEE, 1287–1292
Chahar V, Katoch S, Chauhan S (2021) A review on genetic algorithm: past, present, and future. Multimed Tools Appl 80. https://doi.org/10.1007/s11042-020-10139-6
Chen TK, Knicely DH, Grams ME (2019) Chronic kidney disease diagnosis and management: a review. JAMA 322(13):1294–1304. https://doi.org/10.1001/jama.2019.14745
Dare AJ, Fu SH, Patra J, Rodriguez PS, Thakur JS, Jha P (2017). Million Death Study Collaborators. Renal failure deaths and their risk factors in India 2001-13: nationally representative estimates from the Million Death Study. Lancet Glob Health. 5(1):e89-e95. https://doi.org/10.1016/S2214-109X(16)30308-4
Dua D, Graff C (2017). UCI Machine Learning Repository. https://archive.ics.uci.edu/ml
Eroglu K, Palabas T, Ieee. (2016). The Impact on the Classification Performance of the Combined Use of Different Classification Methods and Different Ensemble Algorithms in Chronic Kidney Disease Detection. New York: Paper presented at the 2016 National Conference on Electrical, Electronics, and Biomedical Engineering (ELECO), IEEE, pp. 512–516
Ethiopia (2020) Kidney disease. https://www.worldlifeexpectancy.com/ethiopia-kidney-disease. Accessed 07
GBD Chronic Kidney Disease Collaboration (2020) Global, regional, and national burden of chronic kidney disease, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 395(10225):709–733. https://doi.org/10.1016/S0140-6736(20)30045-3
George C, Mogueo A, Okpechi I, Echouffo-Tcheugui JB, Kengne AP (2017) Chronic kidney disease in low-income to middle-income countries: the case for increased screening. BMJ Glob Health 2(2):e000256. https://doi.org/10.1136/bmjgh-2016-000256
Han X, Zheng X, Wang Y, Sun X, Xiao Y, Tang Y, Qin W (2019). Random forest can accurately predict the development of end-stage renal disease in immunoglobulin nephropathy patients. Ann Transl Med. 7(11):234. https://doi.org/10.21037/atm.2018.12.11
Hill NR, Fatoba ST, Oke JL, Hirst JA, O’Callaghan CA, Lasserson DS, Hobbs FD (2016) Global Prevalence of Chronic Kidney Disease - A Systematic Review and Meta-Analysis. PLoS ONE 11(7):e0158765. https://doi.org/10.1371/journal.pone.0158765
Hossain MM, Detwiler RK, Chang EH et al (2019) Mechanical Anisotropy Assessment in Kidney Cortex Using ARFI Peak Displacement: Preclinical Validation and Pilot In Vivo Clinical Results in Kidney Allografts. IEEE Trans Ultrason Ferroelectr Freq Control 66(3):551–562. https://doi.org/10.1109/tuffc.2018.2865203
Ifraz G.M, Rashid M.H, Tazin T, Bourouis S, Khan M.M (2021). Comparative Analysis for Prediction of Kidney Disease Using Intelligent Machine Learning Methods. Computational and Mathematical Methods in Medicine, 2021. vol. 2021, Article ID 6141470, 10 pages, 2021. https://doi.org/10.1155/2021/6141470
Institute of Medicine (US) Committee on Health Research and the Privacy of Health Information: The HIPAA Privacy Rule. Beyond the HIPAA Privacy Rule: Enhancing Privacy, Improving Health Through Research. Nass SJ, Levit LA, Gostin LO, editors. Washington (DC): National Academies Press (US); 2009
Jeyasingh S, Veluchamy M (2017 ). Modified Bat Algorithm for Feature Selection with the Wisconsin Diagnosis Breast Cancer (WDBC) Dataset. Asian Pac J Cancer Prev.18(5):1257-1264. https://doi.org/10.22034/APJCP.2017.18.5.1257
Krishnamurthy S, Ks K, Dovgan E, Luštrek M, Gradišek Piletič B, Srinivasan K, Li YJ, Gradišek A, Syed-Abdul S (2021) Machine Learning Prediction Models for Chronic Kidney Disease Using National Health Insurance Claim Data in Taiwan. Healthcare (Basel). 9(5):546. https://doi.org/10.3390/healthcare9050546
Levey AS, Atkins R, Coresh J, Cohen EP, Collins AJ, Eckardt KU, Nahas ME, Jaber BL, Jadoul M, Levin A, Powe NR, Rossert J, Wheeler DC, Lameire N, Eknoyan G (2007) Chronic kidney disease as a global public health problem: approaches and initiatives - a position statement from Kidney Disease Improving Global Outcomes. Kidney Int 72(3):247–59. https://doi.org/10.1038/sj.ki.5002343
Mallappallil M, Friedman EA, Delano BG, McFarlane SI, Salifu MO (2014) Chronic kidney disease in the elderly: evaluation and management. Clin Pract (Lond). 11(5):525–535. https://doi.org/10.2217/cpr.14.46
Moahmmed S (2019) A Self-learning Knowledge based System for Diagnosis and Treatment of Chronic Kidney Disease. International Journal of Education and Management Engineering. 9:44–58. https://doi.org/10.5815/ijeme.2019.02.05
Murti Della, Pujianto Utomo, Wibawa Aji, Akbar Muhammad (2019). K-Nearest Neighbor (K-NN) based Missing Data Imputation. 83-88. https://doi.org/10.1109/ICSITech46713.2019.8987530
Nandimath OV (2009) Consent and medical treatment: The legal paradigm in India. Indian J Urol. 25(3):343–7. https://doi.org/10.4103/0970-1591.56202
Noroozi Z, Orooji A, Erfannia L (2023) Analyzing the impact of feature selection methods on machine learning algorithms for heart disease prediction. Sci Rep 13:22588. https://doi.org/10.1038/s41598-023-49962-w
Ogburn M, Turner C, Dahal P (2013) Homomorphic Encryption. Procedia Computer Science. 20:502–509. https://doi.org/10.1016/j.procs.2013.09.310
Paillier Pascal (1999). Public-Key Cryptosystems Based on Composite Degree Residuosity Classes. Advances in Cryptology–EUROCRYPT’99, 223–238 (Springer, Berlin, 1999). https://doi.org/10.1007/3-540-48910-X_16
Papademetriou V, Nylen ES, Doumas M, Probstfield J, Mann JFE, Gilbert RE, Gerstein HC (2017) Chronic Kidney Disease, Basal Insulin Glargine, and Health Outcomes in People with Dysglycemia: The ORIGIN Study. Am J Med 130(12):1465.e27-1465.e39. https://doi.org/10.1016/j.amjmed.2017.05.047
Peng J, Lee K, Ingersoll G (2002) An Introduction to Logistic Regression Analysis and Reporting. Journal of Educational Research - J EDUC RES. 96:3–14. https://doi.org/10.1080/00220670209598786
Polat H, Danaei Mehr H, Cetin A (2017) Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods. J Med Syst 41(4):55. https://doi.org/10.1007/s10916-017-0703-x
Poonia RC, Gupta MK, Abunadi I, Albraikan AA, Al-Wesabi FN, Hamza MA (2022 ). B T. Intelligent Diagnostic Prediction and Classification Models for Detection of Kidney Disease. Healthcare (Basel). 10(2):371. https://doi.org/10.3390/healthcare10020371
Priyanka K (2019) Science BC. Chronic kidney disease prediction based on naive Bayes technique. p. 1653–9
Pudjihartono N, Fadason T, Kempa-Liehr AW, O’Sullivan JM (2022) A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction. Front Bioinform. 27(2):927312. https://doi.org/10.3389/fbinf.2022.927312
Qin Jiongming, Chen Lin, Liu Yuhua, Liu Chuanjun, Feng Changhao, Chen Bin (2019). A Machine Learning Methodology for Diagnosing Chronic Kidney Disease. IEEE Access. PP. 1-1. https://doi.org/10.1109/ACCESS.2019.2963053
Radhakrishnan J, Mohan S (2017) KI Reports and World Kidney Day. Kidney Int Rep. 2(2):125–126. https://doi.org/10.1016/j.ekir.2017.01.014
Rady El Houssainy, Anwar Ayman (2019) Prediction of kidney disease stages using data mining algorithms. Informatics in Medicine Unlocked. 15. https://doi.org/10.1016/j.imu.2019.100178
Rostami M, Berahmand K, Forouzandeh S (2021) A novel community detection based genetic algorithm for feature selection. J Big Data 8:2. https://doi.org/10.1186/s40537-020-00398-3
Salekin A, Stankovic J (2016). Detection of Chronic Kidney Disease and Selecting Important Predictive Attributes, 2016 IEEE International Conference on Healthcare Informatics (ICHI), Chicago, IL, USA, pp. 262-270. https://doi.org/10.1109/ICHI.2016.36
Tekale S, Shingavi P, Wandhekar S, Chatorikar A (2018) Prediction of chronic kidney disease using machine learning algorithm. Disease. 7(10):92–6
Varkey B (2021) Principles of Clinical Ethics and Their Application to Practice. Med Princ Pract 30(1):17–28. https://doi.org/10.1159/000509119
Wickramasinghe M.P.N.M, Perera D.M. ,Kahandawaarachchi K.A.D.C.P (2017). Dietary prediction for patients with Chronic Kidney Disease (CKD) by considering blood potassium level using machine learning algorithms. 300-303
Yang Xin-She (2013) Bat Algorithm: Literature Review and Applications. International Journal of Bio-Inspired Computation. https://doi.org/10.1504/IJBIC.2013.055093
Yang W, Wang S, Cui H, Tang Z, Li Y (2023) A Review of Homomorphic Encryption for Privacy-Preserving Biometrics. Sensors (Basel). 23(7):3566. https://doi.org/10.3390/s23073566
Yashfi Shanila , Islam Md , Pritilata, Sakib Nazmus , Islam Tanzila, Shahbaaz Mohammad, Pantho Sadaf (2020). Risk Prediction Of Chronic Kidney Disease Using Machine Learning Algorithms. 1-5. https://doi.org/10.1109/ICCCNT49239.2020.9225548
Zhang L, Wang F, Wang L, Wang W, Liu B, Liu J, Chen M, He Q, Liao Y, Yu X, Chen N, Zhang JE, Hu Z, Liu F, Hong D, Ma L, Liu H, Zhou X, Chen J, Pan L, Chen W, Wang W, Li X, Wang H (2012) Prevalence of chronic kidney disease in China: a cross-sectional survey. Lancet 379(9818):815–22. https://doi.org/10.1016/S0140-6736(12)60033-6
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
We authors declare that there are no conflicts of interest relevant to this manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gogoi, P., Valan, J.A. Privacy-preserving predictive modeling for early detection of chronic kidney disease. Netw Model Anal Health Inform Bioinforma 13, 16 (2024). https://doi.org/10.1007/s13721-024-00452-7
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13721-024-00452-7