Abstract
Summary
Many predictive tools have been reported for assessing osteoporosis risk. The development and validation of osteoporosis risk prediction models were supported by machine learning.
Introduction
Osteoporosis is a silent disease until it results in fragility fractures. However, early diagnosis of osteoporosis provides an opportunity to detect and prevent fractures. We aimed to develop machine learning approaches to achieve high predictive ability for osteoporosis risk that could help primary care providers identify which women are at increased risk of osteoporosis and should therefore undergo further testing with bone densitometry.
Methods
We included all postmenopausal Korean women from the Korea National Health and Nutrition Examination Surveys (KNHANES V-1, V-2) conducted in 2010 and 2011. Machine learning models using methods such as the k-nearest neighbors (KNN), decision tree (DT), random forest (RF), gradient boosting machine (GBM), support vector machine (SVM), artificial neural networks (ANN), and logistic regression (LR) were developed to predict osteoporosis risk. We analyzed the effect of applying the machine learning algorithms to the raw data and featuring the selected data only where the statistically significant variables were included as model inputs. The accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUROC) were used to evaluate performance among the seven models.
Results
A total of 1792 patients were included in this study, of which 613 had osteoporosis. The raw data consisted of 19 variables and achieved performances (in terms of AUROCs) of 0.712, 0.684, 0.727, 0.652, 0.724, 0.741, and 0.726 for KNN, DT, RF, GBM, SVM, ANN, and LR with fivefold cross-validation, respectively. The feature selected data consisted of nine variables and achieved performances (in terms of AUROCs) of 0.713, 0.685, 0.734, 0.728, 0.728, 0.743, and 0.727 for KNN, DT, RF, GBM, SVM, ANN, and LR with fivefold cross-validation, respectively.
Conclusion
In this study, we developed and compared seven machine learning models to accurately predict osteoporosis risk. The ANN model performed best when compared to the other models, having the highest AUROC value. Applying the ANN model in the clinical environment could help primary care providers stratify osteoporosis patients and improve the prevention, detection, and early treatment of osteoporosis.
Similar content being viewed by others
References
Cosman F, de Beur SJ, LeBoff MS, Lewiecki EM, Tanner B, Randall S, Lindsay R (2014) Clinician’s guide to prevention and treatment of osteoporosis. Osteoporos Int 25:2359–2381
Kim SY, Ok HG, Birkenmaier C, Kim KH (2017) Can denosumab be a substitute, competitor, or complement to bisphosphonates? Korean J Pain 30:86–92
Black DM, Rosen CJ (2016) Clinical Practice. Postmenopausal Osteoporosis. N Engl J Med 374:254–262
Diab DL, Watts NB (2013) Postmenopausal osteoporosis. Curr Opin Endocrinol Diabetes Obes 20:501–509
Kanis JA (1994) Assessment of fracture risk and its application to screening for postmenopausal osteoporosis. Report of a WHO Study Group. World Health Organ Tech Rep Ser 843:1–129
NIH Consensus Development Panel on Osteoporosis Prevention, Diagnosis, and Therapy (2001) Osteoporosis prevention, diagnosis, and therapy. Jama 285:785–795
Gallagher JC (2018) Advances in osteoporosis from 1970 to 2018. Menopause 25:1403–1417
Yedavally-Yellayi S, Ho AM, Patalinghug EM (2019) Update on osteoporosis. Prim Care 46:175–190
Cadarette SM, Jaglal SB, Murray TM, McIsaac WJ, Joseph L, Brown JP (2001) Evaluation of decision rules for referring women for bone densitometry by dual-energy x-ray absorptiometry. Jama 286:57–63
Ma Z, Yang Y, Lin J, Zhang X, Meng Q, Wang B, Fei Q (2016) BFH-OST, a new predictive screening tool for identifying osteoporosis in postmenopausal Han Chinese women. Clin Interv Aging 11:1051–1059
Toh LS, Lai PSM, Wu DB, Bell BG, Dang CPL, Low BY, Wong KT, Guglielmi G, Anderson C (2019) A comparison of 6 osteoporosis risk assessment tools among postmenopausal women in Kuala Lumpur, Malaysia. Osteoporos Sarcopenia 5:87–93
Kim JS, Merrill RK, Arvind V, Kaji D, Pasik SD, Nwachukwu CC, Vargas L, Osman NS, Oermann EK, Caridi JM, Cho SK (2018) Examining the ability of artificial neural networks machine learning models to accurately predict complications following posterior lumbar spine fusion. Spine (Phila Pa 1976) 43:853–860
Lee HC, Yoon HK, Nam K, Cho YJ, Kim TK, Kim WH, Bahk JH (2018) Derivation and validation of machine learning approaches to predict acute kidney injury after cardiac surgery. J. Clin. Med. 7(10):322
Lee HC, Yoon SB, Yang SM, Kim WH, Ryu HG, Jung CW, Suh KS, Lee KH (2018) J Clin Med 7(11):428
Motwani M, Dey D, Berman DS, Germano G, Achenbach S, al-Mallah MH, Andreini D, Budoff MJ, Cademartiri F, Callister TQ, Chang HJ, Chinnaiyan K, Chow BJ, Cury RC, Delago A, Gomez M, Gransar H, Hadamitzky M, Hausleiter J, Hindoyan N, Feuchtner G, Kaufmann PA, Kim YJ, Leipsic J, Lin FY, Maffei E, Marques H, Pontone G, Raff G, Rubinshtein R, Shaw LJ, Stehli J, Villines TC, Dunning A, Min JK, Slomka PJ (2017) Machine learning for prediction of all-cause mortality in patients with suspected coronary artery disease: a 5-year multicentre prospective registry analysis. Eur Heart J 38:500–507
Obermeyer Z, Emanuel EJ (2016) Predicting the future - big data, machine learning, and clinical medicine. N Engl J Med 375:1216–1219
Yoo TK, Kim SK, Kim DW, Choi JY, Lee WH, Oh E, Park EC (2013) Osteoporosis risk prediction for bone mineral density assessment of postmenopausal women using machine learning. Yonsei Med J 54:1321–1330
Kanis JA, Cooper C, Rizzoli R, Reginster JY (2019) European guidance for the diagnosis and management of osteoporosis in postmenopausal women. Osteoporos Int 30:3–44
Siris ES, Adler R, Bilezikian J, Bolognese M, Dawson-Hughes B, Favus MJ, Harris ST, Jan de Beur SM, Khosla S, Lane NE, Lindsay R, Nana AD, Orwoll ES, Saag K, Silverman S, Watts NB (2014) The clinical diagnosis of osteoporosis: a position statement from the National Bone Health Alliance Working Group. Osteoporos Int 25:1439–1443
Bijelic R, Milicevic S, Balaban J (2017) Risk factors for osteoporosis in postmenopausal women. Med Arch 71:25–28
Schnatz PF, Marakovits KA, O'Sullivan DM (2010) Assessment of postmenopausal women and significant risk factors for osteoporosis. Obstet Gynecol Surv 65:591–596
Kweon S, Kim Y, Jang MJ, Kim Y, Kim K, Choi S, Chun C, Khang YH, Oh K (2014) Data resource profile: the Korea National Health and Nutrition Examination Survey (KNHANES). Int J Epidemiol 43:69–77
Wang Q, Luo Z, Huang J, Feng Y, Liu Z (2017) A novel ensemble method for imbalanced data learning: bagging of extrapolation-SMOTE SVM. Comput Intell Neurosci 2017:1827016
Wu CC, Hsu WD, Islam MM, Poly TN, Yang HC, Nguyen PA, Wang YC, Li YJ (2019) An artificial intelligence approach to early predict non-ST-elevation myocardial infarction patients with chest pain. Comput Methods Prog Biomed 173:109–117
Wu CC, Yeh WC, Hsu WD, Islam MM, Nguyen PAA, Poly TN, Wang YC, Yang HC, Jack Li YC (2019) Prediction of fatty liver disease using machine learning algorithms. Comput Methods Prog Biomed 170:23–29
Zhang Z (2016) Introduction to machine learning: k-nearest neighbors. Ann Transl Med 4:218
Lynch CM, Abdollahi B, Fuqua JD, de Carlo AR, Bartholomai JA, Balgemann RN, van Berkel VH, Frieboes HB (2017) Prediction of lung cancer patient survival via supervised machine learning classification techniques. Int J Med Inform 108:1–8
Podgorelec V, Kokol P, Stiglic B, Rozman I (2002) Decision trees: an overview and their use in medicine. J Med Syst 26:445–463
Breiman L (2001) Random forests. Mach Learn 45:5–32
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29:1189–1232
Zhang Z, Zhao Y, Canes A, Steinberg D, Lyashevska O (2019) Predictive analytics with gradient boosting in clinical medicine. Ann Transl Med 7:152
Cortes C, Vapnik V (1995) Support-vector networks. In: Machine learning, vol 20. Kluwer Academic Publisher, Boston, pp 237–297
Papadopoulos MC, Abel PM, Agranoff D, Stich A, Tarelli E, Bell BA, Planche T, Loosemore A, Saadoun S, Wilkins P, Krishna S (2004) A novel and accurate diagnostic test for human African trypanosomiasis. Lancet 363:1358–1363
Rosenblatt F (1958) The perceptron: a probabilistic model for information storage and organization in the brain. Psychol Rev 65:386–408
Dreiseitl S, Ohno-Machado L (2002) Logistic regression and artificial neural network classification models: a methodology review. J Biomed Inform 35:352–359
Buda M, Maki A, Mazurowski MA (2018) A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw 106:249–259
Mehmood A, Maqsood M, Bashir M, Shuyuan Y (2020) A deep Siamese convolution neural network for multi-class classification of Alzheimer disease. Brain Sci. 10(2):84
Panesar SS, D'Souza RN, Yeh FC, Fernandez-Miranda JC (2019) Machine learning versus logistic regression methods for 2-year mortality prognostication in a small, heterogeneous Glioma database. World Neurosurg X 2:100012
The Board of Trustees of The North American Menopause Society (2010) Management of osteoporosis in postmenopausal women: 2010 position statement of The North American Menopause Society. Menopause 17(1):25–54
Rossini M, Adami S, Bertoldo F, Diacinti D, Gatti D, Giannini S, Giusti A, Malavolta N, Minisola S, Osella G, Pedrazzoni M, Sinigaglia L, Viapiana O, Isaia GC (2016) Guidelines for the diagnosis, prevention and management of osteoporosis. Reumatismo 68:1–39
Rubin KH, Holmberg T, Rothmann MJ, Høiberg M, Barkmann R, Gram J, Hermann AP, Bech M, Rasmussen O, Glüer CC, Brixen K (2015) The risk-stratified osteoporosis strategy evaluation study (ROSE): a randomized prospective population-based study. Design and baseline characteristics. Calcif Tissue Int 96:167–179
Rubin KH, Rothmann MJ, Holmberg T, Høiberg M, Möller S, Barkmann R, Glüer CC, Hermann AP, Bech M, Gram J, Brixen K (2018) Effectiveness of a two-step population-based osteoporosis screening program using FRAX: the randomized risk-stratified osteoporosis strategy evaluation (ROSE) study. Osteoporos Int 29:567–578
Crown WH (2019) Real-world evidence, causal inference, and machine learning. Value Health 22:587–592
Sperandei S (2014) Understanding logistic regression analysis. Biochem Med (Zagreb) 24:12–18
Meng J, Sun N, Chen Y, Li Z, Cui X, Fan J, Cao H, Zheng W, Jin Q, Jiang L, Zhu W (2019) Artificial neural network optimizes self-examination of osteoporosis risk in women. J Int Med Res 47:3088–3098
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
None.
Ethics approval
This study was approved by the Kangbuk Samsung Hospital Institutional Review Board. The KNHANES received ethical approval from the Institutional Review Board of the Korea Centers for Disease Control and Prevention. Informed consent was obtained from all participants for inclusion in the surveys.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Shim, JG., Kim, D.W., Ryu, KH. et al. Application of machine learning approaches for osteoporosis risk prediction in postmenopausal women. Arch Osteoporos 15, 169 (2020). https://doi.org/10.1007/s11657-020-00802-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11657-020-00802-8