Abstract
Medical data processing and analysis using machine learning algorithms are a prominent research topic these days. To obtain high performances, most state-of-the-art models must be trained on a large number of labeled datasets. However, manually collecting a large-scale real-world medical data is expensive due to the issues like privacy, security, and reliability. Moreover, the collected samples may be incomplete, i.e., some important data items are missing, which has a significant impact on the performance of the machine learning algorithms, especially deep learning models. In this paper, we present a strategy that uses the idea of adversarial learning to augment the real-world medical samples with the incomplete issue to obtain better performance in predicting patient states. Rather than supplying the incomplete data sample with extra instance-level information as in existing data-complementation methods, our method aims to achieve the complementary effect in the feature level without consuming human effort. The method receives both high-quality data and low-quality data (with serious incomplete issue) to learn a comprehensive feature space where the incomplete samples can be complemented with the knowledge transferred from the high-quality samples by the adversarial learning scheme. From the perspective of feature representations, our method can alleviate the incomplete issue of the low-quality data, which enhances the model performance in the end task. Experiments show that our method works well on a real-world medical dataset, which collected for assessment of pelvic organ prolapse. When compared to machine learning approaches frequently employed in medical data analysis, our approach shows a significant improvement of up to 10%.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Rajkomar A, Dean J, Kohane I (2019) Machine learning in medicine. N Engl J Med 380(14):1347–1358
Kwak GH, Hui P (2019) Deephealth: review and challenges of artificial intelligence in health informatics. arXiv preprint https://arxiv.org/abs/1909.00384
Richesson RL, Sun J, Pathak J, Kho AN, Denny JC (2016) Clinical phenotyping in selected national networks: demonstrating the need for high-throughput, portable, and computational methods. Artif Intell Med 71:57–61
Pathak J, Kho AN, Denny JC (2013) Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. BMJ Publishing Group BMA House, London
Xu Y, Hong K, Tsujii J, Chang EI-C (2012) Feature engineering combined with machine learning and rule-based methods for structured information extraction from narrative clinical discharge summaries. J Am Med Inform Assoc 19(5):824–832
Beaulieu-Jones, BK, Moore JH, Pooled Resource Open-Access ALS Clinical Trials Consortium (2017) Missing data imputation in the electronic health record using deeply learned autoencoders. In: Pacific symposium on biocomputing 2017. World Scientific, pp 207–218
Kim Y-J, Chi M (2018) Temporal belief memory: imputing missing data during RNN training. In: Proceedings of the 27th international joint conference on artificial intelligence (IJCAI-2018)
Zhang Y (2019) Attain: attention-based time-aware LSTM networks for disease progression modeling. In: Proceedings of the 28th international joint conference on artificial intelligence (IJCAI-2019). Macao, China, pp 4369–4375
Gong JJ, Naumann T, Szolovits P, Guttag JV (2017) Predicting clinical outcomes across changing electronic health record systems. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining. pp 1497–1505
Friedman T, Eslick GD, Dietz HP (2018) Risk factors for prolapse recurrence: systematic review and meta-analysis. Int Urogynecol J 29(1):13–21
Alhassan Z, Budgen D, Alshammari R, Al Moubayed N (2020) Predicting current glycated hemoglobin levels in adults from electronic health records: validation of multiple logistic regression algorithm. JMIR Med Inform 8(7):18963
Gu Y, Huang Y, Ly VK, Yaseen A, Miao H (2020) Ehr data analytics and predictions: machine learning methods. In: Yamal J-M, Yaseen A, Maroufy V, Hulin W (eds) Statistics and machine learning methods for EHR data. Chapman and Hall/CRC, London, pp 273–293
Giacomelli I, Jha S, Kleiman R, Page D, Yoon K (2019) Privacy preserving collaborative prediction using random forests. AMIA Summits Transl Sci Proc 2019:248
Negro-Calduch E, Azzopardi-Muscat N, Krishnamurthy RS, Novillo-Ortiz D (2021) Technological progress in electronic health record system optimization: systematic review of systematic literature reviews. Int J Med Inform 152:104507
Wanyan T, Honarvar H, Azad A, Ding Y, Glicksberg BS (2021) Deep learning with heterogeneous graph embeddings for mortality prediction from electronic health records. Data Intell 3(3):329–339
Meng Y, Speier WF, Ong MK, Arnold C (2021) Bidirectional representation learning from transformers using multimodal electronic health record data to predict depression. IEEE J Biomed Health Inform 25:3121–3129
Yi X, Walia E, Babyn P (2019) Generative adversarial network in medical imaging: a review. Med Image Anal 58:101552
Chu J, Dong W, Wang J, He K, Huang Z (2020) Treatment effect prediction with adversarial deep learning using electronic health records. BMC Med Inform Decis Mak 20(4):1–14
Baowaly MK, Lin C-C, Liu C-L, Chen K-T (2019) Synthesizing electronic health records using improved generative adversarial networks. J Am Med Inform Assoc 26(3):228–241
Yang Y, Wu Z, Tresp V, Fasching PA (2019) Categorical Ehr imputation with generative adversarial nets. In: 2019 IEEE international conference on healthcare informatics (ICHI). IEEE, pp 1–10
Weber A, Abrams P, Brubaker L, Cundiff G, Davis G, Dmochowski R, Fischer J, Hull T, Nygaard I, Weidner A (2001) The standardization of terminology for researchers in female pelvic floor disorders. Int Urogynecol J 12(3):178–186
Ganin Y, Lempitsky V (2015) Unsupervised domain adaptation by backpropagation. In: International conference on machine learning. PMLR, pp 1180–1189
Wang L, Tong L, Davis D, Arnold T, Esposito T (2020) The application of unsupervised deep learning in predictive models using electronic health records. BMC Med Res Methodol 20(1):1–9
Chui KT, Tsang KF, Chi HR, Ling BWK, Wu CK (2016) An accurate ECG-based transportation safety drowsiness detection scheme. IEEE Trans Ind Inform 12(4):1438–1452
Acknowledgements
We gratefully acknowledge the help of the Third Affiliated Hospital of Zhengzhou University, which is the data provider in this work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, I declare there exist no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Luo, M., Yang, X. Postpartum pelvic organ prolapse assessment via adversarial feature complementation in heterogeneous data. Neural Comput & Applic 35, 13851–13860 (2023). https://doi.org/10.1007/s00521-021-06869-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06869-9