research-article

Multi-task heterogeneous graph learning on electronic health records

Published: 01 December 2024

Abstract

Learning from electronic health records (EHRs) has attracted growing attention because it can support accurate medical diagnosis. Because EHRs contain rich information describing complex interactions between entities, modeling them as graphs has proven effective in practice. EHRs, however, exhibit a high degree of heterogeneity, sparsity, and complexity, which hampers the performance of most models applied to them. Moreover, existing approaches to modeling EHRs often focus on learning representations for a single task, overlooking the multi-task nature of EHR analysis and limiting generalizability across tasks. To address these limitations, we propose a novel framework for EHR modeling, MulT-EHR (Multi-Task EHR), which leverages a heterogeneous graph to mine the complex relations and model the heterogeneity in EHRs. To mitigate the high level of noise, we introduce a denoising module based on the causal inference framework that adjusts for severe confounding effects and reduces noise in the EHR data. Additionally, since our model adopts a single graph neural network for simultaneous multi-task prediction, we design a multi-task learning module that leverages inter-task knowledge to regularize the training process. Extensive empirical studies on the MIMIC-III and MIMIC-IV datasets show that the proposed method consistently outperforms state-of-the-art designs on four popular EHR analysis tasks: drug recommendation and the prediction of length of stay, mortality, and readmission. Thorough ablation studies demonstrate the robustness of our method to variations in key components and hyperparameters.
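To make the idea of a heterogeneous EHR graph concrete, the following is a minimal sketch in plain Python. It is not the MulT-EHR implementation: the node types (`patient`, `visit`, `diagnosis`, `drug`), the relation names, the record fields, and the helper functions `build_hetero_graph` and `neighbors` are all assumptions made for illustration, and the toy records are invented rather than drawn from MIMIC.

```python
from collections import defaultdict

# Hypothetical toy records; field names and codes are illustrative only.
visits = [
    {"patient": "p1", "visit": "v1", "diagnoses": ["d_htn"], "drugs": ["m_acei"]},
    {"patient": "p1", "visit": "v2", "diagnoses": ["d_htn", "d_dm"], "drugs": ["m_metformin"]},
    {"patient": "p2", "visit": "v3", "diagnoses": ["d_dm"], "drugs": ["m_metformin"]},
]


def build_hetero_graph(records):
    """Build typed node sets and typed edge sets from visit records.

    Edges are keyed by (source type, relation, destination type), so each
    relation can later be handled separately, as heterogeneous graph models
    typically require.
    """
    nodes = defaultdict(set)
    edges = defaultdict(set)
    for r in records:
        nodes["patient"].add(r["patient"])
        nodes["visit"].add(r["visit"])
        edges[("patient", "has_visit", "visit")].add((r["patient"], r["visit"]))
        for d in r["diagnoses"]:
            nodes["diagnosis"].add(d)
            edges[("visit", "has_diagnosis", "diagnosis")].add((r["visit"], d))
        for m in r["drugs"]:
            nodes["drug"].add(m)
            edges[("visit", "has_drug", "drug")].add((r["visit"], m))
    return nodes, edges


def neighbors(edges, relation, src):
    """Return the sorted destination nodes reachable from src via one relation."""
    return sorted(dst for s, dst in edges[relation] if s == src)


nodes, edges = build_hetero_graph(visits)
print({node_type: len(ids) for node_type, ids in sorted(nodes.items())})
print(neighbors(edges, ("visit", "has_diagnosis", "diagnosis"), "v2"))
```

Keeping one edge set per relation type is the design choice that distinguishes this from a homogeneous graph: a graph neural network over such a structure can apply different transformations to, say, visit-to-diagnosis and visit-to-drug edges.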

Information & Contributors

Information

Published In

Neural Networks, Volume 180, Issue C
Dec 2024
1432 pages

Publisher

Elsevier Science Ltd.

United Kingdom

Author Tags

  1. Causal inference
  2. Electronic health records
  3. Graph representation learning
  4. Multi-task learning

Qualifiers

  • Research-article
