[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3442381.3449795acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Online Disease Diagnosis with Inductive Heterogeneous Graph Convolutional Networks

Published: 03 June 2021 Publication History

Abstract

We propose a Healthcare Graph Convolutional Network (HealGCN) to offer disease self-diagnosis service for online users based on Electronic Healthcare Records (EHRs). Two main challenges are focused in this paper for online disease diagnosis: (1) serving cold-start users via graph convolutional networks and (2) handling scarce clinical description via a symptom retrieval system. To this end, we first organize the EHR data into a heterogeneous graph that is capable of modeling complex interactions among users, symptoms and diseases, and tailor the graph representation learning towards disease diagnosis with an inductive learning paradigm. Then, we build a disease self-diagnosis system with a corresponding EHR Graph-based Symptom Retrieval System (GraphRet) that can search and provide a list of relevant alternative symptoms by tracing the predefined meta-paths. GraphRet helps enrich the seed symptom set through the EHR graph when confronting users with scarce descriptions, hence yield better diagnosis accuracy. At last, we validate the superiority of our model on a large-scale EHR dataset.

References

[1]
Yoshua Bengio, Jérôme Louradour, Ronan Collobert, and Jason Weston. 2009. Curriculum learning. In Proceedings of the 26th Annual International Conference on Machine Learning. 41–48.
[2]
Siddharth Biswal, Cao Xiao, Lucas M. Glass, Elizabeth Milkovits, and Jimeng Sun. 2019. Doctor2Vec: Dynamic doctor representation learning for clinical trial recruitment. arXiv preprint arXiv:1911.10395(2019).
[3]
Gerlof Bouma. 2009. Normalized (pointwise) mutual information in collocation extraction. Proceedings of German Society for Computational Linguistics and Language Technology, 31–40.
[4]
Zhengping Che and Yan Liu. 2017. Deep learning solutions to computational phenotyping in health care. In IEEE International Conference on Data Mining Workshops. 1100–1109.
[5]
Edward Choi, Mohammad Taha Bahadori, Andy Schuetz, Walter F Stewart, and Jimeng Sun. 2016. Doctor AI: Predicting clinical events via recurrent neural networks. In Machine Learning for Healthcare Conference. 301–318.
[6]
Edward Choi, Mohammad Taha Bahadori, Elizabeth Searles, Catherine Coffey, Michael Thompson, James Bost, Javier Tejedor-Sojo, and Jimeng Sun. 2016. Multi-layer representation learning for medical concepts. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1495–1504.
[7]
Edward Choi, Mohammad Taha Bahadori, Jimeng Sun, Joshua Kulas, Andy Schuetz, and Walter Stewart. 2016. RETAIN: An interpretable predictive model for healthcare using reverse time attention mechanism. In Advances in Neural Information Processing Systems. 3504–3512.
[8]
Edward Choi, Cao Xiao, Walter Stewart, and Jimeng Sun. 2018. MiME: Multilevel medical embedding of electronic health records for predictive healthcare. In Advances in Neural Information Processing Systems. 4547–4557.
[9]
Edward Choi, Zhen Xu, Yujia Li, Michael W. Dusenberry, Gerardo Flores, Yuan Xue, and Andrew M. Dai. 2019. Graph convolutional transformer: Learning the graphical structure of electronic health records. arXiv preprint arXiv:1906.04716(2019).
[10]
Youngduck Choi, Chill Yi-I Chiu, and David Sontag. 2016. Learning low-dimensional representations of medical concepts. AMIA Summits on Translational Science Proceedings (2016), 41–50.
[11]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the Conference of the Association for Computational Linguistics. 4171–4186.
[12]
Dmitriy Dligach and Timothy Miller. 2018. Learning patient representations from text. arXiv preprint arXiv:1805.02096(2018).
[13]
Wael Farhan, Zhimu Wang, Yingxiang Huang, Shuang Wang, Fei Wang, and Xiaoqian Jiang. 2016. A predictive model for medical events based on contextual embedding of temporal sequences. JMIR Medical Informatics 4, 4 (2016), e39.
[14]
Ferenc Galkó and Carsten Eickhoff. 2018. Biomedical question answering via weighted neural network passage retrieval. In European Conference on Information Retrieval. 523–528.
[15]
Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. 1024–1034.
[16]
Jiawei Han, Yizhou Sun, Xifeng Yan, and Philip S. Yu. 2010. Mining knowledge from databases: An information network analysis approach. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 1251–1252.
[17]
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In Proceedings of the 26th International Conference on World Wide Web. 173–182.
[18]
Emily Herrett, Arlene M. Gallagher, Krishnan Bhaskaran, Harriet Forbes, Rohini Mathur, Tjeerd van Staa, and Liam Smeeth. 2015. Data resource profile: Clinical practice research datalink (CPRD). International Journal of Epidemiology 44 (2015), 827–836.
[19]
Bhagya Hettige, Weiqing Wang, Yuan-Fang Li, Suong Le, and Wray Buntine. 2020. MedGraph: Structural and Temporal Representation Learning of Electronic Medical Records. In Proceedings of the 24th European Conference on Artificial Intelligence (ECAI).
[20]
Anahita Hosseini, Ting Chen, Wenjun Wu, Yizhou Sun, and Majid Sarrafzadeh. 2018. HeteroMed: Heterogeneous information network for medical diagnosis. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 763–772.
[21]
Anahita Hosseini, Tyler Davis, and Majid Sarrafzadeh. 2019. Hierarchical target-attentive diagnosis prediction in heterogeneous information networks. In International Conference on Data Mining Workshops. 949–957.
[22]
Zhiheng Huang, Wei Xu, and Kai Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991(2015).
[23]
Alistair EW. Johnson, Tom J. Pollard, Lu Shen, H. Lehman Li-wei, Mengling Feng, Mohammad Ghassemi, Benjamin Moody, Peter Szolovits, Leo Anthony Celi, and Roger G. Mark. 2016. MIMIC-III, a freely accessible critical care database. Scientific Data 3(2016).
[24]
Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. Lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems. 3146–3154.
[25]
Yoon Kim. 2014. Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882(2014).
[26]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980(2014).
[27]
Thomas N. Kipf and Max Welling. 2016. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907(2016).
[28]
Yikuan Li, Shishir Rao, Jose Roberto Ayala Solares, Abdelaali Hassaine, Dexter Canoy, Yajie Zhu, Kazem Rahimi, and Gholamreza Salimi-Khorshidi. 2019. BEHRT: Transformer for electronic health records. arXiv preprint arXiv:1907.09538(2019).
[29]
Rong-Ho Lin. 2009. An intelligent model for liver disease diagnosis. Artificial Intelligence in Medicine 47 (2009), 53–62.
[30]
Fenglong Ma, Radha Chitta, Jing Zhou, Quanzeng You, Tong Sun, and Jing Gao. 2017. Dipole: Diagnosis prediction in healthcare via attention-based bidirectional recurrent neural networks. In Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining. 1903–1911.
[31]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781(2013).
[32]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems. 3111–3119.
[33]
James Mullenbach, Sarah Wiegreffe, Jon Duke, Jimeng Sun, and Jacob Eisenstein. 2018. Explainable prediction of medical codes from clinical text. arXiv preprint arXiv:1802.05695(2018).
[34]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems. 8024–8035.
[35]
Sanjay Purushotham, Chuizheng Meng, Zhengping Che, and Yan Liu. 2017. Benchmark of deep learning models on large healthcare MIMIC datasets. arXiv preprint arXiv:1710.08531(2017).
[36]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2012. BPR: Bayesian personalized ranking from implicit feedback. arXiv preprint arXiv:1205.2618(2012).
[37]
Jyoti Soni, Ujma Ansari, Dipesh Sharma, and Sunita Soni. 2011. Predictive data mining for medical diagnosis: An overview of heart disease prediction. International Journal of Computer Applications 17 (2011), 43–48.
[38]
Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, and Tat-Seng Chua. 2019. Neural graph collaborative filtering. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 165–174.
[39]
Zifeng Wang, Yifan Yang, Rui Wen, Xi Chen, Shao-Lun Huang, and Yefeng Zheng. 2021. Lifelong Learning based Disease Diagnosis on Clinical Notes. In Proceedings of the 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD).
[40]
Xing Wei and Carsten Eickhoff. 2018. Embedding electronic health records for clinical information retrieval. arXiv preprint arXiv:1811.05402(2018).
[41]
Cheng-Hsiung Weng, Tony Cheng-Kui Huang, and Ruo-Ping Han. 2016. Disease prediction with different types of neural network classifiers. Telematics and Informatics 33 (2016), 277–292.
[42]
Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L. Hamilton, and Jure Leskovec. 2018. Graph convolutional neural networks for web-scale recommender systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 974–983.
[43]
Jinghe Zhang, Kamran Kowsari, James H. Harrison, Jennifer M. Lobo, and Laura E. Barnes. 2018. Patient2Vec: A personalized interpretable deep representation of the longitudinal electronic health record. IEEE Access 6(2018), 65333–65346.
[44]
Xianli Zhang, Buyue Qian, Yang Li, Yang Liu, Xi Chen, Chong Guan, Yefeng Zheng, and Chen Li. 2021. Learning Robust Patient Representations from Multi-modal Electronic Health Records: A Supervised Deep Learning Approach. In Proceedings of the 2021 SIAM International Conference on Data Mining (SDM).
[45]
Zihao Zhu, Changchang Yin, Buyue Qian, Yu Cheng, Jishang Wei, and Fei Wang. 2016. Measuring patient similarities via a deep architecture with medical concept embedding. In IEEE 16th International Conference on Data Mining. 749–758.

Cited By

View all
  • (2024)Few-Shot Synthetic Online Transfer Learning for Cross-Site Neurological Disease DiagnosisIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.327056911:2(2201-2209)Online publication date: Apr-2024
  • (2024)Multi-masks and Bi-spaces Reconstruction based Single-Layer Auto-encoder for Heterogeneous Graph Representation Learning2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650586(1-9)Online publication date: 30-Jun-2024
  • (2024)ExpertODE: Continuous Diagnosis Prediction with Expert Enhanced Neural Ordinary Differential Equations2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687504(1-6)Online publication date: 15-Jul-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '21: Proceedings of the Web Conference 2021
April 2021
4054 pages
ISBN:9781450383127
DOI:10.1145/3442381
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. disease diagnosis
  2. graph neural network
  3. online healthcare service

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WWW '21
Sponsor:
WWW '21: The Web Conference 2021
April 19 - 23, 2021
Ljubljana, Slovenia

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)66
  • Downloads (Last 6 weeks)7
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Few-Shot Synthetic Online Transfer Learning for Cross-Site Neurological Disease DiagnosisIEEE Transactions on Computational Social Systems10.1109/TCSS.2023.327056911:2(2201-2209)Online publication date: Apr-2024
  • (2024)Multi-masks and Bi-spaces Reconstruction based Single-Layer Auto-encoder for Heterogeneous Graph Representation Learning2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650586(1-9)Online publication date: 30-Jun-2024
  • (2024)ExpertODE: Continuous Diagnosis Prediction with Expert Enhanced Neural Ordinary Differential Equations2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687504(1-6)Online publication date: 15-Jul-2024
  • (2024)Knowledge-Routed Automatic Diagnosis With Heterogeneous Patient-Oriented GraphIEEE Access10.1109/ACCESS.2024.341241912(89573-89584)Online publication date: 2024
  • (2024)Graph neural networks for clinical risk prediction based on electronic health recordsJournal of Biomedical Informatics10.1016/j.jbi.2024.104616151:COnline publication date: 2-Jul-2024
  • (2024)Feature aggregation-based multi-relational knowledge reasoning for COPD intelligent diagnosisComputers and Electrical Engineering10.1016/j.compeleceng.2023.109068114(109068)Online publication date: Mar-2024
  • (2024)GTP-4o: Modality-Prompted Heterogeneous Graph Learning for Omni-Modal Biomedical RepresentationComputer Vision – ECCV 202410.1007/978-3-031-73235-5_10(168-187)Online publication date: 30-Sep-2024
  • (2023)Bayesian-Based Symptom Screening for Medical Dialogue Diagnosis2023 IEEE Symposium on Computers and Communications (ISCC)10.1109/ISCC58397.2023.10218219(1-6)Online publication date: 9-Jul-2023
  • (2023)Enhancing Personalized Healthcare via Capturing Disease Severity, Interaction, and Progression2023 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM58522.2023.00173(1349-1354)Online publication date: 1-Dec-2023
  • (2023)A comprehensive review of machine learning algorithms and their application in geriatric medicine: present and futureAging Clinical and Experimental Research10.1007/s40520-023-02552-235:11(2363-2397)Online publication date: 8-Sep-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media