[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Medical knowledge graph completion via fusion of entity description and type information

Published: 01 May 2024 Publication History

Abstract

Medical Knowledge Graphs (MKGs) are vital in propelling big data technologies in healthcare and facilitating the realization of medical intelligence. However, large-scale MKGs often exhibit characteristics of data sparsity and missing facts. Following the latest advances, knowledge embedding addresses these problems by performing knowledge graph completion. Most knowledge embedding algorithms rely solely on triplet structural information, overlooking the rich information hidden within entity property sets, leading to bottlenecks in performance enhancement when dealing with the intricate relations of MKGs. Inspired by the semantic sensitivity and explicit type constraints unique to the medical domain, we propose BioBERT-based graph embedding model. This model represents an evolvable framework that integrates graph embedding, language embedding, and type information, thereby optimizing the utility of MKGs. Our study utilizes not only WordNet as a benchmark dataset but also incorporates MedicalKG to compare and corroborate the specificity of medical knowledge. Experimental results on these datasets indicate that the proposed fusion framework achieves state-of-art (SOTA) performance compared to other baselines. We believe that this incremental improvement provides promising insights for future medical knowledge graph completion endeavors.

Highlights

A novel fusion framework is introduced for medical knowledge graph completion.
Our methodology incorporates a type-sensitive learning strategy that ensure higher precision.
We improve link prediction with an ontology language model and if-else logic.
Research shows that the medical knowledge graph is more sensitive to semantic information.
Our method surpasses baselines in WordNet and MedicalKG, highlighting its applicability.

References

[1]
Chen A., A novel graph methodology for analyzing disease risk factor distribution using synthetic patient data, Healthc Anal (ISSN ) 2 (2022).
[2]
Pham T., Tao X., Zhang J., Yong J., Li Y., Xie H., Graph-based multi-label disease prediction model learning from medical data and domain knowledge, Knowl-Based Syst (ISSN ) 235 (2022).
[3]
Paulheim H., Knowledge graph refinement: A survey of approaches and evaluation methods, Cimiano P. (Ed.), Semantic Web (ISSN ) 8 (3) (2017) 489–508.
[4]
Wang M., He X., Liu L., Qing L., Chen H., Liu Y., et al., Medical visual question answering based on question-type reasoning and semantic space constraint, Artif Intell Med (ISSN ) 131 (2022).
[5]
Liu Z., Sun M., Lin Y., Xie R., Knowledge representation learning: A review, J Comput Res Dev 53 (2) (2016) 247–261.
[6]
Balazevic I., Allen C., Hospedales T., Tucker: Tensor factorization for knowledge graph completion, in: Proceedings of EMNLP-IJCNLP, Association for Computational Linguistics, 2019.
[7]
Daniali M., Galer P.D., Lewis-Smith D., Parthasarathy S., Kim E., Salvucci D.D., et al., Enriching representation learning using 53 million patient notes through human phenotype ontology embedding, Artif Intell Med 139 (2023).
[8]
Rossi A., Barbosa D., Firmani D., Matinata A., Merialdo P., Knowledge graph embedding for link prediction: a comparative analysis, ACM Trans Knowl Discov Data 15 (2) (2021) 14:1–14:49.
[9]
Wang Q., Mao Z., Wang B., Li G., Knowledge graph embedding: A survey of approaches and applications, IEEE Trans Knowl Data Eng 29 (12) (2017) 2724–2743.
[10]
Bordes A, Usunier N, Garcia-Duran A, Weston J, Yakhnenko O. Translating Embeddings for Modeling Multi-Relational Data. In: Proceedings of NIPS. 2013, p. 1–9.
[11]
Wang Z, Zhang J, Feng J, Chen Z. Knowledge Graph Embedding by Translating on Hyperplanes. In: Proceedings of AAAI. 2014, p. 1112–9.
[12]
Lin Y, Liu Z, Sun M, Liu Y, Zhu X. Learning Entity and Relation Embeddings for Knowledge Graph Completion. In: Proceedings of the AAAI. 2015, p. 2181–7.
[13]
Ji G, He S, Xu L, Liu K, Zhao J. Knowledge Graph Embedding via Dynamic Mapping Matrix. In: Proceedings of ACL. 2015, p. 687–96.
[14]
Hou M., Wei R., Lu L., Lan X., Cai H., Research review of knowledge graph and its application in medical domain, J Comput Res Dev 55 (12) (2018) 2587–2599.
[15]
Zhang Z., Parulian N., Ji H., Elsayed A., Myers S., Palmer M., Fine-grained information extraction from biomedical literature based on knowledge-enriched abstract meaning representation, in: Proceedings of ACL, Association for Computational Linguistics, 2021, pp. 6261–6270. Online.
[16]
Wang H., Wu Y., Gao C., Deng Y., Zhang F., Huang J., et al., Medication combination prediction using temporal attention mechanism and simple graph convolution, IEEE J Biomed Health Inf 25 (10) (2021) 3995–4004.
[17]
Su Q., Ou E., Sun Y., Lv C., Xie G., Wang H., et al., SimH: A novel representation learning model with activation and projection mechanisms for COVID-19 knowledge bases, IEEE J Biomed Health Inf (2022) 1–11.
[18]
Sun Z., Yin H., Chen H., Chen T., Cui L., Yang F., Disease prediction via graph neural networks, IEEE J Biomed Health Inf 25 (3) (2021) 818–826.
[19]
Nicholas M., Vlaeyen J.W.S., Rief W., Barke A., Aziz Q., Benoliel R., et al., The IASP classification of chronic pain for ICD-11: Chronic primary pain, PAIN 160 (1) (2019) 28.
[20]
Pahor M., Chrischilles E.A., Guralnik J.M., Brown S.L., Wallace R.B., Carbonin P., Drug data coding and analysis in epidemiologic studies, Eur J Epidemiol 10 (4) (1994) 405–411.
[21]
Zhang Z., Cai J., Zhang Y., Wang J., Learning hierarchy-aware knowledge graph embeddings for link prediction, in: Proceedings of AAAI, 34, 2020, pp. 3065–3072.
[22]
Miller G.A., WordNet: A lexical database for english, Commun ACM 38 (11) (1995) 39–41.
[23]
Bordes A, Weston J, Collobert R, Bengio Y. Learning Structured Embeddings of Knowledge Bases. In: Proceedings of AAAI. 2011, p. 301–6.
[24]
Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed Representations of Words and Phrases and Their Compositionality. In: Proceedings of NIPS. 2013, p. 3111–9.
[25]
Zhang W., Paudel B., Zhang W., Bernstein A., Chen H., Interaction embeddings for prediction and explanation in knowledge graphs, in: Proceedings of WSDM, WSDM ’19, Association for Computing Machinery, New York, NY, USA, ISBN 978-1-4503-5940-5, 2019, pp. 96–104.
[26]
Sun Z, Deng Z-H, Nie J-Y, Tang J. RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space. In: Proceedings of ICLR. 2019.
[27]
Zhang S., Tay Y., Yao L., Liu Q., Quaternion Knowledge Graph Embeddings, Advances in neural information processing systems, vol. 32, Curran Associates, Inc., 2019.
[28]
Nguyen D.Q., Vu T., Nguyen T.D., Phung D., QuatRE: relation-aware quaternions for knowledge graph embeddings, in: Companion proceedings of the web conference 2022, WWW ’22, Association for Computing Machinery, New York, NY, USA, ISBN 978-1-4503-9130-6, 2022, pp. 189–192.
[29]
Bordes A., Glorot X., Weston J., Bengio Y., A semantic matching energy function for learning with multi-relational data, Mach Learn (ISSN ) 94 (2) (2014) 233–259.
[30]
Bordes A, Glorot X, Weston J. Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing. In: Proceedings of AISTATs. 2012, p. 127–35.
[31]
Balazevic I., Allen C., Hospedales T., TuckER: tensor factorization for knowledge graph completion, in: Proceedings of EMNLP-IJCNLP, Association for Computational Linguistics, Hong Kong, China, 2019, pp. 5185–5194.
[32]
Nickel M., Tresp V., Kriegel H., A three-way model for collective learning on multi-relational data, in: Proceedings of ICML, ACM, 2011, pp. 809–816.
[33]
Yang B, Yih SW-t, He X, Gao J, Deng L. Embedding Entities and Relations for Learning and Inference in Knowledge Bases. In: Proceedings of ICLR. 2015.
[34]
Trouillon T., Welbl J., Riedel S., Gaussier E., Bouchard G., Complex embeddings for simple link prediction, in: Proceedings of ICML, PMLR, 2016, pp. 2071–2080.
[35]
Demir C., Ngomo A.-C.N., Convolutional complex knowledge graph embeddings, in: The semantic web: 18th international conference, ESWC 2021, virtual event, June 6–10, 2021, proceedings, Springer-Verlag, Berlin, Heidelberg, ISBN 978-3-030-77384-7, 2021, pp. 409–424.
[36]
Dettmers T, Minervini P, Stenetorp P, Riedel S. Convolutional 2D Knowledge Graph Embeddings. In: Proceedings of AAAI, vol. 32. 2018, p. 1811–8.
[37]
Vashishth S., Sanyal S., Nitin V., Agrawal N., Talukdar P., InteractE: improving convolution-based knowledge graph embeddings by increasing feature interactions, in: Proceedings of AAAI, vol. 34, 2020, pp. 3009–3016.
[38]
Miotto R., Wang F., Wang S., Jiang X., Dudley J.T., Deep learning for healthcare: Review, opportunities and challenges, Brief Bioinform 19 (6) (2017) 1236–1246.
[39]
Zhang A., Xing L., Zou J., Wu J.C., Shifting machine learning for healthcare from development to deployment and from models to data, Nat Biomed Eng 6 (12) (2022) 1330–1345.
[40]
Bechhofer S., Harmelen F., Hendler J., Horrocks I., Mcguinness D., Patel-Schneider P., et al., OWL web ontology language reference, 2004.
[41]
Uschold M., Gruninger M., Ontologies: principles, methods and applications, Knowl Eng Rev 11 (2) (1996) 93–136.
[42]
Devlin J, Chang M-W, Lee K, Toutanova K. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. In: Proceedings of NAACL, vol. 1. 2019, p. 4171–86.
[43]
Mikolov T, Chen K, Corrado G, Dean J. Efficient Estimation of Word Representations in Vector Space. In: Proceedings of ICLR. 2013.
[44]
Lee J., Yoon W., Kim S., Kim D., Kim S., So C.H., et al., BioBERT: A pre-trained biomedical language representation model for biomedical text mining, Wren J. (Ed.), Bioinformatics (ISSN ) 36 (2019) 1234–1240.
[45]
Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A.N., et al., Attention is all you need, Adv Neural Inf Process Syst (2017).
[46]
van der Maaten L., Hinton G., Visualizing Data using T-SNE, J Mach Learn Res 9 (86) (2008) 2579–2605.

Cited By

View all
  • (2024)Improving embedding-based link prediction performance using clusteringJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2024.10218136:8Online publication date: 1-Oct-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Artificial Intelligence in Medicine
Artificial Intelligence in Medicine  Volume 151, Issue C
May 2024
297 pages

Publisher

Elsevier Science Publishers Ltd.

United Kingdom

Publication History

Published: 01 May 2024

Author Tags

  1. Link prediction
  2. Entity property set
  3. Graph embeddings
  4. Language embeddings
  5. BioBERT
  6. Information fusion

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Improving embedding-based link prediction performance using clusteringJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2024.10218136:8Online publication date: 1-Oct-2024

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media