Abstract
MicroRNAs (miRNAs) are closely related to numerous complex human diseases, therefore, exploring miRNA-disease associations (MDAs) can help people gain a better understanding of complex disease mechanism. An increasing number of computational methods have been developed to predict MDAs. However, the sparsity of the MDAs may hinder the performance of many methods. In addition, many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes. In this study, we propose a deep matrix factorization model with variational autoencoder (DMFVAE) to predict potential MDAs. DMFVAE first decomposes the original association matrix and the enhanced association matrix, in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method, to obtain sparse vectors and dense vectors, respectively. Then, the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors, and meanwhile, node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors. Finally, sample features are acquired by combining the latent vectors and network structure embedding vectors, and the final prediction is implemented by convolutional neural network with channel attention. To evaluate the performance of DMFVAE, we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well. Furthermore, case studies on lung neoplasms, colon neoplasms, and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Gebert L F R, MacRae I J. Regulation of microRNA function in animals. Nature Reviews Molecular Cell Biology, 2019, 20(1): 21–37
Van Meter E N, Onyango J A, Teske K A. A review of currently identified small molecule modulators of microRNA function. European Journal of Medicinal Chemistry, 2020, 188: 112008
Hammond S M. An overview of microRNAs. Advanced Drug Delivery Reviews, 2015, 87: 3–14
Patanè S. The complex miRNAs-p53 signaling network in cardiovascular disease. Journal of the American College of Cardiology, 2017, 69(16): 2099–2100
Wang X, He Y, Mackowiak B, Gao B. MicroRNAs as regulators, biomarkers and therapeutic targets in liver diseases. Gut, 2021, 70(4): 784–795
Niu Y W, Wang G H, Yan G Y, Chen X. Integrating random walk and binary regression to identify novel miRNA-disease association. BMC Bioinformatics, 2019, 20(1): 59
Chen X, Xie D, Zhao Q, You Z H. MicroRNAs and complex diseases: from experimental results to computational models. Briefings in Bioinformatics, 2019, 20(2): 515–539
Huang L, Zhang L, Chen X. Updated review of advances in microRNAs and complex diseases: taxonomy, trends and challenges of computational models. Briefings in Bioinformatics, 2022, 23(5): bbac358
Huang L, Zhang L, Chen X. Updated review of advances in microRNAs and complex diseases: experimental results, databases, webservers and data fusion. Briefings in Bioinformatics, 2022, 23(6): bbac397
Huang L, Zhang L, Chen X. Updated review of advances in microRNAs and complex diseases: towards systematic evaluation of computational models. Briefings in Bioinformatics, 2022, 23(6): bbac407
Dai L Y, Liu J X, Zhu R, Wang J, Yuan S S. Logistic weighted profile-based bi-random walk for exploring miRNA-disease associations. Journal of Computer Science and Technology, 2021, 36(2): 276–287
Xuan P, Wang D, Cui H, Zhang T, Nakaguchi T. Integration of pairwise neighbor topologies and miRNA family and cluster attributes for miRNA–disease association prediction. Briefings in Bioinformatics, 2022, 23(1): bbab428
Xu J, Zhu W, Cai L, Liao B, Meng Y, Xiang J, Yuan D, Tian G, Yang J. LRMCMDA: predicting miRNA-disease association by integrating low-rank matrix completion with miRNA and disease similarity information. IEEE Access, 2020, 8: 80728–80738
Chen X, Yin J, Qu J, Huang L. MDHGI: matrix decomposition and heterogeneous graph inference for miRNA-disease association prediction. PLoS Computational Biology, 2018, 14(8): e1006418
Chen X, Sun L G, Zhao Y. NCMCMDA: miRNA–disease association prediction through neighborhood constraint matrix completion. Briefings in Bioinformatics, 2021, 22(1): 485–496
Lu X, Li J, Zhu Z, Yuan Y, Chen G, He K. Predicting miRNA-disease associations via combining probability matrix feature decomposition with neighbor learning. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19(6): 3160–3170
Zhang Y, Lei X, Fang Z, Pan Y. CircRNA-disease associations prediction based on metapath2vec++ and matrix factorization. Big Data Mining and Analytics, 2020, 3(4): 280–291
Xuan P, Sun H, Wang X, Zhang T, Pan S. Inferring the disease-associated miRNAs based on network representation learning and convolutional neural networks. International Journal of Molecular Sciences, 2019, 20(15): 3648
Li Z, Li J, Nie R, You Z H, Bao W. A graph auto-encoder model for miRNA-disease associations prediction. Briefings in Bioinformatics, 2021, 22(4): bbaa240
Ding Y, Tian L P, Lei X, Liao B, Wu F X. Variational graph autoencoders for miRNA-disease association prediction. Methods, 2021, 192: 25–34
Liu W, Lin H, Huang L, Peng L, Tang T, Zhao Q, Yang L. Identification of miRNA–disease associations via deep forest ensemble learning based on autoencoder. Briefings in Bioinformatics, 2022, 23(3): bbac104
Zeng M, Lu C, Fei Z, Wu F X, Li Y, Wang J, Li M. DMFLDA: a deep learning framework for predicting lncRNA–disease associations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2021, 18(6): 2353–2363
Lu C, Zeng M, Zhang F, Wu F X, Li M, Wang J. Deep matrix factorization improves prediction of human circRNA-disease associations. IEEE Journal of Biomedical and Health Informatics, 2021, 25(3): 891–899
Liu D, Huang Y, Nie W, Zhang J, Deng L. SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost. BMC Bioinformatics, 2021, 22(1): 219
Li J, Chen X, Huang Q, Wang Y, Xie Y, Dai Z, Zou X, Li Z. Seq-SymRF: a random forest model predicts potential miRNA-disease associations based on information of sequences and clinical symptoms. Scientific Reports, 2020, 10(1): 17901
Li J, Li Z, Nie R, You Z, Bao W. FCGCNMDA: predicting miRNA-disease associations by applying fully connected graph convolutional networks. Molecular Genetics and Genomics, 2020, 295(5): 1197–1209
Chu Y, Wang X, Dai Q, Wang Y, Wang Q, Peng S, Wei X, Qiu J, Salahub D R, Xiong Y, Wei D Q. MDA-GCNFTG: identifying miRNA-disease associations based on graph convolutional networks via graph sampling through the feature and topology graph. Briefings in Bioinformatics, 2021, 22(6): bbab165
Zhang L, Chen X, Yin J. Prediction of potential miRNA–disease associations through a novel unsupervised deep learning framework with variational autoencoder. Cells, 2019, 8(9): 1040
Dai Q, Wang Z, Liu Z, Duan X, Song J, Guo M. Predicting miRNA-disease associations using an ensemble learning framework with resampling method. Briefings in Bioinformatics, 2022, 23(1): bbab543
Zhong T, Li Z, You Z H, Nie R, Zhao H. Predicting miRNA–disease associations based on graph random propagation network and attention network. Briefings in Bioinformatics, 2022, 23(2): bbab589
Zhou S, Wang S, Wu Q, Azim R, Li W. Predicting potential miRNA-disease associations by combining gradient boosting decision tree with logistic regression. Computational Biology and Chemistry, 2020, 85: 107200
Jin C, Shi Z, Lin K, Zhang H. Predicting miRNA-disease association based on neural inductive matrix completion with graph autoencoders and self-attention mechanism. Biomolecules, 2022, 12(1): 64
Zhao Y, Chen X, Yin J. Adaptive boosting-based computational model for predicting potential miRNA-disease associations. Bioinformatics, 2019, 35(22): 4730–4738
Ding Y, Lei X, Liao B, Wu F X. Predicting miRNA-disease associations based on multi-view variational graph auto-encoder with matrix factorization. IEEE Journal of Biomedical and Health Informatics, 2022, 26(1): 446–457
Ding Y, Lei X, Liao B, Wu F X. MLRDFM: a multi-view Laplacian regularized DeepFM model for predicting miRNA-disease associations. Briefings in Bioinformatics, 2022, 23(3): bbac079
Yang Z, Ren F, Liu C, He S, Sun G, Gao Q, Yao L, Zhang Y, Miao R, Cao Y, Zhao Y, Zhong Y, Zhao H. dbDEMC: a database of differentially expressed miRNAs in human cancers. BMC Genomics, 2010, 11(S4): S5
Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, Li M, Wang G, Liu Y. miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Research, 2009, 37: D98–D104
Seijo L M, Zulueta J J. Understanding the links between lung cancer, COPD, and emphysema: a key to more effective treatment and screening. Oncology, 2017, 31(2): 93–102
Hamamoto J, Soejima K, Yoda S, Naoki K, Nakayama S, Satomi R, Terai H, Ikemura S, Sato T, Yasuda H, Hayashi Y, Sakamoto M, Takebayashi T, Betsuyaku T. Identification of microRNAs differentially expressed between lung squamous cell carcinoma and lung adenocarcinoma. Molecular Medicine Reports, 2013, 8(2): 456–462
Ciuleanu T, Brodowicz T, Zielinski C, Kim J H, Krzakowski M, Laack E, Wu Y L, Bover I, Begbie S, Tzekova V, Cucevic B, Pereira J R, Yang S H, Madhavan J, Sugarman K P, Peterson P, John W J, Krejcy K, Belani C P. Maintenance pemetrexed plus best supportive care versus placebo plus best supportive care for non-small-cell lung cancer: a randomised, double-blind, phase 3 study. The Lancet, 2009, 374(9699): 1432–1440
Schabath M B, Cote M L. Cancer progress and priorities: lung cancer. Cancer Epidemiology, Biomarkers & Prevention, 2019, 28(10): 1563–1579
Cappell M S. Pathophysiology, clinical presentation, and management of colon cancer. Gastroenterology Clinics of North America, 2008, 37(1): 1–24
Aslam M I, Taylor K, Pringle J H, Jameson J S. MicroRNAs are novel biomarkers of colorectal cancer. British Journal of Surgery, 2009, 96(7): 702–710
Yamada A, Horimatsu T, Okugawa Y, Nishida N, Honjo H, Ida H, Kou T, Kusaka T, Sasaki Y, Yagi M, Higurashi T, Yukawa N, Amanuma Y, Kikuchi O, Muto M, Ueno Y, Nakajima A, Chiba T, Boland C R, Goel A. Serum miR-21, miR-29a, and miR-125b are promising biomarkers for the early detection of colorectal neoplasia. Clinical Cancer Research, 2015, 21(18): 4234–4242
El-Serag H B, Sweet S, Winchester C C, Dent J. Update on the epidemiology of gastro-oesophageal reflux disease: a systematic review. Gut, 2014, 63(6): 871–880
Sohda M, Kuwano H. Current status and future prospects for esophageal cancer treatment. Annals of Thoracic and Cardiovascular Surgery, 2017, 23(1): 1–11
Gramantieri L, Ferracin M, Fornari F, Veronese A, Sabbioni S, Liu C G, Calin G A, Giovannini C, Ferrazzi E, Grazi G L, Croce C M, Bolondi L, Negrini M. Cyclin G1 is a target of miR-122a, a microRNA frequently down-regulated in human hepatocellular carcinoma. Cancer Research, 2007, 67(13): 6092–6099
Li Y, Qiu C, Tu J, Geng B, Yang J, Jiang T, Cui Q. HMDD v2.0: a database for experimentally supported human microRNA and disease associations. Nucleic Acids Research, 2014, 42(D1): D1070–D1074
Huang Z, Shi J, Gao Y, Cui C, Zhang S, Li J, Zhou Y, Cui Q. HMDD v3.0: a database for experimentally supported human microRNA–disease associations. Nucleic Acids Research, 2019, 47(D1): D1013–D1017
Xuan P, Han K, Guo M, Guo Y, Li J, Ding J, Liu Y, Dai Q, Li J, Teng Z, Huang Y. Prediction of microRNAs associated with human diseases based on weighted k most similar neighbors. PLoS One, 2013, 8(8): e70204
Wang D, Wang J, Lu M, Song F, Cui Q. Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases. Bioinformatics, 2010, 26(13): 1644–1650
Pasquier C, Gardès J. Prediction of miRNA-disease associations with a vector space model. Scientific Reports, 2016, 6: 27036
Zhang Z W, Gao Z, Zheng C H, Wang Y T, Qi S M. MELPMDA: a new method based on matrix enhancement and label propagation for predicting miRNA-disease association. In: Proceedings of the 17th International Conference on Intelligent Computing Theories and Application. 2021, 536–548
Xie F, Yang Z, Song J, Dai Q, Duan X. DHNLDA: a novel deep hierarchical network based method for predicting lncRNA-disease associations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19(6): 3395–3403
Dhillon A, Verma G K. Convolutional neural network: a review of models, methodologies and applications to object detection. Progress in Artificial Intelligence, 2020, 9(2): 85–112
Tang X, Luo J, Shen C, Lai Z. Multi-view multichannel attention graph convolutional network for miRNA–disease association prediction. Briefings in Bioinformatics, 2021, 22(6): bbab174
Acknowledgments
This work was supported by the National Natural Science Foundation of China (Grant Nos. 62202004, and 62322301), the Natural Science Foundation of Anhui Province (No. 2108085QF267), the University Synergy Innovation Program of Anhui Province (No. GXXT-2021-039), and the Anhui University Outstanding Youth Research Project (No. 2022AH020010).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests The authors declare that they have no competing interests or financial conflicts to disclose.
Additional information
Supplementary document Our supplementary document consists of three parts. The first part is the Supplementary Figures related to the experiment, the second part is the Supplementary Tables related to the experiment, and the third part is the Supplementary Formulas related to the experimental results and some methods. The supporting information is available online at journal.hep.com.cn and link.springer.com.
Pijing Wei received the PhD degree in computer science and technology from Anhui University, China in 2020. She is currently a lecturer in the Institute of Physical Science and Information Technology, Anhui University, China. Her main research interests include bioinformatics, synthetic biology, cancer data mining, and machine learning.
Qianqian Wang received the BS degree in science from Anhui University of Science and Technology, China in 2021. She is currently pursuing the MS degree in the School of Computer Science and Technology, Anhui University, China. Her research interests include research of bioinformatics and deep learning.
Zhen Gao received the MS degree in computer science from Qufu Normal University, China in 2021. She is currently working toward the PhD degree in the School of Computer Science and Technology, Anhui University, China. Her research interests include research of bioinformatics, deep learning and gene regulatory networks.
Ruifen Cao received the PhD degree from Hefei Institute of Physical Sciences, Chinese Academy of Sciences, China in 2009. She is currently an associate professor at the School of Computer Science and Technology, Anhui University, China. Her research interests include artificial intelligence, medical image processing, and multimodal data fusion.
Chunhou Zheng received the the PhD degree in pattern recognition and intelligent system in 2006, from University of Science and Technology of China. From February 2007 to June 2009, he worked as a Postdoctoral Fellow in the Hefei Institutes of Physical Sceience, Chinese Academy of Sciences, China. From July 2009 to July 2010, he worked as a Postdoctoral Fellow in the Department of Computing, the Hong Kong Polytechnic University, China. He is currently a Professor in the School of Artificial Intelligence, Anhui University, China. His research interests include pattern recognition, synthetic biology and bioinformatics.
Supplemental document
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
To view a copy of this licence, visit creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Wei, P., Wang, Q., Gao, Z. et al. DMFVAE: miRNA-disease associations prediction based on deep matrix factorization method with variational autoencoder. Front. Comput. Sci. 18, 186912 (2024). https://doi.org/10.1007/s11704-023-3610-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11704-023-3610-y