Abstract
Recent outbreaks of highly pathogenic influenza have highlighted the need to develop novel anti-influenza therapeutics. Neuraminidase has become the most important target for the treatment of influenza virus. In this study, classification models were developed from a large training dataset containing 457 neuraminidase inhibitors and 358 non-inhibitors using random forest and support vector machine algorithms. Recursive feature elimination (RFE) method was used to improve the accuracy of the models by selecting the most relevant molecular descriptors. The performances of the models were evaluated by five-fold cross-validation and independent validation. The accuracies of all the models are over 86% in both validation methods. This work suggests machine learning algorithms combined with RFE method can be used to build useful models for predicting influenza neuraminidase inhibitors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hurt, A.C.: The epidemiology and spread of drug resistant human influenza viruses. Curr. Opin. Virol. 8, 22–29 (2014)
Gao, R., Cao, B., Hu, Y., Feng, Z., Wang, D., Hu, W., Chen, J., Jie, Z., Qiu, H., Xu, K.: Human infection with a novel avian-origin influenza A (H7N9) virus. N. Engl. J. Med. 368, 1888–1897 (2013)
Matrosovich, M.N., Matrosovich, T.Y., Gray, T., Roberts, N.A., Klenk, H.-D.: Neuraminidase is important for the initiation of influenza virus infection in human airway epithelium. J. Virol. 78, 12665–12667 (2004)
Ai, H., Zhang, L., Chang, A.K., Wei, H., Che, Y., Liu, H.: Virtual screening of potential inhibitors from TCM for the CPSF30 binding site on the NS1A protein of influenza A virus. J. Mol. Model. 20, 2142 (2014)
Ai, H., Zheng, F., Deng, F., Zhu, C., Gu, Y., Zhang, L., Li, X., Chang, A.K., Zhao, J., Zhu, J.: Structure-based virtual screening for potential inhibitors of influenza A virus RNA polymerase PA subunit. Int. J. Pept. Res. Ther. 21, 149–156 (2015)
Batool, S., Mushtaq, G., Kamal, W., Kamal, M.A.: Pharmacophore-based virtual screening for identification of novel neuraminidase inhibitors and verification of inhibitory activity by molecular docking. Med. Chem. 12, 63–73 (2016)
Cong, Y., Li, B.-K., Yang, X.-G., Xue, Y., Chen, Y.-Z., Zeng, Y.: Quantitative structure–activity relationship study of influenza virus neuraminidase A/PR/8/34 (H1N1) inhibitors by genetic algorithm feature selection and support vector regression. Chemometr. Intell. Lab. 127, 35–42 (2013)
Lian, W., Fang, J., Li, C., Pang, X., Liu, A.-L., Du, G.-H.: Discovery of Influenza A virus neuraminidase inhibitors using support vector machine and Naïve Bayesian models. Mol. Divers. 20, 439–451 (2016)
Li, Y., Kong, Y., Zhang, M., Yan, A., Liu, Z.: Using support vector machine (SVM) for classification of selectivity of H1N1 neuraminidase inhibitors. Mol. Inform. 35, 116–124 (2016)
Tao, P., Liu, T., Li, X., Chen, L.: Prediction of protein structural class using tri-gram probabilities of position-specific scoring matrix and recursive feature elimination. Amino Acids 47, 461–468 (2015)
Gilson, M.K., Liu, T., Baitaluk, M., Nicola, G., Hwang, L., Chong, J.: BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res. 44, D1045–D1053 (2016)
Li, B.-K., Cong, Y., Yang, X.-G., Xue, Y., Chen, Y.-Z.: In silico prediction of spleen tyrosine kinase inhibitors using machine learning approaches and an optimized molecular descriptor subset generated by recursive feature elimination method. Comput. Biol. Med. 43, 395–404 (2013)
Yap, C.W.: PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J. Comput. Chem. 32, 1466–1474 (2011)
Kuhn, M.: Caret package. J. Stat. Softw. 28, 1–26 (2008)
Chen, X., Yan, C.C., Zhang, X., Zhang, X., Dai, F., Yin, J., Zhang, Y.: Drug–target interaction prediction: databases, web servers and computational models. Brief. Bioinform. 17, 696–712 (2016)
Chen, X., Ren, B., Chen, M., Wang, Q., Zhang, L., Yan, G.: NLLSS: predicting synergistic drug combinations based on semi-supervised learning. PLoS Comput. Biol. 12, e1004975 (2016)
Chen, X., Yan, C.C., Zhang, X., You, Z.-H.: Long non-coding RNAs and complex diseases: from experimental results to computational models. Brief. Bioinform. bbw060 (2016). doi:10.1093/bib/bbw060
Chen, W., Feng, P., Yang, H., Ding, H., Lin, H., Chou, K.-C.: iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences. Oncotarget 8, 4208–4217 (2017)
Chen, W., Tang, H., Ye, J., Lin, H., Chou, K.-C.: iRNA-PseU: identifying RNA pseudouridine sites. Mol. Ther. Nucleic Acids 5, e332 (2016)
Chen, X., Huang, Y.-A., You, Z.-H., Yan, G.-Y., Wang, X.-S.: A novel approach based on KATZ measure to predict associations of human microbiota with non-infectious diseases. Bioinformatics 33, 733–739 (2017)
Huang, Z.-A., Chen, X., Zhu, Z., Liu, H., Yan, G.-Y., You, Z.-H., Wen, Z.: PBHMDA: Path-based human microbe-disease association prediction. Front. Microbiol. 8, 233 (2017)
Chen, X., Huang, Y.-A., Wang, X.-S., You, Z.-H., Chan, K.: FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model. Oncotarget 7, 45948–45958 (2016)
Chen, X., You, Z., Yan, G., Gong, D.: IRWRLDA: improved random walk with restart for lncRNA-disease association prediction. Oncotarget 7, 57919–57931 (2016)
Chen, W., Ding, H., Feng, P., Lin, H., Chou, K.-C.: iACP: a sequence-based tool for identifying anticancer peptides. Oncotarget 7, 16895 (2016)
Acknowledgments
This work was supported by the National Natural Science Foundation of China (No: 31570160), Innovation Team Project (No: LT2015011) from Education Department of Liaoning Province, Large-scale Equipment Shared Services Project (No: F15165400) and Applied Basic Research Project (No: F16205151) from Science and Technology Bureau of Shenyang. This project was supported by Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zhang, L. et al. (2017). Computational Prediction of Influenza Neuraminidase Inhibitors Using Machine Learning Algorithms and Recursive Feature Elimination Method. In: Cai, Z., Daescu, O., Li, M. (eds) Bioinformatics Research and Applications. ISBRA 2017. Lecture Notes in Computer Science(), vol 10330. Springer, Cham. https://doi.org/10.1007/978-3-319-59575-7_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-59575-7_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59574-0
Online ISBN: 978-3-319-59575-7
eBook Packages: Computer ScienceComputer Science (R0)