Abstract
Hereditary traits are anticipated by the mutations in the gene sequences. Identifying a disease based on mutations is an essential and challenging task in the determination of genetic disorders such as Muscular dystrophy. Silent mutation is a single nucleotide variant does not result in changes in the encoded protein but appear in the variation of codon usage pattern that results in disease. A new ensemble learning-based computational model is proposed using the synonymous codon usage for identifying the muscular dystrophy disease. The feature vector is designed by calculating the Relative Synonymous Codon Usage (RSCU) values from the mutated gene sequences and a model is built by adopting codon usage bias pattern. This paper addresses the problem by formulating it as multi-classification trained with feature vectors of fifty-nine RSCU frequency values from the mutated gene sequences. Finally, a model is built based on ensemble learning LibD3C algorithm to recognize muscular dystrophy disease classification. Experiments showed that the accuracy of the classifier shows 90%, which proves that ensemble-based learning, is effective for predicting muscular dystrophy disease.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Lenka Fajkusova, ZdeneIk LukasIb, Miroslava Tvrdoakova a, Viera Kuhrova a, Jirioa Haajekb, Jirioa Fajkusc, Novel dystrophin mutations revealed by analysis of dystrophin mRNA: alternative splicing suppresses the phenotypic effect of a nonsense mutation, Neuromuscular Disorders Vol 11, (2001)
Kann, M.G., Advances in translational bioinformatics: computational approaches for the hunting of disease genes, Briefings in Bioinformatics 11, 96–110 (2009)
Tranchevent, L.-C., et al., A guide to web tools to prioritize candidate genes, Briefings in Bioinformatics, 12, 22–32 (2010)
KN North and KJ Jones, Diagnosing childhood muscular dystrophies, Journal of Paediatrics and Child Health
Koenig M, Hoffman EP, Bertelson CJ, Monaco AP, Feener C, Kunkel LM., Complete cloning of the Duchenne muscular dystrophy (DMD) cDNA and preliminary genomic organization of the DMD gene innormal and affected individuals, Cell 1987;50:509 ± 517
Charif D, Thioulouse J, Lobry J. R and Perrière G, Online synonymous codon usage analyses with the ade4 and seqinR packages, Bioinformatics Oxford Journal,2005,21(4):545–547
Jianmin Ma, Minh N. Nguyen, Gavyn W.L. Pang, and Jagath C. Rajapakse, Gene Classification using Codon Usage and SVMs, IEEE, 2005
C.M. Nisha, Bhasker Pant, and K. R. Pardasani, SVM model for classification of genotypes of HCV using Relative Synonymous Codon Usage Journal of Advanced Bioinformatics Applications and Research ISSN 0976-2604. Online ISSN 2278 – 6007 Vol 3, Issue 3, 2012, pp 357–363
Quan Zou, et al., An approach for identifying cytokines based on a novel Ensemble classifier, Hindawi Publishing Corporation BioMed Research International, 2013
Peter D. Stenson, Matthew Mort, Edward V. Ball, Katy Shaw, Andrew D. Phillips, David N. Cooper, The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine, July 2013
Chen, et al., LibD3C: Ensemble classifiers with a clustering and a dynamic strategy, Elseiver’s Neurocomputing 123 (2014) pp 424–435
Gulisong, et al., A Triple-Random Ensemble Classification Method for Mining Multi-label Data, IEEE 2010, 978-0-7695-4257-7/10
R. Yan, et al., Model-shared subspace boosting for multi-label classification, in: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and DataMining, ACM, 2007, pp. 834–843
Z.-H. Zhou, J. Wu, andW. Tang, Ensembling neural networks: many could be better than all, Artificial Intelligence, vol. 137, no. 1–2, pp. 239–263, 2002.
Electronic Supplementary Material (ESI) for Molecular BioSystems. The Royal Society of Chemistry 2015
Ian H. Witten, Eibe Frank, Len Trigg, Mark Hall, Geoffrey Holmes, Sally Jo Cunningham. Weka: Practical Machine Learning Tools and Techniques with Java Implementations
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
K. Sathyavikasini, M.S. Vijaya (2017). Ensemble Learning for Identifying Muscular Dystrophy Diseases Using Codon Bias Pattern. In: Satapathy, S., Bhateja, V., Udgata, S., Pattnaik, P. (eds) Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications . Advances in Intelligent Systems and Computing, vol 515. Springer, Singapore. https://doi.org/10.1007/978-981-10-3153-3_3
Download citation
DOI: https://doi.org/10.1007/978-981-10-3153-3_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3152-6
Online ISBN: 978-981-10-3153-3
eBook Packages: EngineeringEngineering (R0)