[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Metagenomic Phylogenetic Classification Using Improved Naïve Bayes

  • Conference paper
Intelligent Computing in Bioinformatics (ICIC 2014)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 8590))

Included in the following conference series:

  • 3492 Accesses

Abstract

The phylogenetic classification of DNA fragments from a variety of microorganisms is often performed in metagenomic analysis to understand the taxonomic composition of microbial communities. A faster method for taxonomic classification based on metagenomic reads is required with the improvement of DNA sequencer’s throughput in recent years. In this research we focus on naïve Bayes, which can quickly classify organisms with sufficient accuracy, and we have developed an acceptably fast, yet more accurate classification method using improved naïve Bayes, Weightily Averaged One-Dependence Estimators (WAODE). Additionally, we accelerated WAODE classification by introducing a cutoff for the mutual information content, and achieved a 20 times faster classification speed while keeping comparable prediction accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 35.99
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 44.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Daniel, R.: The Soil Metagenome – A Rich Resource for The Discovery of Novel Natural Products. Current Opinion in Biotechnology 15, 199–204 (2004)

    Article  Google Scholar 

  2. Tyson, G.W., Chapman, J., Hugenholtz, P., Allen, E.E., Raml, R.J., et al.: Community Structure and Metabolism Through Reconstruction of Microbial Genomes From the Environment. Nature 428, 37–43 (2004)

    Article  Google Scholar 

  3. Qin, J., Li, R., Raes, J., Arumugam, M., Burgdorf, K.S., et al.: A Human Gut Microbial Gene Catalogue Established by Metagenomic Sequencing. Nature 464, 59–65 (2010)

    Article  Google Scholar 

  4. Mchardy, A.C., Martin, H.G., et al.: Accurate Phylogenetic Classification of Variable-Length DNA Fragments. Nature Methods 4(1), 63–72 (2007)

    Article  Google Scholar 

  5. Rosen, G., Garbarine, E., Caseiro, D., Polikar, R., Sokhansanj, B.: Metagenome Fragment Classification Using N-Mer Frequency Profiles. Advances in Bioinformatics 2008(20), 59–69 (2008)

    Google Scholar 

  6. Brady, A., Salzberg, S.L., et al.: Metagenomic Phylogenetic Classification with Interpolated Markov Models. Nature Methods 6(9), 673–676 (2009)

    Article  Google Scholar 

  7. Diaz, N.N., Krause, L., Goesmann, A., Niehaus, K., Nattkemper, T.W.: TACOA – Taxonomic Classification of Environmental Genomic Fragments Using a Kernelized Nearest Neighbor Approach. BMC Bioinformatics 10, 56 (2009)

    Article  Google Scholar 

  8. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic Local Alignment Search Tool. Journal of Molecular Biology 215(3), 403–410 (1990)

    Article  Google Scholar 

  9. Huson, D.H., Auch, A.F., Qi, J., Schuster, S.C.: MEGAN Analysis of Metagenomic Data. Genome Research 17(3), 377–386 (2007)

    Article  Google Scholar 

  10. Suzuki, S., Ishida, T., Kurokawa, K., Akiyama, Y.: GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics. Plos One 7(5), E36060 (2012)

    Google Scholar 

  11. Zhao, Y., Tang, H., Ye, Y.: Rapsearch2: A Fast And Memory-Efficient Protein Similarity Search Tool for Next-Generation Sequencing Data. Bioinformatics 23(1), 125–126 (2012)

    Article  Google Scholar 

  12. Jiang, L., Zhang, H.: Weightily Averaged One-Dependence Estimators. In: Yang, Q., Webb, G. (eds.) PRICAI 2006. LNCS (LNAI), vol. 4099, pp. 970–974. Springer, Heidelberg (2006)

    Google Scholar 

  13. Koc, L., Mazzuchi, T.A., Sarkani, S.: A Network Intrusion Detection System Based on a Hidden Naive Bayes Multiclass Classifier. Expert Systems with Applications 39(18), 13492–13500 (2012)

    Article  Google Scholar 

  14. Chang, C.-C., Lin, C.-J.: LIBSVM: A Library for Support Vector Machines. ACM Transactions on Intelligent Systems and Technology 2(27), 1–27 (2011)

    Article  Google Scholar 

  15. Richter, D.C., Ott, F., Auch, A.F., Schmid, R., Huson, D.H.: Metasim - A Sequence Simulator for Genomics and Metagenomics. Plos One 3(10), P.E3373 (2008)

    Google Scholar 

  16. Zweig, M.H., Campbell, G.: Receiver-Operating Characteristic (ROC) Plots: a Fundamental Evaluation Tool in Clinical Medicine. Clinical Chemistry 39, 561–577 (1993)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Komatsu, Y., Ishida, T., Akiyama, Y. (2014). Metagenomic Phylogenetic Classification Using Improved Naïve Bayes. In: Huang, DS., Han, K., Gromiha, M. (eds) Intelligent Computing in Bioinformatics. ICIC 2014. Lecture Notes in Computer Science(), vol 8590. Springer, Cham. https://doi.org/10.1007/978-3-319-09330-7_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09330-7_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09329-1

  • Online ISBN: 978-3-319-09330-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics