research-article

IDBA-UD

Published: 01 June 2012

Abstract

Motivation: Next-generation sequencing allows us to sequence reads from a microbial environment using single-cell sequencing or metagenomic sequencing technologies. However, both technologies suffer from the problem that the sequencing depths of different regions of a genome, or of genomes from different species, are highly uneven. Most existing genome assemblers assume that sequencing depths are even. As a result, these assemblers fail to construct correct long contigs from such data.
Results: We introduce the IDBA-UD algorithm, which is based on the de Bruijn graph approach, for assembling reads from single-cell sequencing or metagenomic sequencing technologies with uneven sequencing depths. Several non-trivial techniques have been employed to tackle the problems. Instead of using a single simple threshold, we use multiple depth-relative thresholds to remove erroneous k-mers in both low-depth and high-depth regions. The technique of local assembly with paired-end information is used to solve the branch problem of low-depth short repeat regions. To speed up the process, an error correction step is conducted to correct reads of high-depth regions that can be aligned to high-confidence contigs. Comparison of the performance of IDBA-UD and existing assemblers (Velvet, Velvet-SC, SOAPdenovo and Meta-IDBA) on different datasets shows that IDBA-UD can reconstruct longer contigs with higher accuracy.
Availability: The IDBA-UD toolkit is available at our website http://www.cs.hku.hk/~alse/idba_ud
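
The idea of a depth-relative threshold mentioned in the abstract can be illustrated with a small sketch. This is not the IDBA-UD implementation: the abstract does not give the actual rule, so the function names, the single `ratio` parameter, and the "compare against the best-covered neighboring k-mer" criterion are all assumptions made for illustration. The sketch also counts forward-strand k-mers only and applies one threshold, whereas the abstract describes multiple thresholds.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer in the reads (forward strand only, for brevity)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def neighbors(kmer):
    """Yield k-mers that overlap `kmer` by k-1 bases in the de Bruijn graph."""
    for base in "ACGT":
        yield kmer[1:] + base   # possible successors
        yield base + kmer[:-1]  # possible predecessors


def filter_depth_relative(counts, ratio=0.2):
    """Hypothetical rule: drop a k-mer whose count is below `ratio` times
    the count of its best-covered neighbor; keep everything else."""
    kept = {}
    for kmer, depth in counts.items():
        local = max(
            (counts.get(n, 0) for n in neighbors(kmer) if n != kmer),
            default=0,
        )
        if depth >= ratio * local:
            kept[kmer] = depth
    return kept


# Toy data: a high-depth region (50x), the same region with one
# substitution error (4x), and an unrelated low-depth region (3x).
reads = (
    ["GGACGTGCATCC"] * 50
    + ["GGACGAGCATCC"] * 4
    + ["TTAATTGGCCAA"] * 3
)
counts = count_kmers(reads, k=4)
kept = filter_depth_relative(counts, ratio=0.2)
```

In this toy data, a fixed cutoff high enough to discard the 4x error k-mers would also discard the genuine 3x region. The relative rule drops the error k-mers that branch directly off the 50x path (`ACGA`, `AGCA`) while keeping the 3x region. Error k-mers whose only neighbors are other error k-mers (`CGAG`, `GAGC`) survive this single pass, which is one reason a practical assembler would need more than this per-k-mer check.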

Published In

Bioinformatics, Volume 28, Issue 11
June 2012
140 pages

Publisher

Oxford University Press, Inc.

United States

Qualifiers

  • Research-article
