Kannan et al., 2016 - Google Patents

Shannon: an information-optimal de novo RNA-Seq assembler

Kannan et al., 2016

Document ID: 3399018553271010819
Author: Kannan S; Hui J; Mazooji K; Pachter L; Tse D
Publication year: 2016
Publication venue: BioRxiv

External Links

Cited by

Snippet

De novo assembly of short RNA-Seq reads into transcripts is challenging due to sequence similarities in transcriptomes arising from gene duplications and alternative splicing of transcripts. We present Shannon, an RNA-Seq assembler with an optimality guarantee …

Continue reading at www.biorxiv.org (PDF) (other versions)

229920001186 RNA-Seq 0 title abstract description 27

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/22—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for sequence comparison involving nucleotides or amino acids, e.g. homology search, motif or SNP [Single-Nucleotide Polymorphism] discovery or sequence alignment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/24—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for machine learning, data mining or biostatistics, e.g. pattern finding, knowledge discovery, rule extraction, correlation, clustering or classification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/18—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for functional genomics or proteomics, e.g. genotype-phenotype associations, linkage disequilibrium, population genetics, binding site identification, mutagenesis, genotyping or genome annotation, protein-protein interactions or protein-nucleic acid interactions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/20—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for hybridisation or gene expression, e.g. microarrays, sequencing by hybridisation, normalisation, profiling, noise correction models, expression ratio estimation, probe design or probe optimisation
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N27/00—Investigating or analysing materials by the use of electric, electro-chemical, or magnetic means
- G01N27/26—Investigating or analysing materials by the use of electric, electro-chemical, or magnetic means by investigating electrochemical variables; by using electrolysis or electrophoresis
- G01N27/416—Systems
- G01N27/447—Systems using electrophoresis
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by the preceding groups
- G01N33/48—Investigating or analysing materials by specific methods not covered by the preceding groups biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing

Similar Documents

Publication	Publication Date	Title
Kannan et al.	2016	Shannon: an information-optimal de novo RNA-Seq assembler
Haas et al.	2017	STAR-Fusion: fast and accurate fusion transcript detection from RNA-Seq
Jones et al.	2015	MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins
Ye et al.	2016	DBG2OLC: efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies
Iyer et al.	2015	The landscape of long noncoding RNAs in the human transcriptome
Ghandi et al.	2014	Enhanced regulatory sequence prediction using gapped k-mer features
Trigg et al.	2011	Multicoil2: predicting coiled coils and their oligomerization states from sequence in the twilight zone
Chung et al.	2011	Discovering transcription factor binding sites in highly repetitive regions of genomes with multi-read analysis of ChIP-Seq data
Kamal et al.	2017	De-Bruijn graph with MapReduce framework towards metagenomic data classification
Lin et al.	2012	CLIIQ: Accurate comparative detection and quantification of expressed isoforms in a population
Savino et al.	2020	Differential co-expression analyses allow the identification of critical signalling pathways altered during tumour transformation and progression
Futschik et al.	2014	Multiscale DNA partitioning: statistical evidence for segments
Goussarov et al.	2022	Introduction to the principles and methods underlying the recovery of metagenome‐assembled genomes from metagenomic data
Wang et al.	2012	A de Bruijn graph approach to the quantification of closely-related genomes in a microbial community
Kritikos et al.	2011	Noise reduction in protein-protein interaction graphs by the implementation of a novel weighting scheme
Rautiainen et al.	2020	AERON: Transcript quantification and gene-fusion detection using long reads
Li et al.	2014	Bayesian model of protein primary sequence for secondary structure prediction
Dufault‐Thompson et al.	2022	Applications of de Bruijn graphs in microbiome research
Bruford et al.	2015	Devising a consensus framework for validation of novel human coding loci
Zhang et al.	2021	Biobank-scale inference of ancestral recombination graphs enables genealogy-based mixed model association of complex traits
Wajid et al.	2016	The A, C, G, and T of genome assembly
Bhattacharya et al.	2016	FRAGSION: ultra-fast protein fragment library generation by IOHMM sampling
Vasimuddin et al.	2018	Identification of significant computational building blocks through comprehensive investigation of NGS secondary analysis methods
White III et al.	2017	MerCat: a versatile k-mer counter and diversity estimator for database-independent property analysis obtained from metagenomic and/or metatranscriptomic sequencing data
González et al.	2020	VTAM: A robust pipeline for validating metabarcoding data using internal controls