Jousheghani et al., 2024 - Google Patents
Oarfish: Enhanced probabilistic modeling leads to improved accuracy in long read transcriptome quantificationJousheghani et al., 2024
View HTML- Document ID
- 11480575381911949651
- Author
- Jousheghani Z
- Patro R
- Publication year
- Publication venue
- bioRxiv
External Links
Snippet
Results: We introduce a new method and software tool for long read transcript quantification called oarfish. Our model incorporates a novel and innovative coverage score, which affects the conditional probability of fragment assignment in the underlying probabilistic model. We …
- 238000011002 quantification 0 title abstract description 78
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/22—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for sequence comparison involving nucleotides or amino acids, e.g. homology search, motif or SNP [Single-Nucleotide Polymorphism] discovery or sequence alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/44—Arrangements for executing specific programmes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/28—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for programming tools or database systems, e.g. ontologies, heterogeneous data integration, data warehousing or computing architectures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/20—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for hybridisation or gene expression, e.g. microarrays, sequencing by hybridisation, normalisation, profiling, noise correction models, expression ratio estimation, probe design or probe optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/24—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for machine learning, data mining or biostatistics, e.g. pattern finding, knowledge discovery, rule extraction, correlation, clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/50—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/60—Software deployment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce, e.g. shopping or e-commerce
- G06Q30/02—Marketing, e.g. market research and analysis, surveying, promotions, advertising, buyer profiling, customer management or rewards; Price estimation or determination
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bray et al. | Near-optimal probabilistic RNA-seq quantification | |
Jain et al. | Long-read mapping to repetitive reference sequences using Winnowmap2 | |
Sweeney et al. | R2DT is a framework for predicting and visualising RNA secondary structure using templates | |
Steinhauser et al. | A comprehensive comparison of tools for differential ChIP-seq analysis | |
Sessegolo et al. | Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules | |
Li et al. | Comparing the normalization methods for the differential analysis of Illumina high-throughput RNA-Seq data | |
Tarazona et al. | Harmonization of quality metrics and power calculation in multi-omic studies | |
Dai et al. | NGSQC: cross-platform quality analysis pipeline for deep sequencing data | |
Tyanova et al. | The Perseus computational platform for comprehensive analysis of (prote) omics data | |
Nix et al. | Empirical methods for controlling false positives and estimating confidence in ChIP-Seq peaks | |
Hu et al. | On the detection and refinement of transcription factor binding sites using ChIP-Seq data | |
Tran et al. | A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data | |
Zhang et al. | RNA-Skim: a rapid method for RNA-Seq quantification at transcript level | |
Edwards et al. | Real time metagenomics: using k-mers to annotate metagenomes | |
Shemirani et al. | Rapid detection of identity-by-descent tracts for mega-scale datasets | |
Zhao et al. | Interpreting omics data with pathway enrichment analysis | |
Hensman et al. | Fast and accurate approximate inference of transcript expression from RNA-seq data | |
Ahmed et al. | Identifying A-and P-site locations on ribosome-protected mRNA fragments using Integer Programming | |
Keleş | Mixture modeling for genome-wide localization of transcription factors | |
Yalamanchili et al. | DDGni: dynamic delay gene-network inference from high-temporal data using gapped local alignment | |
Okoniewski et al. | Comprehensive analysis of affymetrix exon arrays using BioConductor | |
Sarkar et al. | Minnow: a principled framework for rapid simulation of dscRNA-seq data at the read level | |
Zhang et al. | Reference panel guided topological structure annotation of Hi-C data | |
Jousheghani et al. | Oarfish: Enhanced probabilistic modeling leads to improved accuracy in long read transcriptome quantification | |
Belka et al. | LVQ-KNN: Composition-based DNA/RNA binning of short nucleotide sequences utilizing a prototype-based k-nearest neighbor approach |