A Survey of Software and Hardware Approaches to Performing Read Alignment in Next Generation Sequencing
Computational genomics is an emerging field that is enabling us to reveal the origins of life and the genetic basis of diseases such as cancer. Next Generation Sequencing NGS technologies have unleashed a wealth of genomic information by producing ...
Batch Mode TD$\lambda$ for Controlling Partially Observable Gene Regulatory Networks
External control of gene regulatory networks GRNs has received much attention in recent years. The aim is to find a series of actions to apply to a gene regulation system making it avoid its diseased states. In this work, we propose a novel method for ...
Benchmark Dataset for Whole Genome Sequence Compression
The research in DNA data compression lacks a standard dataset to test out compression tools specific to DNA. This paper argues that the current state of achievement in DNA compression is unable to be benchmarked in the absence of such scientifically ...
Copy Number Variations Detection: Unravelling the Problem in Tangible Aspects
In the midst of the important genomic variants associated to the susceptibility and resistance to complex diseases, Copy Number Variations CNV has emerged as a prevalent class of structural variation. Following the flood of next-generation sequencing ...
Data Management for Heterogeneous Genomic Datasets
Next Generation Sequencing NGS, a family of technologies for reading DNA and RNA, is changing biological research, and will soon change medical practice, by quickly providing sequencing data and high-level features of numerous individual genomes in ...
Detecting Pairwise Interactive Effects of Continuous Random Variables for Biomarker Identification with Small Sample Size
Aberrant changes to interactions among cellular components have been conjectured to be potential causes of abnormalities in cellular functions. By systematic analysis of high-throughput-omics data, researchers hope to detect potential associations among ...
Effect of Aggregation Operators on Network-Based Disease Gene Prioritization: A Case Study on Blood Disorders
Owing to the innate noise in the biological data sources, a single source or a single measure do not suffice for an effective disease gene prioritization. So, the integration of multiple data sources or aggregation of multiple measures is the need of ...
Enhancing Protein Conformational Space Sampling Using Distance Profile-Guided Differential Evolution
De novo protein structure prediction aims to search for low-energy conformations as it follows the thermodynamics hypothesis that places native conformations at the global minimum of the protein energy surface. However, the native conformation is not ...
Extending the Applicability of Graphlets to Directed Networks
With recent advances in high-throughput cell biology, the amount of cellular biological data has grown drastically. Such data is often modeled as graphs also called networks and studying them can lead to new insights into molecule-level organization. A ...
High Class-Imbalance in pre-miRNA Prediction: A Novel Approach Based on deepSOM
The computational prediction of novel microRNA within a full genome involves identifying sequences having the highest chance of being a miRNA precursor pre-miRNA. These sequences are usually named candidates to miRNA. The well-known pre-miRNAs are ...
Improving Biochemical Named Entity Recognition Using PSO Classifier Selection and Bayesian Combination Methods
Named Entity Recognition NER is a basic step for large number of consequent text mining tasks in the biochemical domain. Increasing the performance of such recognition systems is of high importance and always poses a challenge. In this study, a new ...
ML-Space: Hybrid Spatial Gillespie and Particle Simulation of Multi-Level Rule-Based Models in Cell Biology
Spatio-temporal dynamics of cellular processes can be simulated at different levels of detail, from deterministic partial differential equations via the spatial Stochastic Simulation algorithm to tracking Brownian trajectories of individual particles. ...
Multi-Block Bipartite Graph for Integrative Genomic Analysis
Human diseases involve a sequence of complex interactions between multiple biological processes. In particular, multiple genomic data such as Single Nucleotide Polymorphism SNP, Copy Number Variation CNV, DNA Methylation DM, and their interactions ...
Normalizing Kernels in the Billera-Holmes-Vogtmann Treespace
As costs of genome sequencing have dropped precipitously, development of efficient bioinformatic methods to analyze genome structure and evolution have become ever more urgent. For example, most published phylogenomic studies involve either massive ...
Novel Methods for Microglia Segmentation, Feature Extraction, and Classification
- Yuchun Ding,
- Marie Christine Pardon,
- Alessandra Agostini,
- Henryk Faas,
- Jinming Duan,
- Wil O C Ward,
- Felicity Easton,
- Dorothee Auer,
- Li Bai
Segmentation and analysis of histological images provides a valuable tool to gain insight into the biology and function of microglial cells in health and disease. Common image segmentation methods are not suitable for inhomogeneous histology image ...
Pluribus—Exploring the Limits of Error Correction Using a Suffix Tree
Next generation sequencing technologies enable efficient and cost-effective genome sequencing. However, sequencing errors increase the complexity of the de novo assembly process, and reduce the quality of the assembled sequences. Many error correction ...
Predicting Protein-DNA Binding Residues by Weightedly Combining Sequence-Based Features and Boosting Multiple SVMs
Protein-DNA interactions are ubiquitous in a wide variety of biological processes. Correctly locating DNA-binding residues solely from protein sequences is an important but challenging task for protein function annotations and drug discovery, especially ...
Protein Inference from the Integration of Tandem MS Data and Interactome Networks
Since proteins are digested into a mixture of peptides in the preprocessing step of tandem mass spectrometry MS, it is difficult to determine which specific protein a shared peptide belongs to. In recent studies, besides tandem MS data and peptide ...
Reframed Genome-Scale Metabolic Model to Facilitate Genetic Design and Integration with Expression Data
Genome-scale metabolic network models GEMs have played important roles in the design of genetically engineered strains and helped biologists to decipher metabolism. However, due to the complex gene-reaction relationships that exist in model systems, ...
Significance and Functional Similarity for Identification of Disease Genes
One of the most significant research issues in functional genomics is insilico identification of disease related genes. In this regard, the paper presents a new gene selection algorithm, termed as SiFS, for identification of disease genes. It integrates ...
Strategies for Comparing Metabolic Profiles: Implications for the Inference of Biochemical Mechanisms from Metabolomics Data
Background: Large amounts of metabolomics data have been accumulated in recent years and await analysis. Previously, we had developed a systems biology approach to infer biochemical mechanisms underlying metabolic alterations observed in cancers and ...
Triangular Alignment TAME: A Tensor-Based Approach for Higher-Order Network Alignment
Network alignment has extensive applications in comparative interactomics. Traditional approaches aim to simultaneously maximize the number of conserved edges and the underlying similarity of aligned entities. We propose a novel formulation of the ...
Unsupervised Binning of Metagenomic Assembled Contigs Using Improved Fuzzy C-Means Method
Metagenomic contigs binning is a necessary step of metagenome analysis. After assembly, the number of contigs belonging to different genomes is usually unequal. So a metagenomic contigs dataset is a kind of imbalanced dataset and traditional fuzzy c-...
Collective Prediction of Disease-Associated miRNAs Based on Transduction Learning
The discovery of human disease-related miRNA is a challenging problem for complex disease biology research. For existing computational methods, it is difficult to achieve excellent performance with sparse known miRNA-disease association verified by ...
Modeling and Identification of Amnioserosa Cell Mechanical Behavior by Using Mass-Spring Lattices
Various mechanical models of live amnioserosa cells during Drosophila melanogaster’s dorsal closure are proposed. Such models account for specific biomechanical oscillating behaviors and depend on a different set of parameters. The identification of the ...
Soft Ngram Representation and Modeling for Protein Remote Homology Detection
Remote homology detection represents a central problem in bioinformatics, where the challenge is to detect functionally related proteins when their sequence similarity is low. Recent solutions employ representations derived from the sequence profile, ...
Subjects
Currently Not Available