Stars
Deep DNA Storage: Scalable and Robust DNA-based Storage via Coding Theory and Deep Learning
DeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences (PacBio) Circular Consensus Sequencing (CCS) data.
Error correction scheme for storing information on DNA using Reed Solomon codes
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
A novel method for sequence similarity estimation
Deep neural network to find the consensus of clustering in DNA storage system
[ISMB 2025] DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models
[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome
HSFC: Hash sketches fuzzy clustering for reliable DNA storage data reconstruction