Starred repositories
Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMIR25!)
The latent diffusion model for text-to-music generation.
Beat annotations for the beat tracker Beat This!
Audio Plugin for Audio to MIDI transcription using deep learning.
Muon: An optimizer for hidden layers in neural networks
PyTorch native quantization and sparsity for training and inference
Implementation of the Aurora model for Earth system forecasting
Efficient Training of Audio Transformers with Patchout
Master programming by recreating your favorite technologies from scratch.
An open-source 3DS emulator project based on Citra.
🪐 Markdown with superpowers — from ideas to presentations, articles and books.
Collection of self-contained header-only libraries for C++17
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.
Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
Dora is an experiment management framework. It expresses grid searches as pure Python files as part of your repo. It identifies experiments with a unique hash signature. Scale up to hundreds of exp…
SYSTEM PROMPT TRANSPARENCY FOR ALL - CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, WINDSURF, DEVIN, REPLIT, AND MORE!
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling