Stars
paarthneekhara / NeMo
Forked from NVIDIA/NeMoNeMo: a toolkit for conversational AI
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. All in a modern, AI-native editor.
PAM is a no-reference audio quality metric for audio generation tasks
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
A concise but complete full-attention transformer with a set of promising experimental features from various papers
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients
Deep Learning book the covers the principles of deep learning, motivation, explanations, state of the art papers for the various tasks and architectures: CNNs, object detection, semantic segmentati…
Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
A curated list of awesome audio technology resources for developers
A Flow-based Generative Network for Speech Synthesis
Free monospaced font with programming ligatures
Code for NeurIPS 2019 paper Emergence of Object Segmentation in Perturbed Generative Models
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more