Stars
🔊 Text-Prompted Generative Audio Model
A collection of resources and papers on Diffusion Models
Text Normalization & Inverse Text Normalization
Muzic: Music Understanding and Generation with Artificial Intelligence
A light-weight Python library for computing Kaldi-style acoustic features based on NumPy
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Yeongtae / tacotron2
Forked from NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
A TensorFlow implementation of DeepMind's WaveNet paper
hlp2819 / darts-clone
Forked from s-yata/darts-clone
A clone of Darts (Double-ARray Trie System)
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
DeepMind's Tacotron-2 TensorFlow implementation
This is now the official location of the Merlin project.
pineking / openBliSSART
Forked from openBliSSART/openBliSSART
Blind Source Separation for Audio Recognition Tasks
Chinese Keras documentation with more examples, explanations, and tips.