Stars
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
This is a repo for storing the code to solve AoC 2022 questions
Robust Speech Recognition via Large-Scale Weak Supervision
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
5weggeman / hifi-gan-laughnet
Forked from jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
Akella17 / speaker-embedding
Forked from andabi/voice-vector
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
End to End Dialect Identification using Convolutional Neural Network
Keras code and weights files for popular deep learning models.
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Tensorflow Implementation of the paper [Neural Discrete Representation Learning](https://arxiv.org/abs/1711.00937) (VQ-VAE).
Convolutional nets which can take molecular graphs of arbitrary size as input.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Speech Recognition using DeepSpeech2.
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure