-
Wuhan University
- Wuhan, China
- https://orcid.org/0000-0002-5339-7182
-
Lombard-VLD Public
Official implementation of Lombard-VLD (IEEE S&P 25)
-
Lombard-VLD-speech-examples Public
Partial speech examples of Lombard-VLD
HTML MIT License Updated Mar 18, 2025 -
VoxTracer Public
Official implementation of VoxTracer (MM '23)
-
ccf-deadlines Public
Forked from ccfddl/ccf-deadlines
⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet) / If you find it useful, please star this project, thanks~
Vue MIT License Updated May 24, 2023 -
YourTTS Public
Forked from Edresson/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Jupyter Notebook Other Updated Dec 12, 2022 -
Praat_Scripts Public
Forked from feelins/Praat_Scripts
Some basic Praat scripts.
Python Updated Dec 5, 2022 -
SRD-VC Public
Forked from YoungSeng/SRD-VC
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
Python Updated Nov 1, 2022 -
SpeechSplit2 Public
Forked from biggytruck/SpeechSplit2
Official implementation of SpeechSplit2
Python Updated Oct 22, 2022 -
audio-watermarking-traditional Public
Forked from kosta-pmf/audio-watermarking
Implementations of different audio watermarking techniques
Python MIT License Updated Oct 17, 2022 -
dnn-audio-watermarking Public
Forked from kosta-pmf/dnn-audio-watermarking
DNN-based audio watermarking
Python GNU General Public License v3.0 Updated Oct 13, 2022 -
mel_cepstral_distance Public
Forked from stefantaubert/mel-cepstral-distance
Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment" by Robert F. Kubichek.
Python MIT License Updated Oct 4, 2022 -
silk-v3-decoder Public
Forked from kn007/silk-v3-decoder
[Skype Silk Codec SDK] Decodes Silk v3 audio files (such as WeChat amr and aud files, and QQ slk files) and converts them to other formats (such as mp3). Supports batch conversion.
C MIT License Updated Sep 10, 2022 -
leetcode-master Public
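Forked from youngyangyang04/leetcode-master
《代码随想录》 LeetCode practice guide: a recommended order for working through 200 classic problems, 600,000 characters of detailed illustrated explanations, video breakdowns of the difficult points, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer bewildering! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀
Updated Sep 7, 2022 -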
waveglow Public
Forked from NVIDIA/waveglow
A Flow-based Generative Network for Speech Synthesis
Python BSD 3-Clause "New" or "Revised" License Updated Jun 21, 2022 -
VF-VC Public
Voice Conversion with CVAE augmented with Flow
-
MOSNet Public
Forked from lochenchou/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
Python Other Updated May 26, 2022 -
Real-Time-Voice-Cloning Public
Forked from CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Python Other Updated May 3, 2022 -
s3prl Public
Forked from s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Python Apache License 2.0 Updated Mar 30, 2022 -
python_sound_open Public
Forked from busyyang/python_sound_open
Speech signal processing experiment tutorial, with Python code
Python Apache License 2.0 Updated Mar 18, 2022 -
LeetCodeAnimation Public
Forked from MisterBooo/LeetCodeAnimation
Demonstrates all the questions on LeetCode in the form of animation, presenting the reasoning behind each solution.
Java Updated Mar 6, 2022 -
NATSpeech Public template
Forked from NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Python MIT License Updated Mar 4, 2022 -
intro_dgm Public
Forked from jmtomczak/intro_dgm
An Introduction to Deep Generative Modeling: Examples
Jupyter Notebook MIT License Updated Feb 21, 2022 -
MOSNet-pytorch Public
Forked from ruaruaruabick/MOSNet-pytorch
A PyTorch implementation of MOSNet
Python Other Updated Dec 22, 2021 -
nndl.github.io Public
Forked from nndl/nndl.github.io
Neural Network and Deep Learning (《神经网络与深度学习》), by Xipeng Qiu (邱锡鹏)
HTML Updated Dec 9, 2021 -
vits Public
Forked from jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Python MIT License Updated Oct 28, 2021 -
FAKEBOB Public
Forked from FAKEBOB-adversarial-attack/FAKEBOB
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems"
Python BSD 2-Clause "Simplified" License Updated Oct 6, 2021 -
FastSpeech2 Public
Forked from ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Python MIT License Updated Sep 24, 2021 -
Pytorch-MBNet Public
Forked from sky1456723/Pytorch-MBNet
A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network
Python Updated Sep 24, 2021