hongchengzhu

hongchengzhu hongchengzhu

M.S. at School of Cyber Science and Engineering, Wuhan University. Please contact me at hongchengz@whu.edu.cn.

3 followers · 0 following

Wuhan University
Wuhan, China
https://orcid.org/0000-0002-5339-7182

Lombard-VLD Public

Official implementation of Lombard-VLD (IEEE S&P 25)

Python 2 MIT License Updated Mar 22, 2025
Lombard-VLD-speech-examples Public

Partial speech examples of Lombard-VLD

HTML MIT License Updated Mar 18, 2025
VoxTracer Public

Official Implementation of VoxTracer (MM' 23)

Python 10 1 MIT License Updated Oct 27, 2023
ccf-deadlines Public
Forked from ccfddl/ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue MIT License Updated May 24, 2023
demo Public

Updated May 10, 2023
hongchengzhu.github.com Public

Updated Jan 12, 2023
YourTTS Public
Forked from Edresson/YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook Other Updated Dec 12, 2022
Praat_Scripts Public
Forked from feelins/Praat_Scripts

Some basic praat scripts.

Python Updated Dec 5, 2022
SRD-VC Public
Forked from YoungSeng/SRD-VC

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

Python Updated Nov 1, 2022
SpeechSplit2 Public
Forked from biggytruck/SpeechSplit2

Official implementation of SpeechSplit2

Python Updated Oct 22, 2022
audio-watermarking-traditional Public
Forked from kosta-pmf/audio-watermarking

Implementations of different audio watermarking techniques

Python MIT License Updated Oct 17, 2022
dnn-audio-watermarking Public
Forked from kosta-pmf/dnn-audio-watermarking

DNN-based audio watermarking

Python GNU General Public License v3.0 Updated Oct 13, 2022
mel_cepstral_distance Public
Forked from stefantaubert/mel-cepstral-distance

Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment" by Robert F. Kubichek.

Python MIT License Updated Oct 4, 2022
silk-v3-decoder Public
Forked from kn007/silk-v3-decoder

[Skype Silk Codec SDK]Decode silk v3 audio files (like wechat amr, aud files, qq slk files) and convert to other format (like mp3). Batch conversion support.

C MIT License Updated Sep 10, 2022
leetcode-master Public
Forked from youngyangyang04/leetcode-master

《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

Updated Sep 7, 2022
waveglow Public
Forked from NVIDIA/waveglow

A Flow-based Generative Network for Speech Synthesis

Python BSD 3-Clause "New" or "Revised" License Updated Jun 21, 2022
VF-VC Public

Voice Conversion with CVAE augmented with Flow

Python 1 MIT License Updated May 30, 2022
MOSNet Public
Forked from lochenchou/MOSNet

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Python Other Updated May 26, 2022
Real-Time-Voice-Cloning Public
Forked from CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python Other Updated May 3, 2022
s3prl Public
Forked from s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Python Apache License 2.0 Updated Mar 30, 2022
python_sound_open Public
Forked from busyyang/python_sound_open

语音信号处理试验教程，Python代码

Python Apache License 2.0 Updated Mar 18, 2022
LeetCodeAnimation Public
Forked from MisterBooo/LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation.（用动画的形式呈现解LeetCode题目的思路）

Java Updated Mar 6, 2022
NATSpeech Public template
Forked from NATSpeech/NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Python MIT License Updated Mar 4, 2022
intro_dgm Public
Forked from jmtomczak/intro_dgm

An Introduction to Deep Generative Modeling: Examples

Jupyter Notebook MIT License Updated Feb 21, 2022
MOSNet-pytorch Public
Forked from ruaruaruabick/MOSNet-pytorch

The pytorch implement of MOSNet

Python Other Updated Dec 22, 2021
nndl.github.io Public
Forked from nndl/nndl.github.io

《神经网络与深度学习》邱锡鹏著 Neural Network and Deep Learning

HTML Updated Dec 9, 2021
vits Public
Forked from jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python MIT License Updated Oct 28, 2021
FAKEBOB Public
Forked from FAKEBOB-adversarial-attack/FAKEBOB

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems"

Python BSD 2-Clause "Simplified" License Updated Oct 6, 2021
FastSpeech2 Public
Forked from ming024/FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python MIT License Updated Sep 24, 2021
Pytorch-MBNet Public
Forked from sky1456723/Pytorch-MBNet

A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

Python Updated Sep 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hongchengzhu hongchengzhu

Block or report hongchengzhu

Lombard-VLD Public

Lombard-VLD-speech-examples Public

VoxTracer Public

ccf-deadlines Public

demo Public

hongchengzhu.github.com Public

YourTTS Public

Praat_Scripts Public

SRD-VC Public

SpeechSplit2 Public

audio-watermarking-traditional Public

dnn-audio-watermarking Public

mel_cepstral_distance Public

silk-v3-decoder Public

leetcode-master Public

waveglow Public

VF-VC Public

MOSNet Public

Real-Time-Voice-Cloning Public

s3prl Public

python_sound_open Public

LeetCodeAnimation Public

NATSpeech Public template

intro_dgm Public

MOSNet-pytorch Public

nndl.github.io Public

vits Public

FAKEBOB Public

FastSpeech2 Public

Pytorch-MBNet Public