hongchengzhu (hongchengzhu) / Repositories · GitHub
  • Lombard-VLD Public

    Official implementation of Lombard-VLD (IEEE S&P 25)

    Python 2 MIT License Updated Mar 22, 2025
  • Partial speech examples of Lombard-VLD

    HTML MIT License Updated Mar 18, 2025
  • VoxTracer Public

    Official implementation of VoxTracer (MM '23)

    Python 10 1 MIT License Updated Oct 27, 2023
  • ⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet). If you find it useful, please star this project. Thanks!

    Vue MIT License Updated May 24, 2023
  • demo Public

    Updated May 10, 2023
  • Updated Jan 12, 2023
  • YourTTS Public

    Forked from Edresson/YourTTS

    YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

    Jupyter Notebook Other Updated Dec 12, 2022
  • Some basic praat scripts.

    Python Updated Dec 5, 2022
  • SRD-VC Public

    Forked from YoungSeng/SRD-VC

    Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

    Python Updated Nov 1, 2022
  • Official implementation of SpeechSplit2

    Python Updated Oct 22, 2022
  • Implementations of different audio watermarking techniques

    Python MIT License Updated Oct 17, 2022
  • DNN-based audio watermarking

    Python GNU General Public License v3.0 Updated Oct 13, 2022
  • Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment" by Robert F. Kubichek.

    Python MIT License Updated Oct 4, 2022
  • [Skype Silk Codec SDK] Decode Silk v3 audio files (such as WeChat amr and aud files, or QQ slk files) and convert them to other formats (such as mp3). Supports batch conversion.

    C MIT License Updated Sep 10, 2022
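  • 《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so learning algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀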

    Updated Sep 7, 2022
  • waveglow Public

    Forked from NVIDIA/waveglow

    A Flow-based Generative Network for Speech Synthesis

    Python BSD 3-Clause "New" or "Revised" License Updated Jun 21, 2022
  • VF-VC Public

    Voice Conversion with CVAE augmented with Flow

    Python 1 MIT License Updated May 30, 2022
  • MOSNet Public

    Forked from lochenchou/MOSNet

    Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

    Python Other Updated May 26, 2022
  • Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Python Other Updated May 3, 2022
  • s3prl Public

    Forked from s3prl/s3prl

    Self-Supervised Speech Pre-training and Representation Learning Toolkit.

    Python Apache License 2.0 Updated Mar 30, 2022
  • Speech signal processing lab tutorial, with Python code

    Python Apache License 2.0 Updated Mar 18, 2022
  • Demonstrates the approach to solving every LeetCode question in the form of animations.

    Java Updated Mar 6, 2022
  • NATSpeech Public template

    Forked from NATSpeech/NATSpeech

    A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

    Python MIT License Updated Mar 4, 2022
  • intro_dgm Public

    Forked from jmtomczak/intro_dgm

    An Introduction to Deep Generative Modeling: Examples

    Jupyter Notebook MIT License Updated Feb 21, 2022
  • A PyTorch implementation of MOSNet

    Python Other Updated Dec 22, 2021
  • 《神经网络与深度学习》 (Neural Network and Deep Learning) by Xipeng Qiu

    HTML Updated Dec 9, 2021
  • vits Public

    Forked from jaywalnut310/vits

    VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

    Python MIT License Updated Oct 28, 2021
  • Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems"

    Python BSD 2-Clause "Simplified" License Updated Oct 6, 2021
  • FastSpeech2 Public

    Forked from ming024/FastSpeech2

    An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

    Python MIT License Updated Sep 24, 2021
  • A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network

    Python Updated Sep 24, 2021