Stars
OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.
GPT-4o-level, real-time spoken dialogue system.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Fully open reproduction of DeepSeek-R1
🇨🇳 Chinese sticker pack, more joy / A museum of memes, the most "toxic" repo on GitHub, a grand collection of Chinese memes, gather the fun~
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
MindSpore online courses: Step into LLM
Llama Chinese community: real-time aggregation of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama LLMs, fully open source and commercially usable
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech), reproduced demo: https://lifeiteng.github.io/valle/index.html
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
KAN-TTS is a speech-synthesis training framework; please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
A timeline of the latest AI models for audio generation, starting in 2023!