8000 chenyi0818 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View chenyi0818's full-sized avatar

Block or report chenyi0818

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Spark-TTS Inference Code

Python 9,387 982 Updated Apr 9, 2025

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python 363 25 Updated May 13, 2025

GPT-4o-level, real-time spoken dialogue system.

Python 325 23 Updated Jan 27, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 94,968 12,175 Updated May 16, 2025

Fully open reproduction of DeepSeek-R1

Python 24,436 2,250 Updated May 16, 2025

A Survey of Spoken Dialogue Models (60 pages)

298 16 Updated Nov 28, 2024

🇨🇳 Chinese sticker pack,More joy / 表情包的博物馆, Github最有毒的仓库, 中国表情包大集合, 聚欢乐~

JavaScript 13,198 1,274 Updated Apr 20, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 13,856 1,427 Updated May 6, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,838 1,892 Updated Apr 30, 2024

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 407 42 Updated Sep 13, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,266 789 Updated Mar 15, 2025

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

271,909 21,101 Updated Oct 3, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 46,435 5,108 Updated Apr 25, 2025

MindSpore online courses: Step into LLM

Jupyter Notebook 466 115 Updated Jan 6, 2025

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

Python 14,585 1,306 Updated Apr 6, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 51,604 5,519 Updated May 12, 2025

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,143 861 Updated Jul 6, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,057 713 Updated Apr 12, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,403 28,966 Updated May 16, 2025

Inference code for Llama models

Python 58,244 9,771 Updated Jan 26, 2025

喜马拉雅xm文件解密工具

Python 446 109 Updated May 20, 2024

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

Python 228 31 Updated Jul 13, 2022

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Python 386 55 Updated Apr 21, 2022

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,130 325 Updated Nov 14, 2023

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,322 105 Updated Sep 24, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,402 4,372 Updated May 17, 2025

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Python 508 87 Updated Dec 28, 2023

A timeline of the latest AI models for audio generation, starting in 2023!

1,902 70 Updated Jan 4, 2024
Next
0