yasyune / Starred · GitHub

SoTA open-source TTS

Python 7,987 814 Updated Jun 13, 2025

Codename's rvc fork version 3, based on Applio.

Python 33 5 Updated Jun 14, 2025

Realtime AI Voice Converter for NVIDIA GPUs

Python 19 1 Updated Jun 2, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,932 1,360 Updated May 28, 2025

Streamable Text-to-Speech model using a language modeling approach, without vector quantization

Python 90 5 Updated May 20, 2025

GUI for Beatrice Voice Changer

Rust 9 1 Updated Apr 22, 2025

Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model

Python 25 6 Updated Apr 29, 2025

Interface for OuteTTS models.

Python 1,302 105 Updated May 28, 2025
Python 5,530 392 Updated May 11, 2025

Batch files for setting up environments and training DiffSinger models

Python 5 Updated Mar 22, 2025

speaker-disentangled speech linguistic content quantizer

Python 18 4 Updated Mar 19, 2025

Spark-TTS Inference Code

Python 9,775 1,028 Updated Apr 9, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 2,649 232 Updated May 29, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,737 749 Updated Mar 5, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 572 43 Updated Apr 8, 2025

GPT-4o-level, real-time spoken dialogue system.

Python 330 23 Updated Jan 27, 2025

J-Moshi: A Japanese Full-duplex Spoken Dialogue System

JavaScript 249 15 Updated Jun 4, 2025
Python 78 5 Updated Jan 22, 2025

Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models

Python 398 35 Updated Jun 13, 2025

G2P

Python 254 45 Updated Apr 30, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 3,217 351 Updated May 3, 2025

VoicePrompter: Robust Zero-Shot Voice Conversion with Voice Prompt and Conditional Flow Matching (ICASSP '25)

7 Updated Jan 11, 2025

[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion

Python 68 8 Updated May 8, 2025

Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.

Python 48 10 Updated Mar 15, 2025

A Fast TTS Engine

Python 511 38 Updated Jan 23, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,927 230 Updated Jun 10, 2025

a Frontier Japanese Speech Generation net

Jupyter Notebook 42 11 Updated May 15, 2025

under-construction

6 Updated Feb 9, 2025