aixingxy

xingxy aixingxy

TTS

10 followers · 132 following

Starred repositories

Eddie-Wang1120 / Professional-CUDA-C-Programming-Code-and-Notes

CUDA C 编程权威指南代码实现包含了书上第二章到第八章的大部分代码实现和作者笔记，全由作者本人手动实现，难免有错误的地方，请大家谨慎参考，非常欢迎对错误的指正。如果有帮助的话请Star一下，对作者帮助很大，谢谢！

Cuda 343 24 Updated Oct 20, 2022

DaiYvhang / AISHELL-5

In-car multi-channel speech transcription system of AISHELL-5.

Python 22 1 Updated Jun 9, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,073 686 Updated Jun 14, 2025

taresh18 / TTSizer

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨

Python 83 11 Updated May 20, 2025

thuhcsi / SpeechCraft

The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.

Python 150 5 Updated Apr 14, 2025

resemble-ai / Perth

Open Audio Watermarking Tool

Python 193 15 Updated May 13, 2025

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 3,324 345 Updated Jun 2, 2025

open-dict-data / ipa-dict

Monolingual wordlists with pronunciation information in IPA

626 95 Updated May 24, 2025

jiaqili3 / DualCodec

A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation

Jupyter Notebook 34 4 Updated Jun 4, 2025

FlagOpen / FlagEval

7 Updated May 16, 2025

facebookresearch / llama-hd-dataset

This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.

18 1 Updated Jan 22, 2024

lars76 / fastspeech2-clean

Clean and modernized implementation of FastSpeech2/LightSpeech using IPA

Python 8 1 Updated Aug 16, 2024

Ksuriuri / index-tts-vllm

Added vLLM support to IndexTTS for faster inference.

Python 227 20 Updated Jun 9, 2025

zhangshaolei1998 / Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

586 8 Updated Jun 7, 2024

SparkAudio / VoxBox

A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.

Python 75 2 Updated May 5, 2025

nu-dialogue / moshi-finetune

Fine-tuning Moshi/J-Moshi on your own spoken dialogue data

Python 57 6 Updated Apr 8, 2025

idiap / knn-tts

Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model

Python 25 6 Updated Apr 29, 2025

codebyzeb / g2p-plus

Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories

Python 12 1 Updated Apr 10, 2025

mtkresearch / TASTE-SpokenLM

A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenization stage.

Python 78 7 Updated Jun 11, 2025

lenML / Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 1,258 170 Updated May 25, 2025

jasonppy / VoiceStar

VoiceStar: Robust, Duration-controllable TTS that can Extrapolate

Python 250 18 Updated May 31, 2025

MatthewCYM / VoiceBench

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 220 11 Updated Jun 12, 2025

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,031 692 Updated Aug 13, 2024

LqNoob / Neural-Codec-and-Speech-Language-Models

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 170 13 Updated Jun 13, 2025

Stylish-TTS / stylish-tts

High quality text-to-speech based on StyleTTS 2.

Python 50 10 Updated Jun 13, 2025

pengzhendong / g2p-mix

Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.

Python 101 12 Updated Mar 20, 2025

RLHFlow / Online-DPO-R1

Codebase for Iterative DPO Using Rule-based Rewards

Python 247 31 Updated Apr 11, 2025

qi-hua / async_cosyvoice

使用vllm加速cosyvoice2的推理

Jupyter Notebook 336 43 Updated Apr 26, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 2,660 232 Updated May 29, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,610 7,966 Updated Jun 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly