aixingxy (xingxy) / Starred · GitHub

Starred repositories

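Code implementations for the book CUDA C 编程权威指南 (Professional CUDA C Programming). Includes most of the code from Chapters 2 through 8 along with the author's notes. Everything was written by hand by the author, so mistakes are inevitable; please use it with caution, and corrections are very welcome. If you find it helpful, please give it a Star; it helps the author a lot. Thank you!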

Cuda 343 24 Updated Oct 20, 2022

In-car multi-channel speech transcription system of AISHELL-5.

Python 22 1 Updated Jun 9, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,073 686 Updated Jun 14, 2025

🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨

Python 83 11 Updated May 20, 2025

The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.

Python 150 5 Updated Apr 14, 2025

Open Audio Watermarking Tool

Python 193 15 Updated May 13, 2025

Generative models for conditional audio generation

Python 3,324 345 Updated Jun 2, 2025

Monolingual wordlists with pronunciation information in IPA

626 95 Updated May 24, 2025

A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation

Jupyter Notebook 34 4 Updated Jun 4, 2025
7 Updated May 16, 2025

This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.

18 1 Updated Jan 22, 2024

Clean and modernized implementation of FastSpeech2/LightSpeech using IPA

Python 8 1 Updated Aug 16, 2024

Added vLLM support to IndexTTS for faster inference.

Python 227 20 Updated Jun 9, 2025

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

586 8 Updated Jun 7, 2024

A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.

Python 75 2 Updated May 5, 2025

Fine-tuning Moshi/J-Moshi on your own spoken dialogue data

Python 57 6 Updated Apr 8, 2025

Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model

Python 25 6 Updated Apr 29, 2025

Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories

Python 12 1 Updated Apr 10, 2025

A method that directly addresses the modality gap by aligning speech tokens with the corresponding text transcription during the tokenization stage.

Python 78 7 Updated Jun 11, 2025

🍦 Speech-AI-Forge is a project developed around TTS generation models, implementing an API Server and a Gradio-based WebUI.

Python 1,258 170 Updated May 25, 2025

VoiceStar: Robust, Duration-controllable TTS that can Extrapolate

Python 250 18 Updated May 31, 2025

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 220 11 Updated Jun 12, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,031 692 Updated Aug 13, 2024

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 170 13 Updated Jun 13, 2025

High quality text-to-speech based on StyleTTS 2.

Python 50 10 Updated Jun 13, 2025

Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.

Python 101 12 Updated Mar 20, 2025

Codebase for Iterative DPO Using Rule-based Rewards

Python 247 31 Updated Apr 11, 2025

Accelerating CosyVoice2 inference with vLLM

Jupyter Notebook 336 43 Updated Apr 26, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 2,660 232 Updated May 29, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,610 7,966 Updated Jun 14, 2025