zyzisyz's list / speech · GitHub

Organizations

@CSLT-THU @thuhcsi

Stars

speech

7 repositories

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Python · 149 stars · 31 forks · Updated May 2, 2024

Localized watermarking for AI-generated speech audio, with state-of-the-art robustness and a very fast detector

Python · 569 stars · 72 forks · Updated Apr 29, 2025

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python · 617 stars · 32 forks · Updated Nov 19, 2024

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python · 1,160 stars · 100 forks · Updated Mar 2, 2025

Audio Codec Speech processing Universal PERformance Benchmark

Python · 258 stars · 25 forks · Updated Apr 14, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python · 6,115 stars · 580 forks · Updated Jun 11, 2025

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python · 2,958 stars · 252 forks · Updated Dec 5, 2024