8000 shuipi100 (shenquanbo) / Starred · GitHub

More Web Proxy on the site http://driver.im/

shuipi100

Follow

💭

I may be slow to respond.

shenquanbo shuipi100

💭

I may be slow to respond.

Follow

16 followers · 201 following

scistor
Beijing，china

Achievements

Achievements

10000

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Starred repositories

dpirch / libfvad

Voice activity detection (VAD) library, based on WebRTC's VAD engine

C 539 185 Updated Apr 2, 2024

TEN-framework / ten-vad

A Low-Latency, Lightweight and High-Performance Streaming VAD

C 370 29 Updated May 20, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,246 1,290 Updated May 21, 2025

xieyuankun / All-Type-ADD

This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”

Python 14 Updated May 21, 2025

stepfun-ai / Step-Audio

Python 4,301 348 Updated Mar 12, 2025

sunface / rust-by-practice

Learning Rust By Practice, narrowing the gap between beginner and skilled-dev through challenging examples, exercises and projects.

Rust 12,961 1,060 Updated Apr 25, 2025

ibeatai / BeatAI

持续分享/翻译 AI 领域的优秀内容，帮你战胜 AI，Just beat it! 欢迎 star 订阅，记住域名不迷路 https://BeatAI.cn

Handlebars 3,876 215 Updated May 23, 2025

TheAlgorithms / Rust

All Algorithms implemented in Rust

Rust 23,986 2,388 Updated May 22, 2025

78 / xiaozhi-esp32

Build your own AI friend

C++ 13,410 2,617 Updated May 22, 2025

Liu-Tianchi / Nes2Net

Python 33 4 Updated Apr 29, 2025

rust-lang / book

The Rust Programming Language

Rust 16,136 3,629 Updated May 22, 2025

luotianze666 / WaveFM

[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching

Python 91 7 Updated Mar 27, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 1,879 168 Updated May 23, 2025

bytedance / MegaTTS3

Python 5,359 378 Updated May 11, 2025

swesterfeld / audiowmark

Audio Watermarking

C++ 448 89 Updated May 15, 2025

Ling-Ink / MorseAudioDecoder

自动从音频中提取摩斯密码

Python 23 4 Updated Jun 12, 2023

AI-S2-Lab / HMSCF-ADD

[Information Fusion'2025] Hierarchical multi-source cues fusion for mono-to-binaural based Audio Deepfake Detection

1 Updated Jul 10, 2024

MaorAssayag / morse-deep-learning-detect-and-decode

Morse Code Decoder & Detector with Deep Learning

Jupyter Notebook 17 3 Updated Apr 17, 2024

seongho608 / RingFormer

Python 46 2 Updated Jan 9, 2025

yl4579 / StyleTTS-ZS

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

179 13 Updated Sep 27, 2024

duixcom / Duix.Heygem

C 8,725 1,447 Updated May 20, 2025

jixiaozhong / Sonic

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 2,740 232 Updated May 7, 2025

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 4,068 611 Updated May 16, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 9,510 998 Updated Apr 9, 2025

ytdl-org / youtube-dl

Command-line program to download videos from YouTube.com and other video sites

Python 135,765 10,336 Updated May 4, 2025

viitor-ai / viitor-voice

An LLM base TTS engine

Python 79 6 Updated Dec 25, 2024

Yaselley / SSL_Layerwise_Deepfake

SSL Layerwise analysis for speech deepfake detection

Python 22 1 Updated Feb 17, 2025

changjinhan / ADD-arxiv-daily

🕵️‍♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)

Python 8 Updated May 23, 2025

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,638 731 Updated Mar 5, 2025

THUDM / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 2,925 247 Updated Dec 5, 2024

Starred topics

text-to-speech

0