Stars
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model with CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
An Application Framework for AI Engineering
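A Java Netty network programming framework with a lock-free, asynchronous, event-driven architecture; lightweight, supporting clustering and distributed deployment without depending on any third-party middleware or database; suited to online game servers, IoT, internal systems, and any scenario that needs long-lived connections; with ioGame you can easily build a distributed network server whose cluster is automated and has no central node; FXGL, Unity, UE, Cocos Creator, Godot, Netty, Protobuf, web…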
zero-shot voice conversion & singing voice conversion, with real-time support
GUI for a Vocal Remover that uses Deep Neural Networks.
Model Context Protocol Servers
A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.
Stable Diffusion for real-time music generation
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
Cross-platform, customizable ML solutions for live and streaming media.
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but supports a variety of advanced features, such as a settings page, low VRAM support, D…
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
Offline text-to-speech synthesis for Python
[NeurIPS 2024] The official implementation of HairFastGAN. A framework for virtual hairstyle fitting.
Collection of publicly available IPTV channels from all over the world
Software defined radio receiver powered by GNU Radio and Qt.
Library for turning an RTL2832-based DVB dongle into a software-defined receiver; mirror from https://gitea.osmocom.org/sdr/rtl-sdr
A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use.
A generative speech model for daily dialogue.
Inference and training library for high-quality TTS models.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
An AI agent that beats the classic game "Snake".