Stars
web audio, cracked.
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
A TTS model capable of generating ultra-realistic dialogue in one pass.
🚀 The fast, Pythonic way to build MCP servers and clients
OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to in…
SpatialLM: Large Language Model for Spatial Understanding
Real-time face swap and one-click video deepfake with only a single image
FastSAM TouchDesigner Plugin – A TouchDesigner .tox plugin for real-time segmentation using FastSAM
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A lightweight data processing framework built on DuckDB and 3FS.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
Simple, unified interface to multiple Generative AI providers
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Let's be honest, who really knows how to use ffmpeg? It's that tool that is so helpful but not needed enough to justify learning all of its inner workings. The days of scrolling through stackoverflo…
Curated list of Creative Technology groups, companies, studios, collectives, etc.
Redirect eye gaze when the eyes are not in the correct position during a video conference, using Python and its OpenCV library
TouchPy is a high-performance toolset to work with TouchDesigner components in Python
A generative world for general-purpose robotics & embodied AI learning.
On-device Image Generation for Apple Silicon
An extremely fast Python package and project manager, written in Rust.
OneTrainer is a one-stop solution for all your Stable Diffusion training needs.
A terminal assistant for the hopelessly confused
Memory-Guided Diffusion for Expressive Talking Video Generation