Stars
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
This is a simple HTTP service that uses the Edge-TTS library to generate text-to-speech audio files.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Controllable and fast Text-to-Speech for over 7000 languages!
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Generate imagined websites on an infinite canvas
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.
Chroma's fork of @xenova/transformers
A high-throughput and memory-efficient inference and serving engine for LLMs
TTS with The Massively Multilingual Speech (MMS) project
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Generative Agents: Interactive Simulacra of Human Behavior
😄 Recognizes human faces and their corresponding emotions from a video or webcam feed. Powered by OpenCV and Deep Learning.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
A natural language interface for computers
Foundational Models for State-of-the-Art Speech and Text Translation
Single binary full stack Nuxt3 PocketBase application
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Persian/Farsi text-to-speech (TTS) training using Coqui TTS
Simple CVs: A collaborative repository offering a variety of CV templates for job seekers and an open platform for anyone willing to share their simple CVs, helping each other in the journey to fin…