Stars
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
A collection of best practice cookiecutter templates for all domains and languages with extensive Github support βΊ
Learn the basics of robotics through hands-on experience using ROS 2 and Gazebo simulation.
A Python wrapper for the high-quality vocoder "World"
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
create dataset from list of youtube links easily
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, Dβ¦
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
SoftVC VITS Singing Voice Conversion
This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support
Wyoming protocol server for Piper text to speech system
React Photo Studio is a free online photo editor for photography and design
Parlant is the open-source conversation modeling engine for building better, deliberate Agentic UX. It gives you the power of LLMs without the unpredictability.
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
YingqingHe / Awesome-Controllable-Video-Generation
Forked from mayuelala/Awesome-Controllable-Video-GenerationπππA curated list of papers on controllable video generation.
π Cherry Studio is a desktop client that supports for multiple LLM providers.
Vision Transformer Cookbook with Tensorflow
ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
Cheatsheet collection for Math, ML, DL, AI. Update frequently
π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Real-Time Deepfake Pipeline
Easily create Piper text-to-speech models in any voice. Make a text-to-speech model with your own voice recordings, or use thousands of RVC voices. Works offline on a Raspberry pi. Rapidly record cβ¦