Lists (1)
Sort Name ascending (A-Z)
Stars
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.
Set of plugins, models and worlds to use with OSRF Gazebo Simulator in SITL and HITL.
Deezer source separation library including pretrained models.
Attentional guide metoduyla deep convolutional neural networks kullanılarak hazırlanan text to speech modelinin türkçeye derlenmiş halidir.
Modern desktop RSS reader built with Electron, React, and Fluent UI
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Provides an API to the Fabric, and its helper tools
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Converts .py to .exe using a simple graphical interface
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Plugins and models for vehicle simulation in Gazebo Sim with ArduPilot SITL controllers
An open-source cross-platform alternative to AirDrop
exo-explore / llama98.c
Forked from karpathy/llama2.cInference Llama models in one file of pure C for Windows 98 running on 25-year-old hardware
Personal CRM. Remember everything about your friends, family and business relationships.
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…
RandomInternetPreson / text_generation_webui_xtt_Alts
Forked from kanttouchthis/text_generation_webui_xttsXTTSv2 Extension for oobabooga text-generation-webui
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
During the training process of custom images in jetson nano developer kit, if you face issues like insufficient memory and low processing speed , you can use Label image tool. Here we are doing the…
UnoJoy! allows you to easily turn an Arduino Uno (or Mega or Leonardo) into a PS3-compatible USB game controller
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
A simple FastAPI Server to run XTTSv2
IT Toolbox - Güçlü ve Pratik BT Araç Kutusu 🚀 IT Toolbox, BT profesyonelleri için günlük operasyonlarını kolaylaştırmayı hedefleyen kapsamlı bir araç setidir. Bu araç, sistem yöneticilerinin ve tek…
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Hasan-Naseer / whisperX
Forked from m-bain/whisperXThis fork has whisperx modified to include verbose and segment level printing for better debugging. Also tweaked to include support for the latest faster-whisper and more models like distil-large-v3.