Stars
Snapclient on ESP32
Code for the Computational Auditory Scene Analysis class with Professor Pardo
Home Assistant Voice Assistant Satellite Raspberry Pi setup/config
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Everything you need to build state-of-the-art foundation models, end-to-end.
Multiroom music for Home Assistant (via snapcast)
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Expressive Anechoic Recordings of Speech (EARS)
verl: Volcano Engine Reinforcement Learning for LLMs
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Control adaptive filters with neural networks.
Acoustic Echo Cancellation with Neural Kalman Filtering
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
Simple and efficient Python implementation of a series of adaptive filters, including time-domain adaptive filters (LMS, NLMS, RLS, AP, Kalman), nonlinear adaptive filters (Volterra filter, functional link a…
Demo the principle of Echo Cancellation using simple audio samples.
Official implementation of MatterGen -- a generative model for inorganic materials design across the periodic table that can be fine-tuned to steer the generation towards a wide range of property c…