Stars
Python interface to the WebRTC Voice Activity Detector
😎 Curated list of awesome things regarding the WebAssembly (wasm) ecosystem.
Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
An awesome README template to jumpstart your projects!
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Open standard for machine learning interoperability
Curated list of project-based tutorials
High-performance C++ headers for real-time object detection and segmentation using YOLO models, leveraging ONNX Runtime and OpenCV for seamless integration. Supports multiple YOLO (v5, v7, v8, v9…
An extremely fast Python package and project manager, written in Rust.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Extract music from YouTube videos
ArduPlane, ArduCopter, ArduRover, ArduSub source
A bare metal programming guide (ARM microcontrollers)
Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.
Learn LeetCode and prepare for coding interviews with free resources.
⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other building blocks