Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Material inspired stylesheet for PySide2, PySide6, PyQt5 and PyQt6
Official inference repo for FLUX.1 models
Protocol Buffers - Google's data interchange format
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
基于系统代理的抖音弹幕wss抓取程序,能够获取所有数据来源,包括chrome,抖音直播伴侣等,可进行进程过滤
TTSFM is a reverse-engineered API server that mirrors OpenAI's TTS service, providing a compatible interface for text-to-speech conversion with multiple voice options.
Multi-Joint dynamics with Contact. A general purpose physics simulator.
An add-on for Blender allowing to create URDF, SDF and SMURF robot models in a WYSIWYG environment.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Robot kinematics implemented in pytorch
NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
🎥 Python and OpenCV-based scene cut/transition detection program & library.
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Robust Speech Recognition via Large-Scale Weak Supervision
A tool for reverse engineering Android apk files
Mobile UI viewer in browser, view the UI in a tree view, and generate XPath automatically.
Using system APIs directly with adb/root privileges from normal apps through a Java process started with app_process.
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…