Stars
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. Th…
NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
RTSP/RTP/RTMP/FLV/HLS/MPEG-TS/MPEG-PS/MPEG-DASH/MP4/fMP4/MKV/WebM
WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11
A lightweight and very fast event bus / event framework for C++17
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream
Sample code for my CUDA programming book
A simple C++11 Thread Pool implementation
An easy-to-use and efficient memory pool allocator written in C++.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation