-
Baidu Inc.
- Shanghai
-
11:58
(UTC +08:00)
Lists (32)
Sort Name ascending (A-Z)
🌟 accelerate & optimize
anormaly detection
awesome-series
chatroom
create a world
cuda
datasets
gpt
jax+differentiable renderer+其他..
LLMs
模型,训练,部署LLM应用
loss
Multimodal
Music
nas + 远程
ner
paper
physics
proxy
py+js
Risk Management
rl
rust-ml
sentence vec
time series
transformer
trie
Vec (or not) Search
Vision 👁️ 👁️
✌️ voice
模型基础结构
系统相关
- All languages
- AutoHotkey
- Awk
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- Crystal
- Cuda
- Cython
- D
- Dart
- Dockerfile
- Elixir
- Fortran
- GDScript
- Go
- HTML
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- MLIR
- Markdown
- Mojo
- Nim
- PHP
- Pascal
- PostScript
- PowerShell
- Processing
- Python
- Rust
- SCSS
- SWIG
- Sass
- Shell
- Svelte
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
- Zig
Starred repositories
The simplest, fastest repository for training/finetuning small-sized VLMs.
Modeling, training, eval, and inference code for OLMo
Build your personal knowledge base with TriliumNext Notes
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.
Implementing DeepSeek R1's GRPO algorithm from scratch
An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
APOLLO: SGD-like Memory, AdamW-level Performance
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
A generative world for general-purpose robotics & embodied AI learning.
Python implementation of MPPI (Model Predictive Path-Integral) controller to understand the basic idea. Mandatory dependencies are numpy and matplotlib only.
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
The official Python SDK for Model Context Protocol servers and clients
Deep research agent to help you find the best GitHub repositories 🕵️!
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
verl: Volcano Engine Reinforcement Learning for LLMs
Solve Visual Understanding with Reinforced VLMs
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
🪄 Create rich visualizations with AI
RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.
Minimal reproduction of DeepSeek R1-Zero