Stars
- All languages
- Batchfile
- C
- C++
- CSS
- CoffeeScript
- Cython
- Dart
- Dockerfile
- Go
- Groff
- HTML
- Haskell
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Objective-C
- Objective-C++
- OpenEdge ABL
- PHP
- PLpgSQL
- Perl
- PureBasic
- Python
- Ruby
- Rust
- Scala
- Scheme
- Shell
- Solidity
- Svelte
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Automatical Arbitrage opportunity searching in Convertible Bonds Market with five ranking/comparing stratigies
提供同花顺客户端/miniqmt/雪球的股票量化交易,支持跟踪 joinquant /ricequant 模拟交易 和 实盘雪球组合
遇事不决,Vibe 力学! One-Person Company AI Tools Series – continuously updated to help boost productivity and empower your solo business!
Build effective agents using Model Context Protocol and simple workflow patterns
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
Skywork-R1V2:Multimodal Hybrid Reinforcement Learning for Reasoning
Integrate the DeepSeek API into popular softwares
ChatMCP is an AI chat client implementing the Model Context Protocol (MCP).
这是一个为大模型提供 A 股数据的的 MCP(Model Content Protocol) 服务。
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Align Anything: Training All-modality Model with Feedback
A fork to add multimodal model training to open-r1
Fully open reproduction of DeepSeek-R1
Witness the aha moment of VLM with less than $3.
Recipes to train reward model for RLHF.
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Example models using DeepSpeed
Aligning LMMs with Factually Augmented RLHF