chhluo (Cedric Luo) / Starred · GitHub
  • Zhejiang University
  • Hangzhou, China


MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving a 5x+ speedup on typical end-side chips

Jupyter Notebook 8,052 500 Updated Jul 1, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,314 1,713 Updated Jul 4, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,285 50 Updated Jun 14, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,347 172 Updated Mar 28, 2025

Model Context Protocol Servers

TypeScript 57,567 6,656 Updated Jul 4, 2025

A course on aligning smol models.

Jupyter Notebook 5,990 2,137 Updated Jul 1, 2025

Collects and organizes open-source models, datasets, and evaluation benchmarks for vertical domains.

2,510 200 Updated Dec 26, 2023

Solve Visual Understanding with Reinforced VLMs

Python 5,244 321 Updated Jun 26, 2025

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…

Python 709 91 Updated Mar 13, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,837 127 Updated Jun 16, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,042 84 Updated Jun 26, 2025

[KDD 2024] Team up GBDTs and DNNs: Advancing Efficient and Effective Tabular Prediction with Tree-hybrid MLPs

Python 10 Updated Mar 3, 2025

s1: Simple test-time scaling

Python 6,470 751 Updated Jun 25, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,640 95 Updated Mar 18, 2025

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 968 61 Updated Jun 14, 2025

Witness the aha moment of VLM with less than $3.

Python 3,818 289 Updated May 19, 2025

Fully open reproduction of DeepSeek-R1

Python 24,951 2,320 Updated Jul 3, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,973 1,491 Updated Apr 24, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,765 380 Updated Jul 3, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,764 1,438 Updated Jun 30, 2025

An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.

Python 260 35 Updated Jun 19, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,998 265 Updated Jul 3, 2025

A self-learning tutorial for CUDA High Performance Programming.

JavaScript 671 70 Updated Jun 30, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

445 10 Updated Jan 17, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,626 567 Updated Jul 3, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,934 1,759 Updated Feb 26, 2025

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

2,466 112 Updated Jun 20, 2025