8000 SakurajimaMaiii (Shuai Wang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View SakurajimaMaiii's full-sized avatar
😇
😇

Block or report SakurajimaMaiii

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Our paper about robust LLM fingerprints.

145 8 Updated Jul 3, 2025

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.

Python 514 11 Updated Jul 3, 2025

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,168 1,363 Updated Jul 3, 2025

WorldVLA: Towards Autoregressive Action World Model

Python 194 5 Updated Jun 28, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 51,638 4,430 Updated Jul 4, 2025

收集全国各高校招生时不会写明,却会实实在在影响大学生活质量的要求与细节

Python 4,356 657 Updated Jul 1, 2025
Python 389 2 Updated Jul 3, 2025

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

459 21 Updated Jul 3, 2025

Main source code of SRPO framework.

Python 28 1 Updated Jun 25, 2025

An agent benchmark with tasks in a simulated software company.

Python 430 62 Updated Jun 28, 2025

slime is a LLM post-training framework aiming at scaling RL.

Python 530 31 Updated Jul 3, 2025
Python 64 3 Updated May 15, 2025

Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grained visual understanding".

62 1 Updated Jun 19, 2025

Nano vLLM

Python 4,837 560 Updated Jun 27, 2025

Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning

Python 193 10 Updated Jun 30, 2025
Python 42 Updated Jun 20, 2025

Muon: An optimizer for hidden layers in neural networks

Python 977 49 Updated Jul 3, 2025

Awesome Unified Multimodal Models

383 11 Updated Jul 2, 2025

🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to …

Python 150 2 Updated Jun 25, 2025
Python 4 Updated May 22, 2025
Python 6 Updated Jun 10, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,636 263 Updated Jun 18, 2025

The official repository of the dots.llm1 base and instruct models proposed by rednote-hilab.

422 21 Updated Jun 11, 2025

Kinetics: Rethinking Test-Time Scaling Laws

Python 30 1 Updated Jun 18, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 38 1 Updated Jun 27, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

517 25 Updated Jul 1, 2025

VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

Python 67 1 Updated Jul 13, 2024

A version of verl to support tool use

Python 279 20 Updated Jul 3, 2025
Python 579 49 Updated Apr 15, 2025
Next
0