8000 WangRongsheng / Starred · GitHub

More Web Proxy on the site http://driver.im/

WangRongsheng

Follow

🎯

Focusing

WangRongsheng

🎯

Focusing

Follow

🎧 Keep passion for all research. I'm always open to collaborate on interesting works!

653 followers · 267 following

ShenZhen,China
https://wangrongsheng.github.io/

Achievements

Achievements

Highlights

Developer Program Member

Lists (10)

Sort

AIGC

Awesome

KAN

LLM-applications

16 repositories

LLM-reaserch

28 repositories

Med

Multimodal

18 repositories

RAG

15 repositories

Transportation

Tutorials

Stars

linshenkx / prompt-optimizer

一款提示词优化器，助力于编写高质量的提示词

TypeScript 7,305 928 Updated Jun 18, 2025

MiniMax-AI / MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 1,867 101 Updated Jun 19, 2025

yezanting / Med-VLM-Bench-Summary

A Curated Benchmark Repository for Medical Vision-Language Models

107 7 Updated Jun 17, 2025

VRPO / VRPO

Python 7 1 Updated Mar 27, 2025

SakanaAI / text-to-lora

Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input

Python 678 37 Updated Jun 8, 2025

NVlabs / Fast-dLLM

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 231 10 Updated Jun 10, 2025

FreedomIntelligence / FusionAudio

Towards Fine-grained Audio Captioning with Multimodal Contextual Cues

Python 67 3 Updated Jun 8, 2025

QwenLM / Qwen3-Embedding

Python 812 45 Updated Jun 13, 2025

OpenGVLab / VeBrain

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

63 5 Updated Jun 6, 2025

TIGER-AI-Lab / verl-tool

A version of verl to support tool use

Python 250 15 Updated Jun 18, 2025

allenai / OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 786 72 Updated Mar 14, 2025

Paper2Poster / Paper2Poster

Open-source Multi-agent Poster Generation from Papers

Python 2,121 114 Updated Jun 17, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 4,225 355 Updated Jun 17, 2025

junchenzhi / Awesome-LLM-Ensemble

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

72 7 Updated Jun 20, 2025

Tencent-Hunyuan / HunyuanVideo-Avatar

Python 1,359 175 Updated Jun 17, 2025

Tencent-Hunyuan / HunyuanPortrait

[CVPR 2025] HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation

Python 249 28 Updated Jun 6, 2025

Visual-Agent / DeepEyes

Python 509 20 Updated Jun 19, 2025

Gen-Verse / MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,108 52 Updated Jun 13, 2025

TIGER-AI-Lab / General-Reasoner

General Reasoner: Advancing LLM Reasoning Across All Domains

Python 141 6 Updated Jun 10, 2025

Intelligent-Internet / ii-agent

II-Agent: a new open-source framework to build and deploy intelligent agents

Python 2,487 368 Updated Jun 20, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 4,227 226 Updated May 5, 2025

alibaba / MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 11,957 1,938 Updated Jun 20, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,245 47 Updated Jun 14, 2025

ReTool-RL / ReTool

Python 119 8 Updated Apr 28, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 14,033 1,673 Updated Jun 19, 2025

microsoft / x-reasoner

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

44 2 Updated May 9, 2025

ByteDance-Seed / Seed-Coder

Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.

500 35 Updated Jun 6, 2025

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,538 260 Updated Jun 2, 2025

refly-ai / refly

The world's first open-source "Vibe Workflow" for complex tasks.

TypeScript 4,091 353 Updated Jun 20, 2025

wdrink / SimpleAR

Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 371 20 Updated Jun 20, 2025

0