- Shenzhen, China
- https://wangrongsheng.github.io/
Stars
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
A Curated Benchmark Repository for Medical Vision-Language Models
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
Towards Fine-grained Audio Captioning with Multimodal Contextual Cues
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
OLMoE: Open Mixture-of-Experts Language Models
Open-source Multi-agent Poster Generation from Papers
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
[CVPR 2025] HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
General Reasoner: Advancing LLM Reasoning Across All Domains
II-Agent: a new open-source framework to build and deploy intelligent agents
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.
Official Repository of Absolute Zero Reasoner
The world's first open-source "Vibe Workflow" for complex tasks.
PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"