Gumpest

🎯

Focusing

Yuan Zhang Gumpest

🎯

Focusing

Ph.D. Student @ Peking Uni.

131 followers · 35 following

Peking University
Beijing
01:57 (UTC +08:00)
https://yuanzhang.cc/
@YuanZhang_PKU

Achievements

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

SkyworkAI / Skywork-R1V

Skywork-R1V2:Multimodal Hybrid Reinforcement Learning for Reasoning

Python 2,603 251 Updated May 30, 2025

FlagOpen / RoboBrain

[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.

Python 212 13 Updated May 9, 2025

design-edit / DesignEdit

DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework

Python 341 23 Updated Dec 10, 2024

PKU-HMI-Lab / LIFT3D

[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 142 10 Updated Mar 6, 2025

RoyZry98 / MoLe-VLA-Pytorch

[Arxiv 2025: MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation]

Python 34 2 Updated Apr 7, 2025

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,908 132 Updated Oct 30, 2024

hey-cjj / MoVE-KD

[CVPR'25] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".

Python 25 Updated Mar 13, 2025

wzzheng / GPD

GPD-1: Generative Pre-training for Driving

Python 73 1 Updated Dec 12, 2024

leofan90 / Awesome-World-Models

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

142 3 Updated May 30, 2025

foundation-multimodal-models / CAL

[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Python 56 2 Updated Sep 26, 2024

Gumpest / SparseVLMs

[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".

Python 112 7 Updated May 18, 2025

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,368 464 Updated Nov 6, 2024

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,057 63 Updated Mar 9, 2025

open-mmlab / mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

Python 1,600 236 Updated Jun 11, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,548 289 Updated Jun 2, 2025

foundation-multimodal-models / ConBench

[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".

Python 35 2 Updated Oct 23, 2024

tinyvision / DAMO-YOLO

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Python 3,060 385 Updated May 25, 2024

Gumpest / FreeKD

[CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt".

Python 44 5 Updated Apr 20, 2024

hunto / LocalMamba

Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan

Python 248 13 Updated May 6, 2024

WongKinYiu / yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,266 1,538 Updated Aug 9, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,944 444 Updated Aug 7, 2024

PKU-YuanGroup / Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 939 46 Updated Oct 16, 2024

zhoubolei / bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,662 150 Updated May 9, 2023

Gumpest / AvatarKD

[ACM MM'23] Official implementation of paper "Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty".

Python 13 3 Updated Nov 22, 2023

hunto / DiffKD

Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023

86 8 Updated Jan 24, 2024

byoungd / English-level-up-tips

An advanced guide to learn English which might benefit you a lot 🎉 . 离谱的英语学习指南/英语学习教程。

HTML 38,627 4,215 Updated Jul 13, 2024

he-y / Awesome-Pruning

A curated list of neural network pruning resources.

2,449 330 Updated Apr 4, 2024

Gumpest / MasKD

Forked from hunto/MasKD

Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.

Python 9 Updated Mar 13, 2023

ccfddl / ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 7,484 501 Updated Jun 1, 2025

open-mmlab / mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,673 1,091 Updated Nov 1, 2024