8000 Gumpest (Yuan Zhang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Gumpest's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Gumpest

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Skywork-R1V2:Multimodal Hybrid Reinforcement Learning for Reasoning

Python 2,603 251 Updated May 30, 2025

[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.

Python 212 13 Updated May 9, 2025

DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework

Python 341 23 Updated Dec 10, 2024

[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 142 10 Updated Mar 6, 2025

[Arxiv 2025: MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation]

Python 34 2 Updated Apr 7, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,908 132 Updated Oct 30, 2024

[CVPR'25] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".

Python 25 Updated Mar 13, 2025

GPD-1: Generative Pre-training for Driving

Python 73 1 Updated Dec 12, 2024

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

142 3 Updated May 30, 2025

[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Python 56 2 Updated Sep 26, 2024

[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".

Python 112 7 Updated May 18, 2025

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,368 464 Updated Nov 6, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,057 63 Updated Mar 9, 2025

OpenMMLab Model Compression Toolbox and Benchmark.

Python 1,600 236 Updated Jun 11, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,548 289 Updated Jun 2, 2025

[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".

Python 35 2 Updated Oct 23, 2024

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Python 3,060 385 Updated May 25, 2024

[CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt".

Python 44 5 Updated Apr 20, 2024

Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan

Python 248 13 Updated May 6, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,266 1,538 Updated Aug 9, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,944 444 Updated Aug 7, 2024

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 939 46 Updated Oct 16, 2024

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,662 150 Updated May 9, 2023

[ACM MM'23] Official implementation of paper "Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty".

Python 13 3 Updated Nov 22, 2023

Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023

86 8 Updated Jan 24, 2024

An advanced guide to learn English which might benefit you a lot 🎉 . 离谱的英语学习指南/英语学习教程。

HTML 38,627 4,215 Updated Jul 13, 2024

A curated list of neural network pruning resources.

2,449 330 Updated Apr 4, 2024

Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.

Python 9 Updated Mar 13, 2023

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 7,484 501 Updated Jun 1, 2025

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,673 1,091 Updated Nov 1, 2024
Next
0