8000 ShaohuaDong2021 (ShaohuaDong) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ShaohuaDong2021's full-sized avatar

Block or report ShaohuaDong2021

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Python 219 12 Updated Apr 13, 2025

🔥 🔥 🔥 A paper list of some recent Computer Vision(CV) works

440 29 Updated Jun 27, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 250 13 Updated Jun 27, 2025

A Paper List for Humanoid Robot Learning.

560 31 Updated Jun 23, 2025

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 404 16 Updated May 17, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,561 276 Updated Jun 19, 2025

Code for the Molmo Vision-Language Model

Python 524 40 Updated Dec 12, 2024

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,145 85 Updated Jun 27, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,271 162 Updated Feb 16, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 41,165 3,271 Updated Jun 27, 2025

DFloat11: Lossless LLM Compression for Efficient GPU Inference

Python 435 28 Updated May 23, 2025

A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms

Python 1,774 154 Updated Jun 27, 2025

This repository collects papers on VLLM applications. We will update new papers irregularly.

141 12 Updated May 25, 2025

The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"

Python 113 6 Updated Jun 19, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,448 67 Updated Apr 18, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,341 76 Updated May 28, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 1,642 135 Updated Jun 5, 2025

Official inference framework for 1-bit LLMs

Python 20,342 1,526 Updated Jun 3, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 46,560 5,311 Updated Jun 24, 2025

Lets make video diffusion practical!

Python 14,710 1,322 Updated Jun 27, 2025

Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning

Python 192 7 Updated Apr 19, 2025

dynamic Arm for Robitc Mischief

80 7 Updated Jun 23, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,428 648 Updated May 29, 2025

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

Python 635 57 Updated May 19, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,408 312 Updated May 13, 2025

flow-pilot is an openpilot based driver assistance system that runs on linux, windows and android powered machines.

C 1,756 247 Updated Sep 19, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,232 1,499 Updated Jun 26, 2025

一个全开源低成本的双足机器人(2万元($3000))A Fully Opensourced Humanoid Robot with only $3000

C 162 26 Updated Mar 18, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,612 1,123 Updated Jun 17, 2025

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 908 37 Updated Jun 8, 2025
Next
0