AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python 830 84 Updated Jun 14, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,337 154 Updated Jun 17, 2025

OpenDriveLab / UniVLA

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 442 20 Updated Jun 18, 2025

Little-Podi / AdaWorld

[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".

Python 115 5 Updated Jun 17, 2025

IntologyAI / Zochi

Repository for Zochi's Research

Python 216 20 Updated May 30, 2025

OpenDriveLab / AgiBot-World

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,114 132 Updated Jun 11, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,293 1,481 Updated Jun 13, 2025

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI Community] Embodied AI Technical Guide (Embodied-AI-Guide)

5,795 374 Updated Jun 11, 2025

lllyasviel / ControlNet

Let us control diffusion models!

Python 32,573 2,910 Updated Feb 25, 2024

mayuelala / Awesome-Controllable-Video-Generation

🚀🚀🚀 A curated list of papers on controllable video generation.

267 22 Updated Jun 15, 2025

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,129 107 Updated May 30, 2025

operator22th / awesome-world-models-for-robots

21 Updated Jun 18, 2025

LMD0311 / Awesome-World-Model

A collection of papers on world models for autonomous driving (and robotics).

1,075 38 Updated Jun 15, 2025

fangzr / TOC-Edge-Aerial

The repository for paper 'Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy'.

10 Updated May 28, 2025

CP-Security / CP-Guard

This is the official implementation of "CP-Guard: Malicious agent detection and defense in collaborative bird's eye view segmentation".

Python 3 Updated May 23, 2025

Jiaaqiliu / Awesome-VLA-Robotics

A comprehensive list of excellent research papers, models, datasets, and other resources on Vision-Language-Action (VLA) models in robotics.

301 4 Updated Jun 17, 2025

dl-m9 / dl-m9.github.io

CSS 2 Updated Jun 17, 2025

dl-m9 / academic-homepage-modernism

A modern, responsive academic personal website.

CSS 15 7 Updated Apr 5, 2025

dl-m9 / Multi-Agent-Autonomous-Driving

All you need for Multi-Agent Autonomous Driving (MAAD)

37 2 Updated May 28, 2025

Grandzxw / awesome-multi-robot-system

Recent multi-robot projects and papers: Including SLAM, place recognition, Large Language Models navigation. (continually updated)

84 2 Updated Mar 24, 2025

camel-ai / owl

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 17,079 1,998 Updated Jun 11, 2025

Chongjie-Si / Subspace-Tuning

A generalized framework for subspace tuning methods in parameter efficient fine-tuning.

Python 142 5 Updated Feb 7, 2025

DarkLight dl-m9

Lists (8)

🧬 AI Companion Stack

AIGC

Autonomous Driving

🌎 Collaborative Perception

💫 LLM Agent

PEFT

Research Agent

steamlit

Starred repositories

imbalanced-learning