dl-m9 (DarkLight) / Starred · GitHub
🌴
On vacation
  • Hong Kong
  • 04:13 (UTC +08:00)

Starred repositories

Awesome Unified Multimodal Models

318 8 Updated May 22, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,091 49 Updated Jun 13, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,537 6,408 Updated Jun 18, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

739 48 Updated Jun 1, 2025
Python 6,460 436 Updated May 21, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,994 147 Updated Jun 3, 2025

Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

Python 112 8 Updated May 26, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 1,985 84 Updated May 21, 2025

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python 830 84 Updated Jun 14, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,337 154 Updated Jun 17, 2025

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 442 20 Updated Jun 18, 2025

[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".

Python 115 5 Updated Jun 17, 2025

Repository for Zochi's Research

Python 216 20 Updated May 30, 2025

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,114 132 Updated Jun 11, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,293 1,481 Updated Jun 13, 2025

[Lumina Embodied AI Community] Embodied Intelligence Technical Guide (Embodied-AI-Guide)

5,795 374 Updated Jun 11, 2025

Let us control diffusion models!

Python 32,573 2,910 Updated Feb 25, 2024

🚀🚀🚀A curated list of papers on controllable video generation.

267 22 Updated Jun 15, 2025

[CSUR] A Survey on Video Diffusion Models

2,129 107 Updated May 30, 2025

A collection of papers on world models for autonomous driving (and robotics).

1,075 38 Updated Jun 15, 2025

The repository for paper 'Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy'.

10 Updated May 28, 2025

This is the official implementation of "CP-Guard: Malicious agent detection and defense in collaborative bird's eye view segmentation"

Python 3 Updated May 23, 2025

A comprehensive list of excellent research papers, models, datasets, and other resources on Vision-Language-Action (VLA) models in robotics.

301 4 Updated Jun 17, 2025
CSS 2 Updated Jun 17, 2025

A modern, responsive academic personal website.

CSS 15 7 Updated Apr 5, 2025

All you need for Multi-Agent Autonomous Driving (MAAD)

37 2 Updated May 28, 2025

Recent multi-robot projects and papers: Including SLAM, place recognition, Large Language Models navigation. (continually updated)

84 2 Updated Mar 24, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 17,079 1,998 Updated Jun 11, 2025

A generalized framework for subspace tuning methods in parameter efficient fine-tuning.

Python 142 5 Updated Feb 7, 2025