Stars
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
The Next Step Forward in Multimodal LLM Alignment
Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation"
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
LPY1219 / GR-MG
Forked from bytedance/GR-MGOfficial implementation of GR-MG
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
[CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
This repo has the expert data generation infrastructure and Pytorch implementation of MPiNets.
LPY1219 / ai-edu
Forked from microsoft/ai-eduAI education materials for Chinese students, teachers and IT professionals.
AI education materials for Chinese students, teachers and IT professionals.