VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning

🌟 Highlights

🎯 General Manipulation: Improving OpenVLA-7B with outcome-based multi-task reinforcement learing.
⚡️ Cutting-edge Architecture: Built with Ray+vLLM+LoRA+FSDP, our codebase delivers both scalability and flexibility.
📝 Clean Implementation: Following cleanrl's philosophy, we provide a single-file implementation for easy reading and modification.
🚧 Active Development: Work in Progress, let's build it together.

📝 TODO

Support SERL-style Real-world RL
Support More Environments (e.g., Roboverse)
Support More VLAs (e.g., MiniVLA)

🛠️ Installation

See INSTALL.md for installation instructions.

See ERROR_CATCH.md for error catching.

🚀 Quick Start

Before launching distributed training, please edit the script with the appropriate dataset and model paths first.

📈 Training

# bash scripts/train_rl_vllm_ray_fsdp.sh <gpus> <task_ids>
# e.g., 
bash scripts/train_rl_vllm_ray_fsdp.sh 0,1 0,1,2,3,4,5,6,7,8,9

🧪 Evaluation

# parallel evaluation with vectorized environment
bash scripts/eval_vllm_ray.sh 0,1

🏷️ License

This repository is released under the Apache-2.0 license.

🙏 Acknowledgement

Our code is built upon open-instruct, OpenRLHF, verl and openvla. We thank all these authors for their nicely open sourced code and their great contributions to the community.

🥰 Citation

If you find this repository helpful, please consider citing:

@misc{lu2025vlarl,
  title={VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning},
  author={Guanxing Lu, Chubin Zhang, Haonan Jiang, Yuheng Zhou, Zifeng Gao, Yansong Tang and Ziwei Wang},
  year={2025},
  howpublished={\url{https://congruous-farmhouse-8db.notion.site/VLA-RL-Towards-Masterful-and-General-Robotic-Manipulation-with-Scalable-Reinforcement-Learning-1953a2cd706280ecaad4e93a5bd2b8e3?pvs=4}},
  note={Notion Blog}
}

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
docs		docs
experiments/robot		experiments/robot
openpi		openpi
ppo		ppo
prismatic		prismatic
scripts		scripts
test/tools		test/tools
vla-scripts		vla-scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
aggregate_results.py		aggregate_results.py
environment.yml		environment.yml
ppo_vllm_thread_ray_fsdp_vla_v3.py		ppo_vllm_thread_ray_fsdp_vla_v3.py
pyproject.toml		pyproject.toml
requirements-min.txt		requirements-min.txt
run_libero_eval_vllm.py		run_libero_eval_vllm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning

🌟 Highlights

📝 TODO

🛠️ Installation

🚀 Quick Start

📈 Training

🧪 Evaluation

🏷️ License

🙏 Acknowledgement

🥰 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

GuanxingLu/vlarl

Folders and files

Latest commit

History

Repository files navigation

VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning

🌟 Highlights

📝 TODO

🛠️ Installation

🚀 Quick Start

📈 Training

🧪 Evaluation

🏷️ License

🙏 Acknowledgement

🥰 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages