This codebase contains an extensible framework for implementing various Unsupervised Environment Design (UED) algorithms, including state-of-the-art Dual Curriculum Design (DCD) algorithms with minimax-regret robustness properties like ACCEL and Robust PLR:
- ACCEL
- Robust PLR
- PLR
- REPAIRED
- PAIRED
- ALP-GMM
- Minimax adversarial training
- Domain randomization (DR)
- PAIRED+HiEnt+BC+Evo
We also include experiment configurations for the main experiments in the following papers on DCD methods:
- Replay-Guided Adversarial Design. Jiang et al, 2021 (NeurIPS 2021)
- Evolving Curricula with Regret-Based Environment Design. Parker-Holder et al, 2022 (ICML 2022)
- Stabilizing Unsupervised Environment Design with a Learned Adversary. Mediratta et al, 2023 (CoLLAs 2023)