Computer Science > Neural and Evolutionary Computing

arXiv:2303.03832 (cs)

[Submitted on 7 Mar 2023]

Title:MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy

Authors:Maxence Faldor, Félix Chalumeau, Manon Flageat, Antoine Cully

View PDF

Abstract:Quality-Diversity algorithms, such as MAP-Elites, are a branch of Evolutionary Computation generating collections of diverse and high-performing solutions, that have been successfully applied to a variety of domains and particularly in evolutionary robotics. However, MAP-Elites performs a divergent search based on random mutations originating from Genetic Algorithms, and thus, is limited to evolving populations of low-dimensional solutions. PGA-MAP-Elites overcomes this limitation by integrating a gradient-based variation operator inspired by Deep Reinforcement Learning which enables the evolution of large neural networks. Although high-performing in many environments, PGA-MAP-Elites fails on several tasks where the convergent search of the gradient-based operator does not direct mutations towards archive-improving solutions. In this work, we present two contributions: (1) we enhance the Policy Gradient variation operator with a descriptor-conditioned critic that improves the archive across the entire descriptor space, (2) we exploit the actor-critic training to learn a descriptor-conditioned policy at no additional cost, distilling the knowledge of the archive into one single versatile policy that can execute the entire range of behaviors contained in the archive. Our algorithm, DCG-MAP-Elites improves the QD score over PGA-MAP-Elites by 82% on average, on a set of challenging locomotion tasks.

Comments:	Under review at GECCO 2023
Subjects:	Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2303.03832 [cs.NE]
	(or arXiv:2303.03832v1 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.2303.03832

Submission history

From: Maxence Faldor [view email]
[v1] Tue, 7 Mar 2023 11:58:01 UTC (6,489 KB)

Computer Science > Neural and Evolutionary Computing

Title:MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators