Computer Science > Machine Learning

arXiv:2306.08388 (cs)

[Submitted on 14 Jun 2023 (v1), last revised 12 Jul 2024 (this version, v3)]

Title: Skill-Critic: Refining Learned Skills for Hierarchical Reinforcement Learning

Authors: Ce Hao, Catherine Weaver, Chen Tang, Kenta Kawamoto, Masayoshi Tomizuka, Wei Zhan

Abstract: Hierarchical reinforcement learning (RL) can accelerate long-horizon decision-making by temporally abstracting a policy into multiple levels. Promising results in sparse-reward environments have been seen with skills, i.e., sequences of primitive actions. Typically, a skill latent space and policy are discovered from offline data. However, the resulting low-level policy can be unreliable due to low-coverage demonstrations or distribution shifts. As a solution, we propose the Skill-Critic algorithm to fine-tune the low-level policy in conjunction with high-level skill selection. Our Skill-Critic algorithm optimizes both the low-level and high-level policies; these policies are initialized and regularized by the latent space learned from offline demonstrations to guide the parallel policy optimization. We validate Skill-Critic in multiple sparse-reward RL environments, including a new sparse-reward autonomous racing task in Gran Turismo Sport. The experiments show that Skill-Critic's low-level policy fine-tuning and demonstration-guided regularization are essential for good performance. Code and videos are available at our website: this https URL.
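
The temporal abstraction described in the abstract can be illustrated with a minimal sketch: a high-level policy picks a skill, and a low-level policy turns that skill into primitive actions. This is not the Skill-Critic implementation. Every name below (`hierarchical_rollout`, `toy_high`, `toy_low`, `toy_step`) is hypothetical, the fixed `skill_horizon` is an assumption, and the toy policies are hand-written rather than learned from offline data.

```python
from typing import Callable, List, Optional, Tuple

State = float
Skill = int
Action = float


def hierarchical_rollout(
    high_level_policy: Callable[[State], Skill],
    low_level_policy: Callable[[State, Skill], Action],
    step: Callable[[State, Action], State],
    initial_state: State,
    num_steps: int,
    skill_horizon: int,
) -> Tuple[List[Skill], List[Action], State]:
    """Roll out a two-level policy.

    Assumption: a new skill is selected every `skill_horizon` steps and
    held fixed in between, so each skill spans a sequence of actions.
    """
    state = initial_state
    skills: List[Skill] = []
    actions: List[Action] = []
    skill: Optional[Skill] = None
    for t in range(num_steps):
        if t % skill_horizon == 0:
            # High level: choose which skill to execute next.
            skill = high_level_policy(state)
            skills.append(skill)
        # Low level: map the current state and active skill to one action.
        action = low_level_policy(state, skill)
        actions.append(action)
        state = step(state, action)
    return skills, actions, state


# Toy 1-D environment: the state is a position, the action is a displacement.
def toy_high(state: State) -> Skill:
    return 1 if state < 10.0 else -1  # skill = direction of travel


def toy_low(state: State, skill: Skill) -> Action:
    return 0.5 * skill  # move half a unit in the skill's direction


def toy_step(state: State, action: Action) -> State:
    return state + action


skills, actions, final_state = hierarchical_rollout(
    toy_high, toy_low, toy_step, initial_state=0.0, num_steps=12, skill_horizon=4
)
```

With 12 steps and a horizon of 4, the high-level policy is queried three times while the low-level policy is queried at every step, which is the sense in which skill selection shortens the effective decision horizon.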

Comments:	Preprint
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.08388 [cs.LG]
	(or arXiv:2306.08388v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.08388

Submission history

From: Ce Hao
[v1] Wed, 14 Jun 2023 09:24:32 UTC (6,851 KB)
[v2] Fri, 16 Jun 2023 02:03:30 UTC (6,851 KB)
[v3] Fri, 12 Jul 2024 01:59:00 UTC (44,937 KB)
