Computer Science > Machine Learning

arXiv:2301.11181 (cs)

[Submitted on 26 Jan 2023 (v1), last revised 9 Jun 2023 (this version, v2)]

Title:Deep Laplacian-based Options for Temporally-Extended Exploration

Authors:Martin Klissarov, Marlos C. Machado

View PDF

Abstract:Selecting exploratory actions that generate a rich stream of experience for better learning is a fundamental challenge in reinforcement learning (RL). An approach to tackle this problem consists in selecting actions according to specific policies for an extended period of time, also known as options. A recent line of work to derive such exploratory options builds upon the eigenfunctions of the graph Laplacian. Importantly, until now these methods have been mostly limited to tabular domains where (1) the graph Laplacian matrix was either given or could be fully estimated, (2) performing eigendecomposition on this matrix was computationally tractable, and (3) value functions could be learned exactly. Additionally, these methods required a separate option discovery phase. These assumptions are fundamentally not scalable. In this paper we address these limitations and show how recent results for directly approximating the eigenfunctions of the Laplacian can be leveraged to truly scale up options-based exploration. To do so, we introduce a fully online deep RL algorithm for discovering Laplacian-based options and evaluate our approach on a variety of pixel-based tasks. We compare to several state-of-the-art exploration methods and show that our approach is effective, general, and especially promising in non-stationary settings.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2301.11181 [cs.LG]
	(or arXiv:2301.11181v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.11181

Submission history

From: Martin Klissarov [view email]
[v1] Thu, 26 Jan 2023 15:45:39 UTC (4,740 KB)
[v2] Fri, 9 Jun 2023 16:33:08 UTC (9,956 KB)

Computer Science > Machine Learning

Title:Deep Laplacian-based Options for Temporally-Extended Exploration

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep Laplacian-based Options for Temporally-Extended Exploration

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators