arXiv:2305.15244 (cs)

[Submitted on 24 May 2023 (v1), last revised 15 Feb 2024 (this version, v4)]

Title: Neural Lyapunov and Optimal Control

Authors: Daniel Layeghi, Steve Tonneau, Michael Mistry

Abstract: Despite impressive results, reinforcement learning (RL) suffers from slow convergence and requires a large variety of tuning strategies. In this paper, we investigate the performance of RL algorithms on simple continuous control tasks. We show that, without reward and environment tuning, RL suffers from poor convergence. In turn, we introduce an optimal control (OC) theoretic, learning-based method that solves the same problems robustly with simple, parsimonious costs. We use the Hamilton-Jacobi-Bellman (HJB) equation and first-order gradients to learn optimal time-varying value functions and, therefore, policies. We show that a relaxation of our objective results in time-varying Lyapunov functions, further verifying our approach by providing guarantees over a compact set of initial conditions. We compare our method to Soft Actor Critic (SAC) and Proximal Policy Optimisation (PPO). In this comparison, we solve all tasks, we never underperform in task cost, and, at the point of our convergence, we outperform SAC and PPO in the best case by 4 and 2 orders of magnitude, respectively.
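
For context, the quantities named in the abstract can be sketched in standard textbook form. This is a generic finite-horizon formulation, not necessarily the paper's exact objective: the dynamics f, running cost ℓ, terminal cost ℓ_T, horizon T, and value function V are assumed notation that does not appear in the abstract. The first line is the HJB equation with its terminal condition, the second is the policy it induces, and the third is the kind of inequality obtained by relaxing the HJB equality, which makes V decrease along closed-loop trajectories, as a time-varying Lyapunov function does.

```latex
% Assumed notation (not from the abstract): dynamics \dot{x} = f(x,u),
% running cost \ell(x,u) \ge 0, terminal cost \ell_T(x), horizon T.
\begin{align}
-\frac{\partial V}{\partial t}(x,t)
  &= \min_{u}\Big[\ell(x,u) + \nabla_x V(x,t)^{\top} f(x,u)\Big],
  \qquad V(x,T) = \ell_T(x), \\
u^{*}(x,t)
  &= \arg\min_{u}\Big[\ell(x,u) + \nabla_x V(x,t)^{\top} f(x,u)\Big], \\
\frac{\partial V}{\partial t}(x,t) + \nabla_x V(x,t)^{\top} f\big(x,u(x,t)\big)
  &\le -\ell\big(x,u(x,t)\big) \le 0 .
\end{align}
```

The left-hand side of the last line is the total time derivative of V along a trajectory of the closed-loop system, so the inequality states that V does not increase over time.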

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2305.15244 [cs.RO]
	(or arXiv:2305.15244v4 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2305.15244

Submission history

From: Daniel Layeghi
[v1] Wed, 24 May 2023 15:29:59 UTC (850 KB)
[v2] Mon, 18 Sep 2023 13:59:35 UTC (3,336 KB)
[v3] Mon, 5 Feb 2024 12:11:31 UTC (3,433 KB)
[v4] Thu, 15 Feb 2024 11:08:44 UTC (3,433 KB)
