Computer Science > Machine Learning

arXiv:2209.11908 (cs)

[Submitted on 24 Sep 2022 (v1), last revised 12 Apr 2023 (this version, v7)]

Title:Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations

Authors:Letian Chen, Sravan Jayanthi, Rohan Paleja, Daniel Martin, Viacheslav Zakharov, Matthew Gombolay

View PDF

Abstract:Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. However, current LfD frameworks are not capable of fast adaptation to heterogeneous human demonstrations nor the large-scale deployment in ubiquitous robotics applications. In this paper, we propose a novel LfD framework, Fast Lifelong Adaptive Inverse Reinforcement learning (FLAIR). Our approach (1) leverages learned strategies to construct policy mixtures for fast adaptation to new demonstrations, allowing for quick end-user personalization, (2) distills common knowledge across demonstrations, achieving accurate task inference; and (3) expands its model only when needed in lifelong deployments, maintaining a concise set of prototypical strategies that can approximate all behaviors via policy mixtures. We empirically validate that FLAIR achieves adaptability (i.e., the robot adapts to heterogeneous, user-specific task preferences), efficiency (i.e., the robot achieves sample-efficient adaptation), and scalability (i.e., the model grows sublinearly with the number of demonstrations while maintaining high performance). FLAIR surpasses benchmarks across three control tasks with an average 57% improvement in policy returns and an average 78% fewer episodes required for demonstration modeling using policy mixtures. Finally, we demonstrate the success of FLAIR in a table tennis task and find users rate FLAIR as having higher task (p<.05) and personalization (p<.05) performance.

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2209.11908 [cs.LG]
	(or arXiv:2209.11908v7 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.11908
Journal reference:	Proceedings of Conference on Robot Learning (CoRL) 2022

Submission history

From: Letian Chen [view email]
[v1] Sat, 24 Sep 2022 02:48:02 UTC (2,297 KB)
[v2] Wed, 16 Nov 2022 21:22:08 UTC (2,879 KB)
[v3] Sat, 19 Nov 2022 02:46:08 UTC (4,901 KB)
[v4] Tue, 6 Dec 2022 23:07:50 UTC (4,919 KB)
[v5] Wed, 29 Mar 2023 09:22:21 UTC (4,919 KB)
[v6] Thu, 6 Apr 2023 19:36:13 UTC (4,920 KB)
[v7] Wed, 12 Apr 2023 14:19:36 UTC (4,920 KB)

Computer Science > Machine Learning

Title:Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators