Thabet et al., 2019 - Google Patents

Sample-efficient deep reinforcement learning with imaginary rollouts for human-robot interaction

Thabet et al., 2019

Document ID: 13531545060384721578
Author: Thabet M; Patacchiola M; Cangelosi A
Publication year: 2019
Publication venue: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

External Links

Cited by

Snippet

Deep reinforcement learning has proven to be a great success in allowing agents to learn complex tasks. However, its application to actual robots can be prohibitively expensive. Furthermore, the unpredictability of human behavior in human-robot interaction tasks can …

Continue reading at arxiv.org (PDF) (other versions)

230000003993 interaction 0 title abstract description 16

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems

Similar Documents

Publication	Publication Date	Title
Vecerik et al.	2017	Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
Karkus et al.	2019	Differentiable algorithm networks for composable robot learning
Mandlekar et al.	2020	Iris: Implicit reinforcement without interaction at scale for learning control from offline robot manipulation data
Wang et al.	2017	Robust imitation of diverse behaviors
US20210390653A1 (en)	2021-12-16	Learning robotic tasks using one or more neural networks
Rahmatizadeh et al.	2018	Vision-based multi-task manipulation for inexpensive robots using end-to-end learning from demonstration
Thabet et al.	2019	Sample-efficient deep reinforcement learning with imaginary rollouts for human-robot interaction
Ding et al.	2020	Challenges of reinforcement learning
Ren et al.	2021	Generalization guarantees for imitation learning
Cuccu et al.	2011	Intrinsically motivated neuroevolution for vision-based reinforcement learning
Hu et al.	2024	On Transforming Reinforcement Learning With Transformers: The Development Trajectory
CN113379027A (en)	2021-09-10	Method, system, storage medium and application for generating confrontation interactive simulation learning
Sakunthala et al.	2017	A review on artificial intelligence techniques in electrical drives: Neural networks, fuzzy logic, and genetic algorithm
Hafez et al.	2019	Efficient intrinsically motivated robotic grasping with learning-adaptive imagination in latent space
Tanwani	2018	Generative models for learning robot manipulation skills from humans
Seo et al.	2024	Continuous control with coarse-to-fine reinforcement learning
JP2021192141A (en)	2021-12-16	Learning device, learning method, and learning program
Dinerstein et al.	2008	Learning policies for embodied virtual agents through demonstration
Kobayashi et al.	2022	Latent representation in human–robot interaction with explicit consideration of periodic dynamics
Przystupa et al.	2023	Deep probabilistic movement primitives with a bayesian aggregator
Chien et al.	2020	Stochastic curiosity maximizing exploration
Lötzsch	2018	Using deep reinforcement learning for the continuous control of robotic arms
Li et al.	2016	A hierarchical reinforcement learning method for persistent time-sensitive tasks
Pong	2021	Goal-Directed Exploration and Skill Reuse
Chen et al.	2017	Imitating shortest paths for visual navigation with trajectory-aware deep reinforcement learning