Morales et al., 2021 - Google Patents

A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning

Morales et al., 2021

Document ID: 17229616862935592127
Author: Morales E; Murrieta-Cid R; Becerra I; Esquivel-Basaldua M
Publication year: 2021
Publication venue: Intelligent Service Robotics

External Links

Cited by

Snippet

This article is about deep learning (DL) and deep reinforcement learning (DRL) works applied to robotics. Both tools have been shown to be successful in delivering data-driven solutions for robotics tasks, as well as providing a natural way to develop an end-to-end …

Continue reading at link.springer.com (other versions)

238000004805 robotic 0 title abstract description 80

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/008—Artificial life, i.e. computers simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. robots replicating pets or humans in their appearance or behavior
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0255—Control of position or course in two dimensions specially adapted to land vehicles using acoustic signals, e.g. ultra-sonic singals
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/02—Control of position or course in two dimensions
- G05D1/021—Control of position or course in two dimensions specially adapted to land vehicles
- G05D1/0212—Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory

Similar Documents

Publication	Publication Date	Title
Morales et al.	2021	A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Garaffa et al.	2021	Reinforcement learning for mobile robotics exploration: A survey
Xiao et al.	2022	Motion planning and control for mobile robot navigation using machine learning: a survey
Ibarz et al.	2021	How to train your robot with deep reinforcement learning: lessons we have learned
Qureshi et al.	2019	Motion planning networks
Polydoros et al.	2017	Survey of model-based reinforcement learning: Applications on robotics
Tai et al.	2016	A survey of deep network solutions for learning control in robotics: From reinforcement to imitation
Chen et al.	2020	Stabilization approaches for reinforcement learning-based end-to-end autonomous driving
Amarjyoti	2017	Deep reinforcement learning for robotic manipulation-the state of the art
Chen et al.	2020	Driving maneuvers prediction based autonomous driving control by deep Monte Carlo tree search
Cai et al.	2019	Lets-drive: Driving in a crowd by learning from tree search
Chaffre et al.	2020	Sim-to-real transfer with incremental environment complexity for reinforcement learning of depth-based robot navigation
Katyal et al.	2021	High-speed robot navigation using predicted occupancy maps
Fan et al.	2020	Learning resilient behaviors for navigation under uncertainty
Polevoy et al.	2022	Complex terrain navigation via model error prediction
Tavassoli et al.	2023	Learning skills from demonstrations: A trend from motion primitives to experience abstraction
Sun et al.	2023	Event-triggered reconfigurable reinforcement learning motion-planning approach for mobile robot in unknown dynamic environments
Dwivedi et al.	2022	Continuous control of autonomous vehicles using plan-assisted deep reinforcement learning
Abbatematteo et al.	2021	Bootstrapping motor skill learning with motion planning
Cheng	2020	Efficient and principled robot learning: theory and algorithms
Ruud	2023	Reinforcement learning with the TIAGo research robot: manipulator arm control with actor-critic reinforcement learning
Zhang et al.	2023	A Review on Robot Manipulation Methods in Human-Robot Interactions
Chatzilygeroudis	2018	Micro-data reinforcement learning for adaptive robots
Brandonisio	2023	AI-based guidance for spacecraft proximity operations around uncooperative targets
Iskandar et al.	2023	Using Deep Reinforcement Learning to Solve a Navigation Problem for a Swarm Robotics System