Cobo et al., 2014

Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains

Document ID: 14215197176102428054
Author: Cobo L; Subramanian K; Isbell Jr C; Lanterman A; Thomaz A
Publication year: 2014
Publication venue: Artificial Intelligence

Snippet

Reinforcement learning (RL) and learning from demonstration (LfD) are two popular families of algorithms for learning policies for sequential decision problems, but they are often ineffective in high-dimensional domains unless provided with either a great deal of problem …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/006—Artificial life, i.e. computers simulating life based on simulated virtual individual or collective life forms, e.g. single "avatar", social simulations, virtual worlds
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management

Similar Documents

Publication	Publication Date	Title
Matiisen et al.	2019	Teacher–student curriculum learning
Hamrick et al.	2020	On the role of planning in model-based deep reinforcement learning
Ding et al.	2020	Challenges of reinforcement learning
Jurgenson et al.	2019	Harnessing reinforcement learning for neural motion planning
Hamrick et al.	2019	Combining q-learning and search with amortized value estimates
Dadashi et al.	2021	Continuous control with action quantization from demonstrations
Ghosh et al.	2019	Learning to reach goals without reinforcement learning
Ngo et al.	2013	Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
Subramanian et al.	2022	Multi-agent advisor Q-learning
He et al.	2022	Continuous neural algorithmic planners
Jaegle et al.	2021	Imitation by predicting observations
Hasselmann et al.	2023	Automatic modular design of robot swarms based on repertoires of behaviors generated via novelty search
Olesen et al.	2021	Evolutionary planning in latent space
Hafez et al.	2023	Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcement learning
Duan	2017	Meta learning for control
Sapora et al.	2024	EvIL: Evolution Strategies for Generalisable Imitation Learning
Stuhlmüller	2015	Modeling cognition with probabilistic programs: representations and algorithms
Togelius et al.	2023	Evolutionary Machine Learning and Games
Ge	2018	Solving planning problems with deep reinforcement learning and tree search
Matthews et al.	2020	Crowd grounding: finding semantic and behavioral alignment through human robot interaction
Bakhmadov et al.	2020	Combining Reinforcement Learning and Unreal Engine's AI-tools to Create Intelligent Bots
Grattarola	2017	Deep Feature Extraction for Sample-Efficient Reinforcement Learning
Kelly et al.	2023	Evolutionary Computation and the Reinforcement Learning Problem
Vignesh Kumar et al.	2020	Fitness Function Design for Neuroevolution in Goal-Finding Game Environments