[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Cobo et al., 2014 - Google Patents

Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains

Cobo et al., 2014

View HTML
Document ID
14215197176102428054
Author
Cobo L
Subramanian K
Isbell Jr C
Lanterman A
Thomaz A
Publication year
Publication venue
Artificial Intelligence

External Links

Snippet

Reinforcement learning (RL) and learning from demonstration (LfD) are two popular families of algorithms for learning policies for sequential decision problems, but they are often ineffective in high-dimensional domains unless provided with either a great deal of problem …
Continue reading at www.sciencedirect.com (HTML) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/12Computer systems based on biological models using genetic models
    • G06N3/126Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/004Artificial life, i.e. computers simulating life
    • G06N3/006Artificial life, i.e. computers simulating life based on simulated virtual individual or collective life forms, e.g. single "avatar", social simulations, virtual worlds
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models
    • G06N7/005Probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/18Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management

Similar Documents

Publication Publication Date Title
Matiisen et al. Teacher–student curriculum learning
Cobo et al. Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains
Hamrick et al. On the role of planning in model-based deep reinforcement learning
Ding et al. Challenges of reinforcement learning
Jurgenson et al. Harnessing reinforcement learning for neural motion planning
Hamrick et al. Combining q-learning and search with amortized value estimates
Dadashi et al. Continuous control with action quantization from demonstrations
Ghosh et al. Learning to reach goals without reinforcement learning
Ngo et al. Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots
Subramanian et al. Multi-agent advisor Q-learning
He et al. Continuous neural algorithmic planners
Jaegle et al. Imitation by predicting observations
Hasselmann et al. Automatic modular design of robot swarms based on repertoires of behaviors generated via novelty search
Olesen et al. Evolutionary planning in latent space
Hafez et al. Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcement learning
Duan Meta learning for control
Sapora et al. EvIL: Evolution Strategies for Generalisable Imitation Learning
Stuhlmüller Modeling cognition with probabilistic programs: representations and algorithms
Togelius et al. Evolutionary Machine Learning and Games
Ge Solving planning problems with deep reinforcement learning and tree search
Matthews et al. Crowd grounding: finding semantic and behavioral alignment through human robot interaction.
Bakhmadov et al. Combining Reinforcement Learning and Unreal Engine's AI-tools to Create Intelligent Bots
Grattarola Deep Feature Extraction for Sample-Efficient Reinforcement Learning
Kelly et al. Evolutionary Computation and the Reinforcement Learning Problem
Vignesh Kumar et al. Fitness Function Design for Neuroevolution in Goal-Finding Game Environments