van der Heijden et al., 2021 - Google Patents

DeepKoCo: Efficient latent planning with a task-relevant Koopman representation

Document ID
11114127324362841368
Author
van der Heijden B
Ferranti L
Kober J
Babuška R
Publication year
2021
Publication venue
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Snippet

This paper presents DeepKoCo, a novel model-based agent that learns a latent Koopman representation from images. This representation allows DeepKoCo to plan efficiently using linear control methods, such as linear model predictive control. Compared to traditional …

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass
    • G06N99/005 Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/04 Architectures, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computer systems utilising knowledge based models
    • G06N5/04 Inference methods or devices
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B17/00 Systems involving the use of models or simulators of said systems
    • G05B17/02 Systems involving the use of models or simulators of said systems electric
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00 Program-control systems
    • G05B2219/30 Nc systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/12 Computer systems based on biological models using genetic models
    • G06N3/126 Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computer systems based on specific mathematical models

Similar Documents

Publication Publication Date Title
Lütjens et al. Safe reinforcement learning with model uncertainty estimates
Li et al. Propagation networks for model-based control under partial observation
Laskey et al. DART: Noise injection for robust imitation learning
US20200348630A1 (en) Empirical modeling with globally enforced general constraints
Englert et al. Combined Optimization and Reinforcement Learning for Manipulation Skills
Xu et al. Benchmarking reinforcement learning techniques for autonomous navigation
Lambert et al. Learning accurate long-term dynamics for model-based reinforcement learning
Wu et al. Semi-parametric Gaussian process for robot system identification
Kortvelesy et al. ModGNN: Expert policy approximation in multi-agent systems with a modular graph neural network architecture
Beckers et al. Stable Gaussian process based tracking control of Lagrangian systems
Mullins et al. Accelerated testing and evaluation of autonomous vehicles via imitation learning
Lee et al. Safe end-to-end imitation learning for model predictive control
van der Heijden et al. DeepKoCo: Efficient latent planning with a task-relevant Koopman representation
Baert et al. Maximum causal entropy inverse constrained reinforcement learning
Tekden et al. Object and relation centric representations for push effect prediction
Allevato et al. Iterative residual tuning for system identification and sim-to-real robot learning
Possas et al. Online BayesSim for combined simulator parameter inference and policy improvement
Lafmejani et al. NMPC-LBF: Nonlinear MPC with learned barrier function for decentralized safe navigation of multiple robots in unknown environments
Kaushik et al. SafeAPT: Safe simulation-to-real robot learning using diverse policies learned in simulation
Ruano et al. An overview of nonlinear identification and control with neural networks
Whitman et al. Modular mobile robot design selection with deep reinforcement learning
Findik et al. Influence of team interactions on multi-robot cooperation: A relational network perspective
van der Heijden et al. DeepKoCo: Efficient latent planning with an invariant Koopman representation
Lee et al. Early failure detection of deep end-to-end control policy by reinforcement learning
Agarwal et al. Synthesizing adversarial visual scenarios for model-based robotic control