van der Heijden et al., 2021 - Google Patents

DeepKoCo: Efficient latent planning with a task-relevant Koopman representation

Document ID: 11114127324362841368
Author: van der Heijden B; Ferranti L; Kober J; Babuška R
Publication year: 2021
Publication venue: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Snippet

This paper presents DeepKoCo, a novel model-based agent that learns a latent Koopman representation from images. This representation allows DeepKoCo to plan efficiently using linear control methods, such as linear model predictive control. Compared to traditional …

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models

Similar Documents

Publication	Publication Date	Title
Lütjens et al.	2019	Safe reinforcement learning with model uncertainty estimates
Li et al.	2019	Propagation networks for model-based control under partial observation
Laskey et al.	2017	DART: Noise injection for robust imitation learning
US20200348630A1 (en)	2020-11-05	Empirical modeling with globally enforced general constraints
Englert et al.	2016	Combined Optimization and Reinforcement Learning for Manipulation Skills
Xu et al.	2023	Benchmarking reinforcement learning techniques for autonomous navigation
Lambert et al.	2021	Learning accurate long-term dynamics for model-based reinforcement learning
Wu et al.	2012	Semi-parametric Gaussian process for robot system identification
Kortvelesy et al.	2021	ModGNN: Expert policy approximation in multi-agent systems with a modular graph neural network architecture
Beckers et al.	2017	Stable Gaussian process based tracking control of Lagrangian systems
Mullins et al.	2018	Accelerated testing and evaluation of autonomous vehicles via imitation learning
Lee et al.	2018	Safe end-to-end imitation learning for model predictive control
van der Heijden et al.	2021	DeepKoCo: Efficient latent planning with a task-relevant Koopman representation
Baert et al.	2023	Maximum causal entropy inverse constrained reinforcement learning
Tekden et al.	2024	Object and relation centric representations for push effect prediction
Allevato et al.	2020	Iterative residual tuning for system identification and sim-to-real robot learning
Possas et al.	2020	Online BayesSim for combined simulator parameter inference and policy improvement
Lafmejani et al.	2022	NMPC-LBF: Nonlinear MPC with learned barrier function for decentralized safe navigation of multiple robots in unknown environments
Kaushik et al.	2022	SafeAPT: Safe simulation-to-real robot learning using diverse policies learned in simulation
Ruano et al.	2005	An overview of nonlinear identification and control with neural networks
Whitman et al.	2020	Modular mobile robot design selection with deep reinforcement learning
Findik et al.	2023	Influence of team interactions on multi-robot cooperation: A relational network perspective
van der Heijden et al.	2021	DeepKoCo: Efficient latent planning with an invariant Koopman representation
Lee et al.	2019	Early failure detection of deep end-to-end control policy by reinforcement learning
Agarwal et al.	2023	Synthesizing adversarial visual scenarios for model-based robotic control