van der Heijden et al., 2021 - Google Patents
DeepKoCo: Efficient latent planning with a task-relevant Koopman representationvan der Heijden et al., 2021
View PDF- Document ID
- 11114127324362841368
- Author
- van der Heijden B
- Ferranti L
- Kober J
- Babuška R
- Publication year
- Publication venue
- 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
External Links
Snippet
This paper presents DeepKoCo, a novel modelbased agent that learns a latent Koopman representation from images. This representation allows DeepKoCo to plan efficiently using linear control methods, such as linear model predictive control. Compared to traditional …
- 239000003795 chemical substances by application 0 description 17
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Lütjens et al. | Safe reinforcement learning with model uncertainty estimates | |
Li et al. | Propagation networks for model-based control under partial observation | |
Laskey et al. | Dart: Noise injection for robust imitation learning | |
US20200348630A1 (en) | Empirical modeling with globally enforced general constraints | |
Englert et al. | Combined Optimization and Reinforcement Learning for Manipulation Skills. | |
Xu et al. | Benchmarking reinforcement learning techniques for autonomous navigation | |
Lambert et al. | Learning accurate long-term dynamics for model-based reinforcement learning | |
Wu et al. | Semi-parametric Gaussian process for robot system identification | |
Kortvelesy et al. | ModGNN: Expert policy approximation in multi-agent systems with a modular graph neural network architecture | |
Beckers et al. | Stable Gaussian process based tracking control of Lagrangian systems | |
Mullins et al. | Accelerated testing and evaluation of autonomous vehicles via imitation learning | |
Lee et al. | Safe end-to-end imitation learning for model predictive control | |
van der Heijden et al. | DeepKoCo: Efficient latent planning with a task-relevant Koopman representation | |
Baert et al. | Maximum causal entropy inverse constrained reinforcement learning | |
Tekden et al. | Object and relation centric representations for push effect prediction | |
Allevato et al. | Iterative residual tuning for system identification and sim-to-real robot learning | |
Possas et al. | Online bayessim for combined simulator parameter inference and policy improvement | |
Lafmejani et al. | Nmpc-lbf: Nonlinear mpc with learned barrier function for decentralized safe navigation of multiple robots in unknown environments | |
Kaushik et al. | Safeapt: Safe simulation-to-real robot learning using diverse policies learned in simulation | |
Ruano et al. | An overview of nonlinear identification and control with neural networks | |
Whitman et al. | Modular mobile robot design selection with deep reinforcement learning | |
Findik et al. | Influence of team interactions on multi-robot cooperation: A relational network perspective | |
van der Heijden et al. | DeepKoCo: Efficient latent planning with an invariant Koopman representation | |
Lee et al. | Early failure detection of deep end-to-end control policy by reinforcement learning | |
Agarwal et al. | Synthesizing adversarial visual scenarios for model-based robotic control |