Li et al., 2021 - Google Patents

Planning in learned latent action spaces for generalizable legged locomotion

Document ID: 9621077624110067208
Author: Li T; Calandra R; Pathak D; Tian Y; Meier F; Rai A
Publication year: 2021
Publication venue: IEEE Robotics and Automation Letters

Snippet

Hierarchical learning has been successful at learning generalizable locomotion skills on walking robots in a sample-efficient manner. However, the low-dimensional “latent” action used to communicate between two layers of the hierarchy is typically user-designed. In this …

Classifications

- G—PHYSICS
  - G05—CONTROLLING; REGULATING
    - G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
      - G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
        - G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
          - G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
            - G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
          - G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
            - G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
            - G05B13/048—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators using a predictor
      - G05B17/00—Systems involving the use of models or simulators of said systems
        - G05B17/02—Systems involving the use of models or simulators of said systems electric
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
      - G06N3/00—Computer systems based on biological models
        - G06N3/02—Computer systems based on biological models using neural network models
          - G06N3/08—Learning methods
          - G06N3/04—Architectures, e.g. interconnection topology
        - G06N3/004—Artificial life, i.e. computers simulating life
          - G06N3/008—Artificial life, i.e. computers simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. robots replicating pets or humans in their appearance or behavior
      - G06N99/00—Subject matter not provided for in other groups of this subclass
        - G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
        - G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation

Similar Documents

Publication	Publication Date	Title
Li et al.	2021	Planning in learned latent action spaces for generalizable legged locomotion
Gangapurwala et al.	2022	RLOC: Terrain-aware legged locomotion using reinforcement learning and optimal control
Shi et al.	2022	Reinforcement learning with evolutionary trajectory generator: A general approach for quadrupedal locomotion
Thor et al.	2020	Generic neural locomotion control framework for legged robots
Erez et al.	2013	An integrated system for real-time model predictive control of humanoid robots
Li et al.	2020	Learning generalizable locomotion skills with hierarchical reinforcement learning
Ouyang et al.	2021	Adaptive locomotion control of a hexapod robot via bio-inspired learning
Peters et al.	2016	Robot learning
Bjelonic et al.	2023	Learning-based design and control for quadrupedal robots with parallel-elastic actuators
CN113093779B (en)	2022-06-07	Robot motion control method and system based on deep reinforcement learning
Wang et al.	2021	CPG-based hierarchical locomotion control for modular quadrupedal robots using deep reinforcement learning
Lembono et al.	2020	Learning how to walk: Warm-starting optimal control solver with memory of motion
Prakash et al.	2019	Dynamic trajectory generation and a robust controller to intercept a moving ball in a game setting
Yin et al.	2014	Learning nonlinear dynamical system for movement primitives
Ota et al.	2021	Data-efficient learning for complex and real-time physical problem solving using augmented simulation
Viereck et al.	2021	Learning a centroidal motion planner for legged locomotion
Zhang et al.	2024	Whole-body humanoid robot locomotion with human reference
Viereck et al.	2018	Learning a structured neural network policy for a hopping task
Zhang et al.	2022	Leveraging imitation learning on pose regulation problem of a robotic fish
Ding et al.	2023	Robust jumping with an articulated soft quadruped via trajectory optimization and iterative learning
Hu et al.	2023	An overview on bipedal gait control methods
Lee et al.	2022	Control of wheeled-legged quadrupeds using deep reinforcement learning
Schperberg et al.	2023	Real-to-Sim: Predicting Residual Errors of Robotic Systems with Sparse Data using a Learning-based Unscented Kalman Filter
Bao et al.	2024	Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey
Yeom et al.	2021	A dynamic gait stabilization algorithm for quadrupedal locomotion through contact time modulation