Li et al., 2021 - Google Patents
Planning in learned latent action spaces for generalizable legged locomotionLi et al., 2021
View PDF- Document ID
- 9621077624110067208
- Author
- Li T
- Calandra R
- Pathak D
- Tian Y
- Meier F
- Rai A
- Publication year
- Publication venue
- IEEE Robotics and Automation Letters
External Links
Snippet
Hierarchical learning has been successful at learning generalizable locomotion skills on walking robots in a sample-efficient manner. However, the low-dimensional “latent” action used to communicate between two layers of the hierarchy is typically user-designed. In this …
- 230000033001 locomotion 0 title abstract description 32
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/048—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators using a predictor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/008—Artificial life, i.e. computers simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. robots replicating pets or humans in their appearance or behavior
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/002—Quantum computers, i.e. information processing by using quantum superposition, coherence, decoherence, entanglement, nonlocality, teleportation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Li et al. | Planning in learned latent action spaces for generalizable legged locomotion | |
Gangapurwala et al. | Rloc: Terrain-aware legged locomotion using reinforcement learning and optimal control | |
Shi et al. | Reinforcement learning with evolutionary trajectory generator: A general approach for quadrupedal locomotion | |
Thor et al. | Generic neural locomotion control framework for legged robots | |
Erez et al. | An integrated system for real-time model predictive control of humanoid robots | |
Li et al. | Learning generalizable locomotion skills with hierarchical reinforcement learning | |
Ouyang et al. | Adaptive locomotion control of a hexapod robot via bio-inspired learning | |
Peters et al. | Robot learning | |
Bjelonic et al. | Learning-based design and control for quadrupedal robots with parallel-elastic actuators | |
CN113093779B (en) | Robot motion control method and system based on deep reinforcement learning | |
Wang et al. | CPG-based hierarchical locomotion control for modular quadrupedal robots using deep reinforcement learning | |
Lembono et al. | Learning how to walk: Warm-starting optimal control solver with memory of motion | |
Prakash et al. | Dynamic trajectory generation and a robust controller to intercept a moving ball in a game setting | |
Yin et al. | Learning nonlinear dynamical system for movement primitives | |
Ota et al. | Data-efficient learning for complex and real-time physical problem solving using augmented simulation | |
Viereck et al. | Learning a centroidal motion planner for legged locomotion | |
Zhang et al. | Whole-body humanoid robot locomotion with human reference | |
Viereck et al. | Learning a structured neural network policy for a hopping task | |
Zhang et al. | Leveraging imitation learning on pose regulation problem of a robotic fish | |
Ding et al. | Robust jumping with an articulated soft quadruped via trajectory optimization and iterative learning | |
Hu et al. | An overview on bipedal gait control methods | |
Lee et al. | Control of wheeled-legged quadrupeds using deep reinforcement learning | |
Schperberg et al. | Real-to-Sim: Predicting Residual Errors of Robotic Systems with Sparse Data using a Learning-based Unscented Kalman Filter | |
Bao et al. | Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey | |
Yeom et al. | A dynamic gait stabilization algorithm for quadrupedal locomotion through contact time modulation |