Xie et al., 2018 - Google Patents

Feedback control for cassie with deep reinforcement learning

Xie et al., 2018

Document ID: 15536684406070768647
Author: Xie Z; Berseth G; Clary P; Hurst J; Van de Panne M
Publication year: 2018
Publication venue: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

External Links

Cited by

Snippet

Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully …

Continue reading at arxiv.org (PDF) (other versions)

235000010643 Leucaena leucocephala 0 title abstract description 19

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design

Similar Documents

Publication	Publication Date	Title
Xie et al.	2018	Feedback control for cassie with deep reinforcement learning
Gangapurwala et al.	2022	Rloc: Terrain-aware legged locomotion using reinforcement learning and optimal control
Yang et al.	2020	Data efficient reinforcement learning for legged robots
Escontrela et al.	2022	Adversarial motion priors make good substitutes for complex reward functions
Tsounis et al.	2020	Deepgait: Planning and control of quadrupedal gaits using deep reinforcement learning
Ha et al.	2020	Learning to walk in the real world with minimal human effort
Castillo et al.	2020	Hybrid zero dynamics inspired feedback control policy design for 3d bipedal locomotion using reinforcement learning
Felis et al.	2016	Synthesis of full-body 3-d human gait using optimal control methods
Bledt	2020	Regularized predictive control framework for robust dynamic legged locomotion
Park et al.	2013	Inverse optimal control for humanoid locomotion
Castillo et al.	2022	Reinforcement learning-based cascade motion policy design for robust 3d bipedal locomotion
Huang et al.	2022	Reward-adaptive reinforcement learning: Dynamic policy gradient optimization for bipedal locomotion
Marcucci et al.	2016	A two-stage trajectory optimization strategy for articulated bodies with unscheduled contact sequences
Castillo et al.	2019	Reinforcement learning meets hybrid zero dynamics: A case study for rabbit
Atmeh et al.	2014	Implementation of an adaptive, model free, learning controller on the Atlas robot
Kang et al.	2021	Animal gaits on quadrupedal robots using motion matching and model-based control
Huang et al.	2024	Diffuseloco: Real-time legged locomotion control with diffusion from offline datasets
Levine	2014	Motor skill learning with local trajectory methods
Bravo-Palacios et al.	2022	Robust co-design: Coupling morphology and feedback design through stochastic programming
Allen et al.	2007	On the beat! timing and tension for dynamic characters
Hu et al.	2023	An overview on bipedal gait control methods
Ordonez-Apraez et al.	2022	An adaptable approach to learn realistic legged locomotion without examples
Gu et al.	2025	Humanoid locomotion and manipulation: Current progress and challenges in control, planning, and learning
Felis et al.	2012	Using optimal control methods to generate human walking motions
Bhat et al.	2017	Towards a learnt neural body schema for dexterous coordination of action in humanoid and industrial robots