Feedback control for Cassie with deep reinforcement learning
Xie et al., 2018
- Document ID
- 15536684406070768647
- Author
- Xie Z
- Berseth G
- Clary P
- Hurst J
- Van de Panne M
- Publication year
- 2018
- Publication venue
- 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Snippet
Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—NC systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Xie et al. | Feedback control for Cassie with deep reinforcement learning | |
Gangapurwala et al. | RLOC: Terrain-aware legged locomotion using reinforcement learning and optimal control | |
Yang et al. | Data efficient reinforcement learning for legged robots | |
Escontrela et al. | Adversarial motion priors make good substitutes for complex reward functions | |
Tsounis et al. | DeepGait: Planning and control of quadrupedal gaits using deep reinforcement learning | |
Ha et al. | Learning to walk in the real world with minimal human effort | |
Castillo et al. | Hybrid zero dynamics inspired feedback control policy design for 3D bipedal locomotion using reinforcement learning | |
Felis et al. | Synthesis of full-body 3-D human gait using optimal control methods | |
Bledt | Regularized predictive control framework for robust dynamic legged locomotion | |
Park et al. | Inverse optimal control for humanoid locomotion | |
Castillo et al. | Reinforcement learning-based cascade motion policy design for robust 3D bipedal locomotion | |
Huang et al. | Reward-adaptive reinforcement learning: Dynamic policy gradient optimization for bipedal locomotion | |
Marcucci et al. | A two-stage trajectory optimization strategy for articulated bodies with unscheduled contact sequences | |
Castillo et al. | Reinforcement learning meets hybrid zero dynamics: A case study for RABBIT | |
Atmeh et al. | Implementation of an adaptive, model free, learning controller on the Atlas robot | |
Kang et al. | Animal gaits on quadrupedal robots using motion matching and model-based control | |
Huang et al. | DiffuseLoco: Real-time legged locomotion control with diffusion from offline datasets | |
Levine | Motor skill learning with local trajectory methods | |
Bravo-Palacios et al. | Robust co-design: Coupling morphology and feedback design through stochastic programming | |
Allen et al. | On the beat! Timing and tension for dynamic characters | |
Hu et al. | An overview on bipedal gait control methods | |
Ordonez-Apraez et al. | An adaptable approach to learn realistic legged locomotion without examples | |
Gu et al. | Humanoid locomotion and manipulation: Current progress and challenges in control, planning, and learning | |
Felis et al. | Using optimal control methods to generate human walking motions | |
Bhat et al. | Towards a learnt neural body schema for dexterous coordination of action in humanoid and industrial robots |