Xie et al., 2018 - Google Patents

Feedback control for cassie with deep reinforcement learning

Document ID
15536684406070768647
Author
Xie Z
Berseth G
Clary P
Hurst J
Van de Panne M
Publication year
2018
Publication venue
2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Snippet

Bipedal locomotion skills are challenging to develop. Control strategies often use local linearization of the dynamics in conjunction with reduced-order abstractions to yield tractable solutions. In these model-based control strategies, the controller is often not fully …

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/08 Learning methods
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/04 Architectures, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass
    • G06N99/005 Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00 Program-control systems
    • G05B2219/30 Nc systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50 Computer-aided design

Similar Documents

Publication Title
Xie et al. Feedback control for cassie with deep reinforcement learning
Gangapurwala et al. Rloc: Terrain-aware legged locomotion using reinforcement learning and optimal control
Yang et al. Data efficient reinforcement learning for legged robots
Escontrela et al. Adversarial motion priors make good substitutes for complex reward functions
Tsounis et al. Deepgait: Planning and control of quadrupedal gaits using deep reinforcement learning
Ha et al. Learning to walk in the real world with minimal human effort
Castillo et al. Hybrid zero dynamics inspired feedback control policy design for 3d bipedal locomotion using reinforcement learning
Felis et al. Synthesis of full-body 3-d human gait using optimal control methods
Bledt Regularized predictive control framework for robust dynamic legged locomotion
Park et al. Inverse optimal control for humanoid locomotion
Castillo et al. Reinforcement learning-based cascade motion policy design for robust 3d bipedal locomotion
Huang et al. Reward-adaptive reinforcement learning: Dynamic policy gradient optimization for bipedal locomotion
Marcucci et al. A two-stage trajectory optimization strategy for articulated bodies with unscheduled contact sequences
Castillo et al. Reinforcement learning meets hybrid zero dynamics: A case study for rabbit
Atmeh et al. Implementation of an adaptive, model free, learning controller on the Atlas robot
Kang et al. Animal gaits on quadrupedal robots using motion matching and model-based control
Huang et al. Diffuseloco: Real-time legged locomotion control with diffusion from offline datasets
Levine Motor skill learning with local trajectory methods
Bravo-Palacios et al. Robust co-design: Coupling morphology and feedback design through stochastic programming
Allen et al. On the beat! timing and tension for dynamic characters
Hu et al. An overview on bipedal gait control methods
Ordonez-Apraez et al. An adaptable approach to learn realistic legged locomotion without examples
Gu et al. Humanoid locomotion and manipulation: Current progress and challenges in control, planning, and learning
Felis et al. Using optimal control methods to generate human walking motions
Bhat et al. Towards a learnt neural body schema for dexterous coordination of action in humanoid and industrial robots