Castillo et al., 2020 - Google Patents

Hybrid zero dynamics inspired feedback control policy design for 3D bipedal locomotion using reinforcement learning

Document ID
3633846451251984497
Author
Castillo G
Weng B
Zhang W
Hereid A
Publication year
2020
Publication venue
2020 IEEE International Conference on Robotics and Automation (ICRA)

Snippet

This paper presents a novel model-free reinforcement learning (RL) framework to design feedback control policies for 3D bipedal walking. Existing RL algorithms are often trained in an end-to-end manner or rely on prior knowledge of some reference joint trajectories …

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B62 LAND VEHICLES FOR TRAVELLING OTHERWISE THAN ON RAILS
    • B62D MOTOR VEHICLES; TRAILERS
    • B62D57/00 Vehicles characterised by having other propulsion or other ground-engaging means than wheels or endless track, alone or in addition to wheels or endless track
    • B62D57/02 Vehicles characterised by having other propulsion or other ground-engaging means than wheels or endless track, alone or in addition to wheels or endless track with ground-engaging propulsion means, e.g. walking members
    • B62D57/032 Vehicles characterised by having other propulsion or other ground-engaging means than wheels or endless track, alone or in addition to wheels or endless track with ground-engaging propulsion means, e.g. walking members with alternately or sequentially lifted supporting base and legs; with alternately or sequentially lifted feet or skid
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/08 Learning methods
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B25 HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25J MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00 Programme-controlled manipulators
    • B25J9/16 Programme controls
    • B25J9/1628 Programme controls characterised by the control loop
    • B25J9/163 Programme controls characterised by the control loop learning, adaptive, model based, rule based expert control

Similar Documents

Publication | Publication Date | Title
Castillo et al. |  | Hybrid zero dynamics inspired feedback control policy design for 3D bipedal locomotion using reinforcement learning
Xie et al. |  | Feedback control for Cassie with deep reinforcement learning
Xiong et al. |  | 3-D underactuated bipedal walking via H-LIP based gait synthesis and stepping stabilization
Hereid et al. |  | Dynamic humanoid locomotion: A scalable formulation for HZD gait optimization
Ma et al. |  | Bipedal robotic running with DURUS-2D: Bridging the gap between theory and experiment
Castillo et al. |  | Reinforcement learning meets hybrid zero dynamics: A case study for RABBIT
Hereid et al. |  | Hybrid zero dynamics based multiple shooting optimization with applications to robotic walking
CN101847009A (en) |  | Biped robot gait energy efficiency optimization method
Castillo et al. |  | Reinforcement learning-based cascade motion policy design for robust 3D bipedal locomotion
Grizzle et al. |  | Virtual constraints and hybrid zero dynamics for realizing underactuated bipedal locomotion
Tucker et al. |  | Preference-based learning for user-guided HZD gait generation on bipedal walking robots
Sharon et al. |  | Synthesis of controllers for stylized planar bipedal walking
Castaneda et al. |  | Improving input-output linearizing controllers for bipedal robots via reinforcement learning
Kim et al. |  | Torque-based deep reinforcement learning for task-and-robot agnostic learning on bipedal robots using sim-to-real transfer
Ma et al. |  | Efficient HZD gait generation for three-dimensional underactuated humanoid running
Arcos-Legarda et al. |  | Robust compound control of dynamic bipedal robots
Krishna et al. |  | Learning linear policies for robust bipedal locomotion on terrains with varying slopes
Dangol et al. |  | Performance satisfaction in Midget, a thruster-assisted bipedal robot
Hu et al. |  | An overview on bipedal gait control methods
Liu et al. |  | Multiphase trajectory generation for planar biped robot using direct collocation method
Kim et al. |  | Quadratic encoding of optimized humanoid walking
Cuevas et al. |  | Polynomial trajectory algorithm for a biped robot
Savin |  | Neural network-based reaction estimator for walking robots
Castillo et al. |  | Velocity regulation of 3D bipedal walking robots with uncertain dynamics through adaptive neural network controller
Dangol et al. |  | Performance satisfaction in Harpy, a thruster-assisted bipedal robot