Castillo et al., 2020 - Google Patents
Hybrid zero dynamics inspired feedback control policy design for 3d bipedal locomotion using reinforcement learningCastillo et al., 2020
View PDF- Document ID
- 3633846451251984497
- Author
- Castillo G
- Weng B
- Zhang W
- Hereid A
- Publication year
- Publication venue
- 2020 IEEE International Conference on Robotics and Automation (ICRA)
External Links
Snippet
This paper presents a novel model-free reinforcement learning (RL) framework to design feedback control policies for 3D bipedal walking. Existing RL algorithms are often trained in an end-to-end manner or rely on prior knowledge of some reference joint trajectories …
- 230000002787 reinforcement 0 title abstract description 6
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B62—LAND VEHICLES FOR TRAVELLING OTHERWISE THAN ON RAILS
- B62D—MOTOR VEHICLES; TRAILERS
- B62D57/00—Vehicles characterised by having other propulsion or other ground- engaging means than wheels or endless track, alone or in addition to wheels or endless track
- B62D57/02—Vehicles characterised by having other propulsion or other ground- engaging means than wheels or endless track, alone or in addition to wheels or endless track with ground-engaging propulsion means, e.g. walking members
- B62D57/032—Vehicles characterised by having other propulsion or other ground- engaging means than wheels or endless track, alone or in addition to wheels or endless track with ground-engaging propulsion means, e.g. walking members with alternately or sequentially lifted supporting base and legs; with alternately or sequentially lifted feet or skid
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1628—Programme controls characterised by the control loop
- B25J9/163—Programme controls characterised by the control loop learning, adaptive, model based, rule based expert control
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Castillo et al. | Hybrid zero dynamics inspired feedback control policy design for 3d bipedal locomotion using reinforcement learning | |
Xie et al. | Feedback control for cassie with deep reinforcement learning | |
Xiong et al. | 3-d underactuated bipedal walking via h-lip based gait synthesis and stepping stabilization | |
Hereid et al. | Dynamic humanoid locomotion: A scalable formulation for HZD gait optimization | |
Ma et al. | Bipedal robotic running with DURUS-2D: Bridging the gap between theory and experiment | |
Castillo et al. | Reinforcement learning meets hybrid zero dynamics: A case study for rabbit | |
Hereid et al. | Hybrid zero dynamics based multiple shooting optimization with applications to robotic walking | |
CN101847009A (en) | Biped robot gait energy efficiency optimization method | |
Castillo et al. | Reinforcement learning-based cascade motion policy design for robust 3D bipedal locomotion | |
Grizzle et al. | Virtual constraints and hybrid zero dynamics for realizing underactuated bipedal locomotion | |
Tucker et al. | Preference-based learning for user-guided hzd gait generation on bipedal walking robots | |
Sharon et al. | Synthesis of controllers for stylized planar bipedal walking | |
Castaneda et al. | Improving input-output linearizing controllers for bipedal robots via reinforcement learning | |
Kim et al. | Torque-based deep reinforcement learning for task-and-robot agnostic learning on bipedal robots using sim-to-real transfer | |
Ma et al. | Efficient HZD gait generation for three-dimensional underactuated humanoid running | |
Arcos-Legarda et al. | Robust compound control of dynamic bipedal robots | |
Krishna et al. | Learning linear policies for robust bipedal locomotion on terrains with varying slopes | |
Dangol et al. | Performance satisfaction in midget, a thruster-assisted bipedal robot | |
Hu et al. | An overview on bipedal gait control methods | |
Liu et al. | Multiphase trajectory generation for planar biped robot using direct collocation method | |
Kim et al. | Quadratic encoding of optimized humanoid walking | |
Cuevas et al. | Polynomial trajectory algorithm for a biped robot | |
Savin | Neural network-based reaction estimator for walking robots | |
Castillo et al. | Velocity regulation of 3d bipedal walking robots with uncertain dynamics through adaptive neural network controller | |
Dangol et al. | Performance satisfaction in Harpy, a thruster-assisted bipedal robot |