Yang et al., 2021 - Google Patents
Null space based efficient reinforcement learning with hierarchical safety constraintsYang et al., 2021
View PDF- Document ID
- 11257959818114946844
- Author
- Yang Q
- Stork J
- Stoyanov T
- Publication year
- Publication venue
- 2021 European Conference on Mobile Robots (ECMR)
External Links
Snippet
Reinforcement learning is inherently unsafe for use in physical systems, as learning by trial- and-error can cause harm to the environment or the robot itself. One way to avoid unpredictable exploration is to add constraints in the action space to restrict the robot …
- 230000002787 reinforcement 0 title abstract description 16
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/04—Programme control other than numerical control, i.e. in sequence controllers or logic controllers
- G05B19/05—Programmable logic controllers, e.g. simulating logic interconnections of signals according to ladder diagrams or function charts
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G05B2219/39—Robotics, robotics to robotics hand
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G05B2219/40—Robotics, robotics mapping to robotics vision
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Li et al. | A general framework of motion planning for redundant robot manipulator based on deep reinforcement learning | |
Honerkamp et al. | Learning kinematic feasibility for mobile manipulation through deep reinforcement learning | |
Marzari et al. | Towards hierarchical task decomposition using deep reinforcement learning for pick and place subtasks | |
Zhang et al. | Probabilistic roadmap with self-learning for path planning of a mobile robot in a dynamic and unstructured environment | |
Huang et al. | Reward-adaptive reinforcement learning: Dynamic policy gradient optimization for bipedal locomotion | |
Sun et al. | Iterative learning control based robust distributed algorithm for non-holonomic mobile robots formation | |
Zhang et al. | Reinforcement learning behavioral control for nonlinear autonomous system | |
Chen | Dynamic structure adaptive neural fuzzy control for MIMO uncertain nonlinear systems | |
Yang et al. | Null space based efficient reinforcement learning with hierarchical safety constraints | |
Kegeleirs et al. | Transferability in the automatic off-line design of robot swarms: from sim-to-real to embodiment and design-method transfer across different platforms | |
Desaraju et al. | Leveraging experience for computationally efficient adaptive nonlinear model predictive control | |
Dewangan et al. | Digrad: Multi-task reinforcement learning with shared actions | |
Pavlichenko et al. | Real-robot deep reinforcement learning: Improving trajectory tracking of flexible-joint manipulator with reference correction | |
Esteban et al. | Hierarchical reinforcement learning for concurrent discovery of compound and composable policies | |
Incremona et al. | Experimental assessment of deep reinforcement learning for robot obstacle avoidance: a lpv control perspective | |
Raza et al. | Constructive policy: Reinforcement learning approach for connected multi-agent systems | |
Tamiz et al. | A novel attention control modeling method for sensor selection based on fuzzy neural network learning | |
Li et al. | Model predictive control for constrained robot manipulator visual servoing tuned by reinforcement learning | |
Tang et al. | Reinforcement learning for robots path planning with rule-based shallow-trial | |
Qian et al. | Path Planning Algorithm of Mobile Robot Based on Improved Q-learning Algorithm | |
Yang et al. | Exploiting redundancy to implement multiobjective behavior | |
US20240361753A1 (en) | Sequential Behavior for Intelligent Control in Subsumption-Like Architecture | |
Kobelrausch et al. | Collision-Free Deep Reinforcement Learning for Mobile Robots using Crash-Prevention Policy | |
Stonier et al. | Intelligent hierarchical control for obstacle-avoidance | |
Lundell et al. | Safe-to-explore state spaces: Ensuring safe exploration in policy search with hierarchical task optimization |