Zhang et al., 2022 - Google Patents
Reinforcement learning behavioral control for nonlinear autonomous systemZhang et al., 2022
- Document ID
- 10882725450255836019
- Author
- Zhang Z
- Mo Z
- Chen Y
- Huang J
- Publication year
- Publication venue
- IEEE/CAA Journal of Automatica Sinica
External Links
Snippet
Behavior-based autonomous systems rely on human intelligence to resolve multi-mission conflicts by designing mission priority rules and nonlinear controllers. In this work, a novel two-layer reinforcement learning behavioral control (RLBC) method is proposed to reduce …
- 230000002787 reinforcement 0 title abstract description 10
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B11/00—Automatic controllers
- G05B11/01—Automatic controllers electric
- G05B11/32—Automatic controllers electric with inputs from more than one sensing element; with outputs to more than one correcting element
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
- G05D1/0011—Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot associated with a remote control arrangement
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Mohajerin et al. | Multistep prediction of dynamic systems with recurrent neural networks | |
Yu et al. | Fuzzy swarm control based on sliding-mode strategy with self-organized omnidirectional mobile robots system | |
Zhang et al. | Reinforcement learning behavioral control for nonlinear autonomous system | |
Hu et al. | A reinforcement learning neural network for robotic manipulator control | |
Yang et al. | Adaptive H∞ tracking control for a class of uncertain nonlinear systems using radial-basis-function neural networks | |
Campos et al. | PSO tuning for fuzzy PD+ I controller applied to a mobile robot trajectory control | |
Elhaki et al. | Robust amplitude-limited interval type-3 neuro-fuzzy controller for robot manipulators with prescribed performance by output feedback | |
CN111882184B (en) | Multi-agent system null space behavior control dynamic task priority planning method | |
Wu et al. | Optimal control of non-holonomic robotic systems based on type-3 fuzzy model | |
Guo et al. | A nearly optimal chattering reduction method of sliding mode control with an application to a two-wheeled mobile robot | |
Li et al. | A model predictive obstacle avoidance method based on dynamic motion primitives and a Kalman filter | |
Guo et al. | Decoupled sliding mode control of underactuated nonlinear systems using a fuzzy brain emotional cerebellar model control system | |
Scola et al. | Limbic system-inspired performance-guaranteed control for nonlinear multi-agent systems with uncertainties | |
Lin et al. | Hybrid adaptive fuzzy controllers with application to robotic systems | |
Du et al. | Model Predictive Formation Tracking-Containment Control for Multi-UAVs With Obstacle Avoidance | |
Tsai et al. | Adaptive reinforcement learning formation control using ORFBLS for omnidirectional mobile multi-robots | |
Tsai et al. | Adaptive RFWCMAC cooperative formation control for multiple ballbots incorporated with coupling dynamics | |
Ngo et al. | A novel self-organizing fuzzy cerebellar model articulation controller based overlapping Gaussian membership function for controlling robotic system | |
Chen et al. | Learning Agile Quadrotor Flight in Restricted Environments with Safety Guarantees | |
Kanoon et al. | An Intelligent Path Planning Algorithm and Control Strategy Design for Multi-Mobile Robots based on a Modified Elman Recurrent Neural Network | |
Esteban et al. | Hierarchical reinforcement learning for concurrent discovery of compound and composable policies | |
Hwang et al. | Fuzzy adaptive finite-time cooperative control with input saturation for nonlinear multiagent systems and its application | |
Huang et al. | Intelligent mission supervisor design for null-space-based behavioral control system: A reinforcement learning approach | |
Zhang et al. | A behavior-based adaptive dynamic programming method for multiple mobile manipulators coordination control | |
Wang et al. | Intelligent control algorithm for USV with input saturation based on RBF network compensation |