Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems
Yasini et al., 2015
- Document ID
- 10567827515643781276
- Author
- Yasini S
- Karimpour A
- Naghibi Sistani M
- Modares H
- Publication year
- 2015
- Publication venue
- International Journal of Adaptive Control and Signal Processing
Snippet
Online adaptive optimal control methods based on reinforcement learning algorithms typically need to check for the persistence of excitation condition, which is necessary to be known a priori for convergence of the algorithm. However, this condition is often infeasible to …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B15/00—Systems controlled by a computer
- G05B15/02—Systems controlled by a computer electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B11/00—Automatic controllers
- G05B11/01—Automatic controllers electric
- G05B11/32—Automatic controllers electric with inputs from more than one sensing element; with outputs to more than one correcting element
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/04—Programme control other than numerical control, i.e. in sequence controllers or logic controllers
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Modares et al. | Online solution of nonquadratic two‐player zero‐sum games arising in the H∞ control of constrained input systems | |
Vamvoudakis et al. | Online solution of nonlinear two‐player zero‐sum games using synchronous policy iteration | |
Wen et al. | Optimized backstepping for tracking control of strict-feedback systems | |
Vamvoudakis et al. | Online adaptive algorithm for optimal control with integral reinforcement learning | |
Yasini et al. | Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems | |
Lee et al. | Observer-Based $\mathcal{H}_{\infty}$ Fault-Tolerant Control for Linear Systems With Sensor and Actuator Faults | |
Modares et al. | Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems | |
Polyakov et al. | Robust stabilization of MIMO systems in finite/fixed time | |
Gómez‐Gutiérrez | On the design of nonautonomous fixed‐time controllers with a predefined upper bound of the settling time | |
Zhang et al. | Online adaptive policy learning algorithm for $H_{\infty}$ state feedback control of unknown affine nonlinear discrete-time systems | |
Lopez‐Ramirez et al. | Fixed‐time output stabilization and fixed‐time estimation of a chain of integrators | |
Zargarzadeh et al. | Adaptive neural network‐based optimal control of nonlinear continuous‐time systems in strict‐feedback form | |
Fan et al. | Adaptive fault‐tolerant control for affine non‐linear systems based on approximate dynamic programming | |
Tutsoy | Design and comparison base analysis of adaptive estimator for completely unknown linear systems in the presence of OE noise and constant input time delay | |
Perrusquia et al. | Robust control under worst‐case uncertainty for unknown nonlinear systems using modified reinforcement learning | |
Zhao et al. | Finite‐horizon near optimal adaptive control of uncertain linear discrete‐time systems | |
Liu et al. | Infinite time linear quadratic Stackelberg game problem for unknown stochastic discrete‐time systems via adaptive dynamic programming approach | |
Yang et al. | Robust adaptive control for unmatched systems with guaranteed parameter estimation convergence | |
Liu et al. | Multiperson zero‐sum differential games for a class of uncertain nonlinear systems | |
Xu et al. | Terminal Sliding Mode Control Using Adaptive Fuzzy‐Neural Observer | |
Zhang et al. | Observer‐based single‐network incremental adaptive dynamic programming for fault‐tolerant control of nonlinear systems with actuator faults | |
Das et al. | Lyapunov‐based offset‐free model predictive control of nonlinear process systems | |
Sadeghi et al. | Real‐time identification of nonlinear multiple‐input–multiple‐output systems with unknown input time delay using Wiener model with Neuro‐Laguerre structure | |
Lewis et al. | A Hamilton-Jacobi setup for constrained neural network control | |
Tapia-Herrera et al. | Tuning of a TS fuzzy output regulator using the steepest descent approach and ANFIS |