Yasini et al., 2015 - Google Patents

Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems

Yasini et al., 2015

Document ID: 10567827515643781276
Author: Yasini S; Karimpour A; Naghibi Sistani M; Modares H
Publication year: 2015
Publication venue: International Journal of Adaptive Control and Signal Processing

External Links

Cited by

Snippet

Online adaptive optimal control methods based on reinforcement learning algorithms typically need to check for the persistence of excitation condition, which is necessary to be known a priori for convergence of the algorithm. However, this condition is often infeasible to …

Continue reading at onlinelibrary.wiley.com (other versions)

230000002787 reinforcement 0 title abstract description 14

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B15/00—Systems controlled by a computer
- G05B15/02—Systems controlled by a computer electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B11/00—Automatic controllers
- G05B11/01—Automatic controllers electric
- G05B11/32—Automatic controllers electric with inputs from more than one sensing element; with outputs to more than one correcting element
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/04—Programme control other than numerical control, i.e. in sequence controllers or logic controllers

Similar Documents

Publication	Publication Date	Title
Modares et al.	2014	Online solution of nonquadratic two‐player zero‐sum games arising in the H∞ control of constrained input systems
Vamvoudakis et al.	2012	Online solution of nonlinear two‐player zero‐sum games using synchronous policy iteration
Wen et al.	2018	Optimized backstepping for tracking control of strict-feedback systems
Vamvoudakis et al.	2014	Online adaptive algorithm for optimal control with integral reinforcement learning
Yasini et al.	2015	Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems
Lee et al.	2018	Observer-Based $\mathcal {H} _ {\infty} $ Fault-Tolerant Control for Linear Systems With Sensor and Actuator Faults
Modares et al.	2014	Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
Polyakov et al.	2016	Robust stabilization of MIMO systems in finite/fixed time
Gómez‐Gutiérrez	2020	On the design of nonautonomous fixed‐time controllers with a predefined upper bound of the settling time
Zhang et al.	2014	Online adaptive policy learning algorithm for $ H_ {\infty} $ state feedback control of unknown affine nonlinear discrete-time systems
Lopez‐Ramirez et al.	2018	Fixed‐time output stabilization and fixed‐time estimation of a chain of integrators
Zargarzadeh et al.	2014	Adaptive neural network‐based optimal control of nonlinear continuous‐time systems in strict‐feedback form
Fan et al.	2016	Adaptive fault‐tolerant control for affine non‐linear systems based on approximate dynamic programming
Tutsoy	2016	Design and comparison base analysis of adaptive estimator for completely unknown linear systems in the presence of OE noise and constant input time delay
Perrusquia et al.	2020	Robust control under worst‐case uncertainty for unknown nonlinear systems using modified reinforcement learning
Zhao et al.	2015	Finite‐horizon near optimal adaptive control of uncertain linear discrete‐time systems
Liu et al.	2021	Infinite time linear quadratic Stackelberg game problem for unknown stochastic discrete‐time systems via adaptive dynamic programming approach
Yang et al.	2019	Robust adaptive control for unmatched systems with guaranteed parameter estimation convergence
Liu et al.	2014	Multiperson zero‐sum differential games for a class of uncertain nonlinear systems
Xu et al.	2013	Terminal Sliding Mode Control Using Adaptive Fuzzy‐Neural Observer
Zhang et al.	2023	Observer‐based single‐network incremental adaptive dynamic programming for fault‐tolerant control of nonlinear systems with actuator faults
Das et al.	2015	Lyapunov‐based offset‐free model predictive control of nonlinear process systems
Sadeghi et al.	2019	Real‐time identification of nonlinear multiple‐input–multiple‐output systems with unknown input time delay using Wiener model with Neuro‐Laguerre structure
Lewis et al.	2003	A Hamilton-Jacobi setup for constrained neural network control
Tapia-Herrera et al.	2013	Tuning of a TS fuzzy output regulator using the steepest descent approach and ANFIS