Yasini et al., 2015 - Google Patents

Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems

Document ID
10567827515643781276
Author
Yasini S
Karimpour A
Naghibi Sistani M
Modares H
Publication year
2015
Publication venue
International Journal of Adaptive Control and Signal Processing

Snippet

Online adaptive optimal control methods based on reinforcement learning algorithms typically need to check for the persistence of excitation condition, which is necessary to be known a priori for convergence of the algorithm. However, this condition is often infeasible to …

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • G05B13/027 Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computer systems based on biological models
    • G06N3/02 Computer systems based on biological models using neural network models
    • G06N3/04 Architectures, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computer systems utilising knowledge based models
    • G06N5/04 Inference methods or devices
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B15/00 Systems controlled by a computer
    • G05B15/02 Systems controlled by a computer electric
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass
    • G06N99/005 Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B11/00 Automatic controllers
    • G05B11/01 Automatic controllers electric
    • G05B11/32 Automatic controllers electric with inputs from more than one sensing element; with outputs to more than one correcting element
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05B CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00 Programme-control systems
    • G05B19/02 Programme-control systems electric
    • G05B19/04 Programme control other than numerical control, i.e. in sequence controllers or logic controllers

Similar Documents

Publication Publication Date Title
Modares et al. Online solution of nonquadratic two‐player zero‐sum games arising in the H∞ control of constrained input systems
Vamvoudakis et al. Online solution of nonlinear two‐player zero‐sum games using synchronous policy iteration
Wen et al. Optimized backstepping for tracking control of strict-feedback systems
Vamvoudakis et al. Online adaptive algorithm for optimal control with integral reinforcement learning
Yasini et al. Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems
Lee et al. Observer-Based $\mathcal{H}_{\infty}$ Fault-Tolerant Control for Linear Systems With Sensor and Actuator Faults
Modares et al. Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
Polyakov et al. Robust stabilization of MIMO systems in finite/fixed time
Gómez‐Gutiérrez On the design of nonautonomous fixed‐time controllers with a predefined upper bound of the settling time
Zhang et al. Online adaptive policy learning algorithm for $H_{\infty}$ state feedback control of unknown affine nonlinear discrete-time systems
Lopez‐Ramirez et al. Fixed‐time output stabilization and fixed‐time estimation of a chain of integrators
Zargarzadeh et al. Adaptive neural network‐based optimal control of nonlinear continuous‐time systems in strict‐feedback form
Fan et al. Adaptive fault‐tolerant control for affine non‐linear systems based on approximate dynamic programming
Tutsoy Design and comparison base analysis of adaptive estimator for completely unknown linear systems in the presence of OE noise and constant input time delay
Perrusquia et al. Robust control under worst‐case uncertainty for unknown nonlinear systems using modified reinforcement learning
Zhao et al. Finite‐horizon near optimal adaptive control of uncertain linear discrete‐time systems
Liu et al. Infinite time linear quadratic Stackelberg game problem for unknown stochastic discrete‐time systems via adaptive dynamic programming approach
Yang et al. Robust adaptive control for unmatched systems with guaranteed parameter estimation convergence
Liu et al. Multiperson zero‐sum differential games for a class of uncertain nonlinear systems
Xu et al. Terminal Sliding Mode Control Using Adaptive Fuzzy‐Neural Observer
Zhang et al. Observer‐based single‐network incremental adaptive dynamic programming for fault‐tolerant control of nonlinear systems with actuator faults
Das et al. Lyapunov‐based offset‐free model predictive control of nonlinear process systems
Sadeghi et al. Real‐time identification of nonlinear multiple‐input–multiple‐output systems with unknown input time delay using Wiener model with Neuro‐Laguerre structure
Lewis et al. A Hamilton-Jacobi setup for constrained neural network control
Tapia-Herrera et al. Tuning of a TS fuzzy output regulator using the steepest descent approach and ANFIS