Vamvoudakis et al., 2012 - Google Patents

Adaptive optimal control algorithm for zero-sum Nash games with integral reinforcement learning

Vamvoudakis et al., 2012

Document ID: 17453342626310372899
Author: Vamvoudakis K; Vrabie D; Lewis F
Publication year: 2012
Publication venue: AIAA guidance, navigation, and control conference

External Links

Cited by

Snippet

In this paper we introduce an adaptive optimal algorithm that uses integral reinforcement knowledge for learning the continuous-time zero sum game solution for nonlinear systems with infinite horizon costs and partial knowledge of the system dynamics. This algorithm is a …

Continue reading at arc.aiaa.org (other versions)

230000003044 adaptive 0 title abstract description 25

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/04—Programme control other than numerical control, i.e. in sequence controllers or logic controllers
- G05B19/042—Programme control other than numerical control, i.e. in sequence controllers or logic controllers using digital processors
- G05B19/0426—Programming the control sequence
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/418—Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS], computer integrated manufacturing [CIM]
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B15/00—Systems controlled by a computer
- G05B15/02—Systems controlled by a computer electric
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design

Similar Documents

Publication	Publication Date	Title
He et al.	2019	Adaptive optimal control for a class of nonlinear systems: The online policy iteration approach
Na et al.	2020	Adaptive identifier-critic-based optimal tracking control for nonlinear systems with experimental validation
Vamvoudakis et al.	2012	Online solution of nonlinear two‐player zero‐sum games using synchronous policy iteration
Modares et al.	2013	Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks
Yang et al.	2018	Adaptive critic designs for event-triggered robust control of nonlinear systems with unknown dynamics
Kiumarsi et al.	2017	Optimal and autonomous control using reinforcement learning: A survey
Wang et al.	2017	Adaptive neural output-feedback control for a class of nonlower triangular nonlinear systems with unmodeled dynamics
Modares et al.	2016	Optimal output-feedback control of unknown continuous-time linear systems using off-policy reinforcement learning
Zhang et al.	2014	Online adaptive policy learning algorithm for $ H_ {\infty} $ state feedback control of unknown affine nonlinear discrete-time systems
Vamvoudakis et al.	2014	Online adaptive algorithm for optimal control with integral reinforcement learning
Wang et al.	2016	Adaptive neural tracking control for a class of nonlinear systems with dynamic uncertainties
Liu et al.	2015	Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints
Liu et al.	2014	Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems
Lee et al.	2014	Integral reinforcement learning for continuous-time input-affine nonlinear systems with simultaneous invariant explorations
Kiumarsi et al.	2015	Optimal tracking control of unknown discrete-time linear systems using input-output measured data
Wang et al.	2015	Fault-tolerant controller design for a class of nonlinear MIMO discrete-time systems via online reinforcement learning algorithm
Liu et al.	2014	Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems
Liu et al.	2011	A one-layer recurrent neural network for constrained nonsmooth optimization
He et al.	2005	Reinforcement learning-based output feedback control of nonlinear systems with input constraints
Wang et al.	2022	Fuzzy H∞ control of discrete-time nonlinear Markov jump systems via a novel hybrid reinforcement Q-learning method
Wang et al.	2016	Backstepping-based Lyapunov function construction using approximate dynamic programming and sum of square techniques
Wang et al.	2021	Dynamic learning from adaptive neural control for discrete-time strict-feedback systems
Inoue et al.	2019	“Weak” control for human-in-the-loop systems
Yang et al.	2007	Adaptive H∞ tracking control for a class of uncertain nonlinear systems using radial-basis-function neural networks
Zhao et al.	2022	Adaptive uniform performance control of strict-feedback nonlinear systems with time-varying control gain