Vamvoudakis et al., 2012 - Google Patents
Adaptive optimal control algorithm for zero-sum Nash games with integral reinforcement learning
- Document ID
- 17453342626310372899
- Author
- Vamvoudakis K
- Vrabie D
- Lewis F
- Publication year
- 2012
- Publication venue
- AIAA guidance, navigation, and control conference
Snippet
In this paper we introduce an adaptive optimal algorithm that uses integral reinforcement knowledge for learning the continuous-time zero-sum game solution for nonlinear systems with infinite-horizon costs and partial knowledge of the system dynamics. This algorithm is a …
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/04—Programme control other than numerical control, i.e. in sequence controllers or logic controllers
- G05B19/042—Programme control other than numerical control, i.e. in sequence controllers or logic controllers using digital processors
- G05B19/0426—Programming the control sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/418—Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS], computer integrated manufacturing [CIM]
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B15/00—Systems controlled by a computer
- G05B15/02—Systems controlled by a computer electric
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
Similar Documents
Publication | Title | Publication Date |
---|---|---|
He et al. | Adaptive optimal control for a class of nonlinear systems: The online policy iteration approach | |
Na et al. | Adaptive identifier-critic-based optimal tracking control for nonlinear systems with experimental validation | |
Vamvoudakis et al. | Online solution of nonlinear two‐player zero‐sum games using synchronous policy iteration | |
Modares et al. | Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks | |
Yang et al. | Adaptive critic designs for event-triggered robust control of nonlinear systems with unknown dynamics | |
Kiumarsi et al. | Optimal and autonomous control using reinforcement learning: A survey | |
Wang et al. | Adaptive neural output-feedback control for a class of nonlower triangular nonlinear systems with unmodeled dynamics | |
Modares et al. | Optimal output-feedback control of unknown continuous-time linear systems using off-policy reinforcement learning | |
Zhang et al. | Online adaptive policy learning algorithm for $H_{\infty}$ state feedback control of unknown affine nonlinear discrete-time systems | |
Vamvoudakis et al. | Online adaptive algorithm for optimal control with integral reinforcement learning | |
Wang et al. | Adaptive neural tracking control for a class of nonlinear systems with dynamic uncertainties | |
Liu et al. | Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints | |
Liu et al. | Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems | |
Lee et al. | Integral reinforcement learning for continuous-time input-affine nonlinear systems with simultaneous invariant explorations | |
Kiumarsi et al. | Optimal tracking control of unknown discrete-time linear systems using input-output measured data | |
Wang et al. | Fault-tolerant controller design for a class of nonlinear MIMO discrete-time systems via online reinforcement learning algorithm | |
Liu et al. | Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems | |
Liu et al. | A one-layer recurrent neural network for constrained nonsmooth optimization | |
He et al. | Reinforcement learning-based output feedback control of nonlinear systems with input constraints | |
Wang et al. | Fuzzy H∞ control of discrete-time nonlinear Markov jump systems via a novel hybrid reinforcement Q-learning method | |
Wang et al. | Backstepping-based Lyapunov function construction using approximate dynamic programming and sum of square techniques | |
Wang et al. | Dynamic learning from adaptive neural control for discrete-time strict-feedback systems | |
Inoue et al. | “Weak” control for human-in-the-loop systems | |
Yang et al. | Adaptive H∞ tracking control for a class of uncertain nonlinear systems using radial-basis-function neural networks | |
Zhao et al. | Adaptive uniform performance control of strict-feedback nonlinear systems with time-varying control gain |