Vrabie et al., 2008 - Google Patents
Adaptive optimal control algorithm for continuous-time nonlinear systems based on policy iterationVrabie et al., 2008
View PDF- Document ID
- 14088161762287331723
- Author
- Vrabie D
- Lewis F
- Publication year
- Publication venue
- 2008 47th IEEE Conference on Decision and Control
External Links
Snippet
In this paper we develop a new online adaptive control scheme, for partially unknown nonlinear systems, which converges to the optimal state feedback control solution for affine in the inputs nonlinear systems. The derivation of the optimal adaptive control algorithm is …
- 230000003044 adaptive 0 title abstract description 35
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/30—Nc systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B15/00—Systems controlled by a computer
- G05B15/02—Systems controlled by a computer electric
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Vrabie et al. | Adaptive optimal control algorithm for continuous-time nonlinear systems based on policy iteration | |
Modares et al. | Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems | |
Gao et al. | Adaptive neural network-based control for a class of nonlinear pure-feedback systems with time-varying full state constraints | |
Dierks et al. | Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update | |
Ge et al. | Adaptive NN control for a class of strict-feedback discrete-time nonlinear systems | |
Yan et al. | Robust model predictive control of nonlinear systems with unmodeled dynamics and bounded uncertainties based on neural networks | |
Zhou et al. | Incremental model based online dual heuristic programming for nonlinear adaptive control | |
Ge et al. | Adaptive neural network control for a class of MIMO nonlinear systems with disturbances in discrete-time | |
Zerari et al. | Neural network based adaptive tracking control for a class of pure feedback nonlinear systems with input saturation | |
Sheikholeslam et al. | Design of adaptive fuzzy wavelet neural sliding mode controller for uncertain nonlinear systems | |
Vrabie et al. | Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework | |
Hsu | Adaptive backstepping Elman-based neural control for unknown nonlinear systems | |
Keighobadi et al. | Adaptive neural dynamic surface control of mechanical systems using integral terminal sliding mode | |
Yasini et al. | Approximate dynamic programming for two-player zero-sum game related to H∞ control of unknown nonlinear continuous-time systems | |
ZHANG et al. | Nearly optimal control scheme using adaptive dynamic programming based on generalized fuzzy hyperbolic model | |
Tutsoy et al. | An analysis of value function learning with piecewise linear control | |
Yasini et al. | Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems | |
Wang et al. | Youla-REN: Learning nonlinear feedback policies with robust stability guarantees | |
Yu | Adaptive Fuzzy Stabilization for a Class of Pure-Feedback Systems with Unknown Dead-Zones. | |
Vamvoudakis et al. | Adaptive optimal control algorithm for zero-sum Nash games with integral reinforcement learning | |
Machón-González et al. | Feedforward nonlinear control using neural gas network | |
Nichols | A comparison of action selection methods for implicit policy method reinforcement learning in continuous action-space | |
Sharma et al. | Wavelet reduced order observer based adaptive tracking control for a class of uncertain nonlinear systems using reinforcement learning | |
Sharma et al. | Wavelet neural network observer based adaptive tracking control for a class of uncertain nonlinear delayed systems using reinforcement learning | |
Vrabie et al. | Neural network-based adaptive optimal controller–a continuous-time formulation |