Guo et al., 2021 - Google Patents

A Survey of Linear Value Function Approximation in Reinforcement Learning

Guo et al., 2021

Document ID: 5342499530885102218
Author: Guo S; Wei X; Xu Y; Xue W; Wu X; Wei B
Publication year: 2021
Publication venue: International Symposium on Intelligence Computation and Applications

External Links

Cited by

Snippet

In reinforcement learning, when the state space is enormous or infinite, it is not feasible to find the exact value for each state in the memory. A common way to tackle this problem is to adopt linear value function approximation technique. In this paper, we review some …

Continue reading at link.springer.com (other versions)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30587—Details of specialised database models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30289—Database design, administration or maintenance
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme

Similar Documents

Publication	Publication Date	Title
Dubey et al.	2019	Comparative study of convolution neural network’s relu and leaky-relu activation functions
Hassib et al.	2020	WOA+ BRNN: An imbalanced big data classification framework using Whale optimization and deep neural network
Yu et al.	2020	Search what you want: Barrier panelty nas for mixed precision quantization
Emary et al.	2016	Impact of chaos functions on modern swarm optimizers
Li et al.	2019	A novel double incremental learning algorithm for time series prediction
Al-hnaity et al.	2016	Predicting financial time series data using hybrid model
Xu et al.	2011	Continuous-action reinforcement learning with fast policy search and adaptive basis function selection
Jlassi et al.	2021	Bayesian hyperparameter optimization of deep neural network algorithms based on ant colony optimization
Ahmad et al.	2019	Whale–crow optimization (WCO)-based optimal regression model for software cost estimation
Guo et al.	2021	A Survey of Linear Value Function Approximation in Reinforcement Learning
He et al.	2024	CL-BPUWM: continuous learning with Bayesian parameter updating and weight memory
Kalita et al.	2023	A lightweight knowledge-based PSO for SVM hyper-parameters tuning in a dynamic environment
Li et al.	2016	Fuzzy quadrature particle filter for maneuvering target tracking
Liu et al.	2024	Compositional Prompting for Anti-Forgetting in Domain Incremental Learning
Garouani et al.	2022	Scalable meta-bayesian based hyperparameters optimization for machine learning
Martynova	2019	A novel approach of the approximation by patterns using Hybrid RBF NN with flexible parameters
He et al.	2021	Accelerated proximal stochastic variance reduction for DC optimization
Lan et al.	2023	Efficient reinforcement learning with least-squares soft Bellman residual for robotic grasping
Kim et al.	2022	A Survey on Automated Machine Learning: Problems, Methods and Frameworks
Zhang et al.	2018	Short-Term load forecasting based on RBM and NARX neural network
Shi et al.	2018	Power missing data filling based on improved k-means algorithm and rbf neural network
Jia et al.	2021	Consistency regularization for ensemble model based reinforcement learning
Kostrzewa et al.	2017	Adjusting parameters of the classifiers in multiclass classification
Ding et al.	2019	Multi-label k-nearest neighbor classification method based on semi-supervised
Wu et al.	2022	Afer: Automated feature engineering for robotic prediction on intelligent automation