Guo et al., 2021 - Google Patents
A Survey of Linear Value Function Approximation in Reinforcement LearningGuo et al., 2021
- Document ID
- 5342499530885102218
- Author
- Guo S
- Wei X
- Xu Y
- Xue W
- Wu X
- Wei B
- Publication year
- Publication venue
- International Symposium on Intelligence Computation and Applications
External Links
Snippet
In reinforcement learning, when the state space is enormous or infinite, it is not feasible to find the exact value for each state in the memory. A common way to tackle this problem is to adopt linear value function approximation technique. In this paper, we review some …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30587—Details of specialised database models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30289—Database design, administration or maintenance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Dubey et al. | Comparative study of convolution neural network’s relu and leaky-relu activation functions | |
Hassib et al. | WOA+ BRNN: An imbalanced big data classification framework using Whale optimization and deep neural network | |
Yu et al. | Search what you want: Barrier panelty nas for mixed precision quantization | |
Emary et al. | Impact of chaos functions on modern swarm optimizers | |
Li et al. | A novel double incremental learning algorithm for time series prediction | |
Al-hnaity et al. | Predicting financial time series data using hybrid model | |
Xu et al. | Continuous-action reinforcement learning with fast policy search and adaptive basis function selection | |
Jlassi et al. | Bayesian hyperparameter optimization of deep neural network algorithms based on ant colony optimization | |
Ahmad et al. | Whale–crow optimization (WCO)-based optimal regression model for software cost estimation | |
Guo et al. | A Survey of Linear Value Function Approximation in Reinforcement Learning | |
He et al. | CL-BPUWM: continuous learning with Bayesian parameter updating and weight memory | |
Kalita et al. | A lightweight knowledge-based PSO for SVM hyper-parameters tuning in a dynamic environment | |
Li et al. | Fuzzy quadrature particle filter for maneuvering target tracking | |
Liu et al. | Compositional Prompting for Anti-Forgetting in Domain Incremental Learning | |
Garouani et al. | Scalable meta-bayesian based hyperparameters optimization for machine learning | |
Martynova | A novel approach of the approximation by patterns using Hybrid RBF NN with flexible parameters | |
He et al. | Accelerated proximal stochastic variance reduction for DC optimization | |
Lan et al. | Efficient reinforcement learning with least-squares soft Bellman residual for robotic grasping | |
Kim et al. | A Survey on Automated Machine Learning: Problems, Methods and Frameworks | |
Zhang et al. | Short-Term load forecasting based on RBM and NARX neural network | |
Shi et al. | Power missing data filling based on improved k-means algorithm and rbf neural network | |
Jia et al. | Consistency regularization for ensemble model based reinforcement learning | |
Kostrzewa et al. | Adjusting parameters of the classifiers in multiclass classification | |
Ding et al. | Multi-label k-nearest neighbor classification method based on semi-supervised | |
Wu et al. | Afer: Automated feature engineering for robotic prediction on intelligent automation |