[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Guo et al., 2021 - Google Patents

A Survey of Linear Value Function Approximation in Reinforcement Learning

Guo et al., 2021

Document ID
5342499530885102218
Author
Guo S
Wei X
Xu Y
Xue W
Wu X
Wei B
Publication year
Publication venue
International Symposium on Intelligence Computation and Applications

External Links

Snippet

In reinforcement learning, when the state space is enormous or infinite, it is not feasible to find the exact value for each state in the memory. A common way to tackle this problem is to adopt linear value function approximation technique. In this paper, we review some …
Continue reading at link.springer.com (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30587Details of specialised database models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30289Database design, administration or maintenance
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/12Computer systems based on biological models using genetic models
    • G06N3/126Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models
    • G06N7/005Probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/18Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme

Similar Documents

Publication Publication Date Title
Dubey et al. Comparative study of convolution neural network’s relu and leaky-relu activation functions
Hassib et al. WOA+ BRNN: An imbalanced big data classification framework using Whale optimization and deep neural network
Yu et al. Search what you want: Barrier panelty nas for mixed precision quantization
Emary et al. Impact of chaos functions on modern swarm optimizers
Li et al. A novel double incremental learning algorithm for time series prediction
Al-hnaity et al. Predicting financial time series data using hybrid model
Xu et al. Continuous-action reinforcement learning with fast policy search and adaptive basis function selection
Jlassi et al. Bayesian hyperparameter optimization of deep neural network algorithms based on ant colony optimization
Ahmad et al. Whale–crow optimization (WCO)-based optimal regression model for software cost estimation
Guo et al. A Survey of Linear Value Function Approximation in Reinforcement Learning
He et al. CL-BPUWM: continuous learning with Bayesian parameter updating and weight memory
Kalita et al. A lightweight knowledge-based PSO for SVM hyper-parameters tuning in a dynamic environment
Li et al. Fuzzy quadrature particle filter for maneuvering target tracking
Liu et al. Compositional Prompting for Anti-Forgetting in Domain Incremental Learning
Garouani et al. Scalable meta-bayesian based hyperparameters optimization for machine learning
Martynova A novel approach of the approximation by patterns using Hybrid RBF NN with flexible parameters
He et al. Accelerated proximal stochastic variance reduction for DC optimization
Lan et al. Efficient reinforcement learning with least-squares soft Bellman residual for robotic grasping
Kim et al. A Survey on Automated Machine Learning: Problems, Methods and Frameworks
Zhang et al. Short-Term load forecasting based on RBM and NARX neural network
Shi et al. Power missing data filling based on improved k-means algorithm and rbf neural network
Jia et al. Consistency regularization for ensemble model based reinforcement learning
Kostrzewa et al. Adjusting parameters of the classifiers in multiclass classification
Ding et al. Multi-label k-nearest neighbor classification method based on semi-supervised
Wu et al. Afer: Automated feature engineering for robotic prediction on intelligent automation