Abed-alguni, 2018 - Google Patents
Action-selection method for reinforcement learning based on cuckoo search algorithmAbed-alguni, 2018
- Document ID
- 15949005986243953403
- Author
- Abed-alguni B
- Publication year
- Publication venue
- Arabian Journal for Science and Engineering
External Links
Snippet
A fundamental challenge in reinforcement learning is how to balance between exploration and exploitation of actions. A balanced ratio of exploration/exploitation can significantly affect the total learning time and the quality of learned policies. Thus, several sophisticated …
- 241000544061 Cuculus canorus 0 title abstract description 43
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G06N5/043—Distributed expert systems, blackboards
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G06N5/025—Extracting rules from data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/02—Computer systems based on specific mathematical models using fuzzy logic
- G06N7/023—Learning or tuning the parameters of a fuzzy system
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/0275—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using fuzzy logic only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Abed-alguni | Action-selection method for reinforcement learning based on cuckoo search algorithm | |
Juang et al. | Hierarchical cluster-based multispecies particle-swarm optimization for fuzzy-system optimization | |
EP3467717A1 (en) | Machine learning system | |
Van Den Bergh | An analysis of particle swarm optimizers | |
Soto et al. | Time series prediction using ensembles of ANFIS models with genetic optimization of interval type-2 and type-1 fuzzy integrators | |
Hapke et al. | Pareto simulated annealing for fuzzy multi-objective combinatorial optimization | |
Sun et al. | Multi-agent reinforcement learning: weighting and partitioning | |
Zhan et al. | Model-based offline planning with trajectory pruning | |
Wang et al. | On the convergence of the monte carlo exploring starts algorithm for reinforcement learning | |
Hein et al. | Generating interpretable fuzzy controllers using particle swarm optimization and genetic programming | |
Huang et al. | Interpretable policies for reinforcement learning by empirical fuzzy sets | |
Liu et al. | The eigenoption-critic framework | |
Jin et al. | A game-theoretic reinforcement learning approach for adaptive interaction at intersections | |
Wan et al. | Lotus: Continual imitation learning for robot manipulation through unsupervised skill discovery | |
Peng et al. | Compensatory neural fuzzy network with symbiotic particle swarm optimization for temperature control | |
Chen et al. | Improve the accuracy of recurrent fuzzy system design using an efficient continuous ant colony optimization | |
Torrey et al. | Transfer learning via advice taking | |
Mac Parthaláin et al. | Fuzzy-rough feature selection using flock of starlings optimisation | |
Lu et al. | Inferring preferences from demonstrations in multi-objective reinforcement learning: A dynamic weight-based approach | |
Bagga et al. | Deep learnable strategy templates for multi-issue bilateral negotiation | |
Ghasemi et al. | An introduction to reinforcement learning: Fundamental concepts and practical applications | |
Aoun et al. | Self inertia weight adaptation for the particle swarm optimization | |
Uma et al. | A hybrid PSO with dynamic inertia weight and GA approach for discovering classification rule in data mining | |
Galea et al. | Fuzzy rules from ant-inspired computation | |
Aseri et al. | Review of the meta-heuristic algorithms for fuzzy modeling in the classification problem |