Sumiea et al., 2023 - Google Patents

Enhanced deep deterministic policy gradient algorithm using grey wolf optimizer for continuous control tasks

Document ID: 3903991611122620881
Author: Sumiea E; Abdulkadir S; Ragab M; Al-Selwi S; Fati S; AlQushaibi A; Alhussian H
Publication year: 2023
Publication venue: IEEE Access

Snippet

Deep Reinforcement Learning (DRL) allows agents to make decisions in a specific environment based on a reward function, without prior knowledge. Adapting hyperparameters significantly impacts the learning process and time. Precise estimation of …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation

Similar Documents

Publication	Publication Date	Title
Zhu et al.	2021	A survey of deep RL and IL for autonomous driving policy learning
Han et al.	2023	A survey on deep reinforcement learning algorithms for robotic manipulation
Arulkumaran et al.	2017	A brief survey of deep reinforcement learning
Sutton et al.	2011	Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
Sumiea et al.	2023	Enhanced deep deterministic policy gradient algorithm using grey wolf optimizer for continuous control tasks
Jalali et al.	2019	Optimal autonomous driving through deep imitation learning and neuroevolution
Auddy et al.	2023	Continual learning from demonstration of robotics skills
Fang et al.	2020	Learn to make decision with small data for autonomous driving: deep gaussian process and feedback control
Ramamurthy et al.	2019	Leveraging domain knowledge for reinforcement learning using MMC architectures
Jiang et al.	2022	Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle
CN117223011A (en)	2023-12-12	Multi-objective reinforcement learning using weighted strategy projection
Liang et al.	2023	Multi-UAV autonomous collision avoidance based on PPO-GIC algorithm with CNN–LSTM fusion network
Woodford et al.	2016	Concurrent controller and simulator neural network development for a differentially-steered robot in evolutionary robotics
Parhi et al.	2017	Navigational strategy for underwater mobile robot based on adaptive neuro-fuzzy inference system model embedded with shuffled frog leaping algorithm–based hybrid learning approach
Zhu et al.	2023	Improved PER-DDPG based nonparametric modeling of ship dynamics with uncertainty
Plasencia-Salgueiro	2023	Deep reinforcement learning for autonomous mobile robot navigation
Duraisamy et al.	2024	Real-time implementation of deep reinforcement learning controller for speed tracking of robotic fish through data-assisted modeling
CN118043824A (en)	2024-05-14	Retrieval enhanced reinforcement learning
Li et al.	2022	Research on the agricultural machinery path tracking method based on deep reinforcement learning
CN114219066A (en)	2022-03-22	Unsupervised reinforcement learning method and unsupervised reinforcement learning device based on Wasserstein distance
Baxter et al.	2009	Shared Potential Fields and their place in a multi-robot co-ordination taxonomy
Kim et al.	2024	Strangeness-driven exploration in multi-agent reinforcement learning
Feng et al.	2019	Mobile robot obstacle avoidance based on deep reinforcement learning
Amroun et al.	2022	How statistical modeling and machine learning could help in the calibration of numerical simulation and fluid mechanics models? Application to the calibration of models reproducing the vibratory behavior of an overhead line conductor
Cui et al.	2024	Mobile robot sequential decision making using a deep reinforcement learning hyper-heuristic approach