Seo et al., 2024 - Google Patents

Continuous control with coarse-to-fine reinforcement learning

Seo et al., 2024

Document ID: 2441016515703928715
Author: Seo Y; Uruç J; James S
Publication year: 2024
Publication venue: arXiv preprint arXiv:2407.07787

External Links

Cited by

Snippet

Despite recent advances in improving the sample-efficiency of reinforcement learning (RL) algorithms, designing an RL algorithm that can be practically deployed in real-world environments remains a challenge. In this paper, we present Coarse-to-fine Reinforcement …

Continue reading at arxiv.org (PDF) (other versions)

230000002787 reinforcement 0 title abstract description 11

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators

Similar Documents

Publication	Publication Date	Title
Ibarz et al.	2021	How to train your robot with deep reinforcement learning: lessons we have learned
Pertsch et al.	2021	Accelerating reinforcement learning with learned skill priors
Bıyık et al.	2020	Active preference-based gaussian process regression for reward learning
Sanchez-Gonzalez et al.	2018	Graph networks as learnable physics engines for inference and control
Pertsch et al.	2021	Guided reinforcement learning with learned skills
Tobin et al.	2018	Domain randomization and generative models for robotic grasping
Weber et al.	2017	Imagination-augmented agents for deep reinforcement learning
Pinto et al.	2017	Asymmetric actor critic for image-based robot learning
Zhu et al.	2022	Sample efficient grasp learning using equivariant models
Zhu et al.	2022	Bottom-up skill discovery from unsegmented demonstrations for long-horizon robot manipulation
Mendez et al.	2022	Modular lifelong reinforcement learning via neural composition
Murali et al.	2018	Cassl: Curriculum accelerated self-supervised learning
Jalali et al.	2019	Optimal autonomous driving through deep imitation learning and neuroevolution
Dadashi et al.	2021	Continuous control with action quantization from demonstrations
Seo et al.	2024	Continuous control with coarse-to-fine reinforcement learning
Wang et al.	2020	Policy learning in se (3) action spaces
Thabet et al.	2019	Sample-efficient deep reinforcement learning with imaginary rollouts for human-robot interaction
Mohtasib et al.	2021	A study on dense and sparse (visual) rewards in robot policy learning
Allevato et al.	2020	Iterative residual tuning for system identification and sim-to-real robot learning
Sumiea et al.	2023	Enhanced deep deterministic policy gradient algorithm using grey wolf optimizer for continuous control tasks
Tobin	2019	Real-world robotic perception and control using synthetic data
Aslan et al.	2020	End-to-end learning from demonstation for object manipulation of robotis-Op3 humanoid robot
Wang et al.	2022	Learning latent object-centric representations for visual-based robot manipulation
Galashov et al.	2020	Importance weighted policy learning and adaptation
Singh et al.	2019	Model & feature agnostic eye-in-hand visual servoing using deep reinforcement learning with prioritized experience replay