Gupta et al., 2020 - Google Patents

Policy-gradient and actor-critic based state representation learning for safe driving of autonomous vehicles

Gupta et al., 2020

Document ID: 1962880118625787154
Author: Gupta A; Khwaja A; Anpalagan A; Guan L; Venkatesh B
Publication year: 2020
Publication venue: Sensors

External Links

Cited by

Snippet

In this paper, we propose an environment perception framework for autonomous driving using state representation learning (SRL). Unlike existing Q-learning based methods for efficient environment perception and object detection, our proposed method takes the …

Continue reading at www.mdpi.com (HTML) (other versions)

238000000034 method 0 abstract description 42

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Systems or methods specially adapted for a specific business sector, e.g. utilities or tourism
- G06Q50/01—Social networking
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Systems or methods specially adapted for a specific business sector, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/20—Education
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce, e.g. shopping or e-commerce
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis

Similar Documents

Publication	Publication Date	Title
Bae et al.	2019	Multi-robot path planning method using reinforcement learning
García Cuenca et al.	2019	Autonomous driving in roundabout maneuvers using reinforcement learning with Q-learning
Iriondo et al.	2019	Pick and place operations in logistics using a mobile manipulator controlled with deep reinforcement learning
Zeng et al.	2019	Navigation in unknown dynamic environments based on deep reinforcement learning
Li et al.	2019	A dynamic Bayesian network for vehicle maneuver prediction in highway driving scenarios: Framework and verification
Gutiérrez-Moreno et al.	2022	Reinforcement learning-based autonomous driving at intersections in CARLA simulator
Zhou et al.	2019	Vision-based robot navigation through combining unsupervised learning and hierarchical reinforcement learning
Jo et al.	2021	Vehicle trajectory prediction using hierarchical graph neural network for considering interaction among multimodal maneuvers
Gupta et al.	2020	Policy-gradient and actor-critic based state representation learning for safe driving of autonomous vehicles
Qian et al.	2019	Deep, consistent behavioral decision making with planning features for autonomous vehicles
Yu et al.	2021	A dynamic and static context-aware attention network for trajectory prediction
Khanum et al.	2022	Deep-learning-based network for lane following in autonomous vehicles
Kuutti et al.	2021	Weakly supervised reinforcement learning for autonomous highway driving via virtual safety cages
Cortes Gallardo Medina et al.	2021	Object detection, distributed cloud computing and parallelization techniques for autonomous driving systems
Tran et al.	2021	An efficiency enhancing methodology for multiple autonomous vehicles in an Urban network adopting deep reinforcement learning
Singh et al.	2023	A review of deep reinforcement learning algorithms for mobile robot path planning
Lu et al.	2022	Deep reinforcement learning based on social spatial–temporal graph convolution network for crowd navigation
Guillén-Ruiz et al.	2023	Evolution of socially-aware robot navigation
He et al.	2022	Toward the Trajectory Predictor for Automatic Train Operation System Using CNN–LSTM Network
Reda et al.	2023	Design and implementation of reinforcement learning for automated driving compared to classical mpc control
Li et al.	2023	Research into autonomous vehicles following and obstacle avoidance based on deep reinforcement learning method under map constraints
Shi et al.	2023	Model-Based predictive control and reinforcement learning for planning vehicle-parking trajectories for vertical parking spaces
Hu et al.	2023	Path planning for autonomous vehicles in unknown dynamic environment based on deep reinforcement learning
Sun et al.	2023	Risk-Aware Deep Reinforcement Learning for Robot Crowd Navigation
Liu et al.	2023	A multi-task fusion strategy-based decision-making and planning method for autonomous driving vehicles