Gupta et al., 2020 - Google Patents
Policy-gradient and actor-critic based state representation learning for safe driving of autonomous vehiclesGupta et al., 2020
View HTML- Document ID
- 1962880118625787154
- Author
- Gupta A
- Khwaja A
- Anpalagan A
- Guan L
- Venkatesh B
- Publication year
- Publication venue
- Sensors
External Links
Snippet
In this paper, we propose an environment perception framework for autonomous driving using state representation learning (SRL). Unlike existing Q-learning based methods for efficient environment perception and object detection, our proposed method takes the …
- 238000000034 method 0 abstract description 42
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Systems or methods specially adapted for a specific business sector, e.g. utilities or tourism
- G06Q50/01—Social networking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Systems or methods specially adapted for a specific business sector, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/20—Education
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce, e.g. shopping or e-commerce
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bae et al. | Multi-robot path planning method using reinforcement learning | |
García Cuenca et al. | Autonomous driving in roundabout maneuvers using reinforcement learning with Q-learning | |
Iriondo et al. | Pick and place operations in logistics using a mobile manipulator controlled with deep reinforcement learning | |
Zeng et al. | Navigation in unknown dynamic environments based on deep reinforcement learning | |
Li et al. | A dynamic Bayesian network for vehicle maneuver prediction in highway driving scenarios: Framework and verification | |
Gutiérrez-Moreno et al. | Reinforcement learning-based autonomous driving at intersections in CARLA simulator | |
Zhou et al. | Vision-based robot navigation through combining unsupervised learning and hierarchical reinforcement learning | |
Jo et al. | Vehicle trajectory prediction using hierarchical graph neural network for considering interaction among multimodal maneuvers | |
Gupta et al. | Policy-gradient and actor-critic based state representation learning for safe driving of autonomous vehicles | |
Qian et al. | Deep, consistent behavioral decision making with planning features for autonomous vehicles | |
Yu et al. | A dynamic and static context-aware attention network for trajectory prediction | |
Khanum et al. | Deep-learning-based network for lane following in autonomous vehicles | |
Kuutti et al. | Weakly supervised reinforcement learning for autonomous highway driving via virtual safety cages | |
Cortes Gallardo Medina et al. | Object detection, distributed cloud computing and parallelization techniques for autonomous driving systems | |
Tran et al. | An efficiency enhancing methodology for multiple autonomous vehicles in an Urban network adopting deep reinforcement learning | |
Singh et al. | A review of deep reinforcement learning algorithms for mobile robot path planning | |
Lu et al. | Deep reinforcement learning based on social spatial–temporal graph convolution network for crowd navigation | |
Guillén-Ruiz et al. | Evolution of socially-aware robot navigation | |
He et al. | Toward the Trajectory Predictor for Automatic Train Operation System Using CNN–LSTM Network | |
Reda et al. | Design and implementation of reinforcement learning for automated driving compared to classical mpc control | |
Li et al. | Research into autonomous vehicles following and obstacle avoidance based on deep reinforcement learning method under map constraints | |
Shi et al. | Model-Based predictive control and reinforcement learning for planning vehicle-parking trajectories for vertical parking spaces | |
Hu et al. | Path planning for autonomous vehicles in unknown dynamic environment based on deep reinforcement learning | |
Sun et al. | Risk-Aware Deep Reinforcement Learning for Robot Crowd Navigation | |
Liu et al. | A multi-task fusion strategy-based decision-making and planning method for autonomous driving vehicles |