[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Gupta et al., 2020 - Google Patents

Policy-gradient and actor-critic based state representation learning for safe driving of autonomous vehicles

Gupta et al., 2020

View HTML
Document ID
1962880118625787154
Author
Gupta A
Khwaja A
Anpalagan A
Guan L
Venkatesh B
Publication year
Publication venue
Sensors

External Links

Snippet

In this paper, we propose an environment perception framework for autonomous driving using state representation learning (SRL). Unlike existing Q-learning based methods for efficient environment perception and object detection, our proposed method takes the …
Continue reading at www.mdpi.com (HTML) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/04Architectures, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for a specific business sector, e.g. utilities or tourism
    • G06Q50/01Social networking
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for a specific business sector, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00Digital computing or data processing equipment or methods, specially adapted for specific applications
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/18Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce, e.g. shopping or e-commerce
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis

Similar Documents

Publication Publication Date Title
Bae et al. Multi-robot path planning method using reinforcement learning
García Cuenca et al. Autonomous driving in roundabout maneuvers using reinforcement learning with Q-learning
Iriondo et al. Pick and place operations in logistics using a mobile manipulator controlled with deep reinforcement learning
Zeng et al. Navigation in unknown dynamic environments based on deep reinforcement learning
Li et al. A dynamic Bayesian network for vehicle maneuver prediction in highway driving scenarios: Framework and verification
Gutiérrez-Moreno et al. Reinforcement learning-based autonomous driving at intersections in CARLA simulator
Zhou et al. Vision-based robot navigation through combining unsupervised learning and hierarchical reinforcement learning
Jo et al. Vehicle trajectory prediction using hierarchical graph neural network for considering interaction among multimodal maneuvers
Gupta et al. Policy-gradient and actor-critic based state representation learning for safe driving of autonomous vehicles
Qian et al. Deep, consistent behavioral decision making with planning features for autonomous vehicles
Yu et al. A dynamic and static context-aware attention network for trajectory prediction
Khanum et al. Deep-learning-based network for lane following in autonomous vehicles
Kuutti et al. Weakly supervised reinforcement learning for autonomous highway driving via virtual safety cages
Cortes Gallardo Medina et al. Object detection, distributed cloud computing and parallelization techniques for autonomous driving systems
Tran et al. An efficiency enhancing methodology for multiple autonomous vehicles in an Urban network adopting deep reinforcement learning
Singh et al. A review of deep reinforcement learning algorithms for mobile robot path planning
Lu et al. Deep reinforcement learning based on social spatial–temporal graph convolution network for crowd navigation
Guillén-Ruiz et al. Evolution of socially-aware robot navigation
He et al. Toward the Trajectory Predictor for Automatic Train Operation System Using CNN–LSTM Network
Reda et al. Design and implementation of reinforcement learning for automated driving compared to classical mpc control
Li et al. Research into autonomous vehicles following and obstacle avoidance based on deep reinforcement learning method under map constraints
Shi et al. Model-Based predictive control and reinforcement learning for planning vehicle-parking trajectories for vertical parking spaces
Hu et al. Path planning for autonomous vehicles in unknown dynamic environment based on deep reinforcement learning
Sun et al. Risk-Aware Deep Reinforcement Learning for Robot Crowd Navigation
Liu et al. A multi-task fusion strategy-based decision-making and planning method for autonomous driving vehicles