US20220348220A1 - Method and system for context-aware decision making of an autonomous agent - Google Patents
Method and system for context-aware decision making of an autonomous agent Download PDFInfo
- Publication number
- US20220348220A1 US20220348220A1 US17/858,024 US202217858024A US2022348220A1 US 20220348220 A1 US20220348220 A1 US 20220348220A1 US 202217858024 A US202217858024 A US 202217858024A US 2022348220 A1 US2022348220 A1 US 2022348220A1
- Authority
- US
- United States
- Prior art keywords
- context
- agent
- map
- information
- route
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 132
- 230000009471 action Effects 0.000 claims abstract description 107
- 238000013528 artificial neural network Methods 0.000 claims description 24
- 230000003068 static effect Effects 0.000 claims description 20
- 230000007613 environmental effect Effects 0.000 claims description 11
- 238000012545 processing Methods 0.000 claims description 9
- 230000000306 recurrent effect Effects 0.000 claims description 3
- 230000008569 process Effects 0.000 abstract description 34
- 238000002372 labelling Methods 0.000 abstract description 9
- 239000003795 chemical substances by application Substances 0.000 description 157
- 230000006870 function Effects 0.000 description 41
- 230000007704 transition Effects 0.000 description 20
- 230000006399 behavior Effects 0.000 description 19
- 230000008901 benefit Effects 0.000 description 14
- 238000013459 approach Methods 0.000 description 13
- 238000010801 machine learning Methods 0.000 description 12
- 238000013527 convolutional neural network Methods 0.000 description 10
- 230000004807 localization Effects 0.000 description 10
- 230000008859 change Effects 0.000 description 8
- 238000004891 communication Methods 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 230000002787 reinforcement Effects 0.000 description 7
- 230000004044 response Effects 0.000 description 7
- 238000013136 deep learning model Methods 0.000 description 6
- 238000012549 training Methods 0.000 description 6
- 238000010276 construction Methods 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 3
- 238000003066 decision tree Methods 0.000 description 3
- 238000012706 support-vector machine Methods 0.000 description 3
- 238000012384 transportation and delivery Methods 0.000 description 3
- 238000010200 validation analysis Methods 0.000 description 3
- 101100175317 Danio rerio gdf6a gene Proteins 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 238000003909 pattern recognition Methods 0.000 description 2
- 238000005096 rolling process Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 230000001154 acute effect Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 239000010426 asphalt Substances 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000002089 crippling effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000007499 fusion processing Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000001151 other effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Images
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W60/00—Drive control systems specially adapted for autonomous road vehicles
- B60W60/001—Planning or execution of driving tasks
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/26—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
- G01C21/28—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network with correlation of data from several navigational instruments
- G01C21/30—Map- or contour-matching
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/26—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
- G01C21/34—Route searching; Route guidance
- G01C21/3407—Route searching; Route guidance specially adapted for specific applications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/26—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
- G01C21/34—Route searching; Route guidance
- G01C21/3453—Special cost functions, i.e. other than distance or default speed limit of road segments
- G01C21/3461—Preferred or disfavoured areas, e.g. dangerous zones, toll or emission zones, intersections, manoeuvre types, segments such as motorways, toll roads, ferries
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/26—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
- G01C21/34—Route searching; Route guidance
- G01C21/36—Input/output arrangements for on-board computers
- G01C21/3667—Display of a road map
- G01C21/3673—Labelling using text of road map data items, e.g. road names, POI names
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2155—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/285—Selection of pattern recognition techniques, e.g. of classifiers in a multi-classifier system
-
- G06K9/6259—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/006—Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/87—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using selection of the recognition techniques, e.g. of a classifier in a multiple classifier system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/56—Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0108—Measuring and analyzing of parameters relative to traffic conditions based on the source of data
- G08G1/0112—Measuring and analyzing of parameters relative to traffic conditions based on the source of data from the vehicle, e.g. floating car data [FCD]
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0125—Traffic data processing
- G08G1/0129—Traffic data processing for creating historical data or processing based on historical data
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0125—Traffic data processing
- G08G1/0133—Traffic data processing for classifying traffic situation
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0137—Measuring and analyzing of parameters relative to traffic conditions for specific applications
- G08G1/0145—Measuring and analyzing of parameters relative to traffic conditions for specific applications for active traffic flow control
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/09—Arrangements for giving variable traffic instructions
- G08G1/0962—Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages
- G08G1/09623—Systems involving the acquisition of information from passive traffic signs by means mounted on the vehicle
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/09—Arrangements for giving variable traffic instructions
- G08G1/0962—Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages
- G08G1/09626—Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages where the origin of the information is within the own vehicle, e.g. a local storage device, digital map
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/02—Services making use of location information
- H04W4/021—Services related to particular areas, e.g. point of interest [POI] services, venue services or geofences
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2552/00—Input parameters relating to infrastructure
- B60W2552/05—Type of road, e.g. motorways, local streets, paved or unpaved roads
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2552/00—Input parameters relating to infrastructure
- B60W2552/50—Barriers
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2554/00—Input parameters relating to objects
- B60W2554/20—Static objects
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2554/00—Input parameters relating to objects
- B60W2554/40—Dynamic objects, e.g. animals, windblown objects
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2556/00—Input parameters relating to data
- B60W2556/40—High definition maps
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B60—VEHICLES IN GENERAL
- B60W—CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
- B60W2556/00—Input parameters relating to data
- B60W2556/45—External transmission of data to or from the vehicle
- B60W2556/50—External transmission of data to or from the vehicle of positioning data, e.g. GPS [Global Positioning System] data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Definitions
- This invention relates generally to the autonomous vehicle field, and more specifically to a new and useful system and method for context-aware decision making of an autonomous agent in the autonomous vehicle field.
- FIG. 1 is a schematic of an autonomous vehicle system for context selection.
- FIG. 2 is a schematic of a method for autonomous vehicle context selection.
- FIG. 3 depicts a variation of an integration of the system for context selection (context selector) within an autonomous vehicle architecture.
- FIG. 4 depicts a variation of a set of contexts.
- FIG. 5 depicts a variation of a vehicle executing a creep behavior within a particular context.
- FIG. 6 depicts a schematic of a variation of context switching for fixed routes in comparison to the route options in a geo-fenced operational design domain (ODD).
- ODD operational design domain
- FIGS. 7A-7D depict an example of context selection along a route.
- FIG. 8 depicts a schematic variation of an overall system of the autonomous agent.
- FIGS. 10A-10B depict a variation of a use case of an autonomous vehicle in fixed-route deliveries and a schematic of fixed routes driven by the vehicles.
- FIG. 11 depicts a variation of an integration of a context selector in a trajectory generation architecture.
- FIG. 12 depicts a variation of a context selector in which a context is selected based on a map and a location and/or orientation of the vehicle.
- FIG. 14 depicts a variation of a map indicating a sequential series of contexts for each of a set of example routes.
- FIG. 15 depicts a variation of map indicating an example set of context region assignments.
- a method 200 for context-aware decision making of an autonomous agent includes receiving a set of inputs S 210 and determining a context associated with an autonomous agent based on the set of inputs S 220 . Additionally or alternatively, the method 200 can include any or all of: labeling a map S 205 ; selecting a learning module (context-specific learning module) based on the context S 230 ; defining an action space based on the learning module S 240 ; selecting an action from the action space S 250 ; planning a trajectory based on the action S 260 ; and/or any other suitable processes. The method 200 can be performed with a system as described above and/or any other suitable system.
- the system and method for context-aware decision making of an autonomous agent can confer several benefits over current systems and methods.
- the system and/or method confers the benefit of decision making through a hybrid approach of deep learning and rule-based processes, achieving explainable artificial intelligence (AI).
- AI explainable artificial intelligence
- the system and/or method further confers the benefit of reducing the amount of data required to traverse, validate, and/or add a new route by implementing a small, limited-route ODD including a small number of contexts which can be overly defined and/or described, thereby eliminating and/or reducing a number of edge cases encountered by the vehicle.
- the system and/or method can reduce the need for a large amount of data due to acute confinement of problem while maintaining all the benefits of learning systems (e.g., maintaining an extremely low margin of error in decisions, enabling human-style driving decisions, continued driving progress, etc.).
- learning modules e.g., deep learning models
- inverse reinforcement learning algorithms which conventionally require diverse and significant amounts of data to be trained
- the system and/or method confers the benefit of enabling supply chain growth in short haul logistics (e.g., B2B trucking) applications, by enabling fixed route autonomous delivery of goods between locations.
- short haul logistics e.g., B2B trucking
- the system and/or method further preferably confer the benefit of enabling determination and awareness of a context of an autonomous vehicle, which can confer the subsequent benefits of: achieving smooth switching of contexts (e.g., with minimal validation and testing, based on a predetermined series of contexts specified for particular fixed route, etc.); training models based on a specific context (therefore reducing the training required for each model), such as optimizing for different cost functions and/or parameters in different contexts; selecting safe and appropriate behaviors/actions for the vehicle in light of the context; and/or can confer any other benefit.
- the system and/or method confers the benefit of overcoming the shortcomings of conventional systems and methods, which can include, for instance, any or all of: an inability to adapt to unexpected situations; overgeneralization, which often leads to conservative behavior; hard-to-tune (e.g., hard to manually tune) parameters; constraint monitoring and adaptation of hyper parameters based on environmental changes; unsolvability even in small ODDs; and/or any other shortcomings of conventional systems and methods.
- the system 100 is preferably configured to implement and/or interface with a system which implements a hybrid architecture of decision making (e.g., as shown in FIG. 3 , as shown in FIG. 8 , as shown in FIG. 9 , as shown in FIG. 11 , etc.), the hybrid architecture implementing both classical, rule-based approaches and machine learning approaches, which is enabled by a small ODD, fixed route driving framework.
- a hybrid architecture of decision making e.g., as shown in FIG. 3 , as shown in FIG. 8 , as shown in FIG. 9 , as shown in FIG. 11 , etc.
- the system 100 is implemented in autonomous short-haul (e.g., between 5 and 400 miles) B2B fixed-route applications.
- the autonomous agents preferably receive inventory from sorting centers, but can additionally or alternatively receive inventory for parcel hubs and/or warehouses.
- the agent then preferably delivers the inventory to and/or between any or all of: sorting centers, micro-fulfillment centers, distribution centers, retail stores, and local delivery centers. Additionally or alternatively, the agents can interface with residences (e.g., customer homes), and/or any other suitable locations/facilities.
- the computing system is preferably designed at least partially in a modular format including a set of modular computing components, referred to herein as learning modules (equivalently referred to herein as learning agents or learning models), each associated with predefined inputs and outputs.
- Each computing component contains a specific algorithm module built to process a set of data inputs and generate a set of outputs.
- the computing system can optionally include a middleware framework, which extracts dependencies from these components and links them all together (e.g., with a topological ordering process such as a directed acyclic graph, etc.).
- the framework takes the predefined components described above and combines them with fused data from the sensors to create lightweight user-level tasks. Each task is then scheduled based on resource availability and task priorities and executed as optimized threads.
- system and/or computing system can be otherwise configured and/or designed.
- the computing system includes an onboard computing system onboard (e.g., integrated within) the autonomous agent.
- the autonomous agent includes an autonomous vehicle that is preferably a fully autonomous vehicle and/or able to be operated as a fully autonomous vehicle, but can additionally or alternatively be any semi-autonomous or fully autonomous vehicle, a teleoperated vehicle, and/or any other suitable vehicle.
- the autonomous vehicle is preferably an automobile (e.g., car, driverless car, bus, shuttle, taxi, ride-share vehicle, truck, semi-truck, etc.).
- the computing system can additionally or alternatively include a remote computing system offboard the autonomous agent, such as a cloud computing system.
- the remote computing system is preferably in communication with the onboard computing system (e.g., to collect information from the onboard computing system, to provide updated models to the onboard computing system, etc.), but can additionally or alternatively be in communication with any other suitable components.
- the computing system can include and/or interface with any or all of: a localization module, prediction module, perception module, and/or any other suitable modules for operation of the autonomous agent.
- the computing system (e.g., onboard computing system) is preferably in communication with (e.g., in wireless communication with, in wired communication with, coupled to, physically coupled to, electrically coupled to, etc.) a vehicle control system, which functions to execute commands determined by the computing system.
- a vehicle control system which functions to execute commands determined by the computing system.
- the computing system includes and/or interfaces with a map, which functions to at least partially enable the determination of a context associated with the autonomous agent.
- the map is preferably a high definition, hand-labeled map as described below, which prescribes the context of the autonomous agent based on its location and/or position within the map, but can additionally or alternatively include any other map (e.g., map labeled in an automated fashion, map labeled through both manual and automated processes, etc.) and/or combination of maps.
- the system 100 preferably includes and/or interfaces with a sensor system (equivalently referred to herein as a sensor subsystem), which functions to enable any or all of: a localization of the autonomous agent (e.g., within a map), a detection of surrounding objects (e.g., dynamic objects, static objects, etc.) of the autonomous agent, and/or any other suitable function.
- a sensor system equivalently referred to herein as a sensor subsystem
- the sensor system can include any or all of: cameras (e.g., 360-degree coverage cameras, ultra-high resolution cameras, etc.), light detection and ranging (LiDAR) sensors, radio detection and ranging (RADAR) sensors, motion sensors (e.g., accelerometers, gyroscopes, inertial measurement units [IMUs], speedometers, etc.), location sensors (e.g., Global Navigation Satellite System [GNSS] sensors, Inertial Navigation System [INS] sensors, Global Positioning System [GPS] sensors, any combination, etc.), ultrasonic sensors, and/or any suitable sensors.
- cameras e.g., 360-degree coverage cameras, ultra-high resolution cameras, etc.
- LiDAR light detection and ranging
- RADAR radio detection and ranging
- motion sensors e.g., accelerometers, gyroscopes, inertial measurement units [IMUs], speedometers, etc.
- location sensors e.g., Global Navigation Satellite System [GNSS] sensors, Inertial Navigation
- the sensor system includes: 16-beam LIDARs (e.g., for high fidelity obstacle detection, etc.); short range RADARs (e.g., for blind spot detection, cross traffic alert, emergency braking, etc.); ultrasonic sensors (e.g., for park assist, collision avoidance, etc.); 360-degree coverage cameras (e.g., for surround view for pedestrian/cyclist/urban obstacle detection and avoidance, etc.); 128-beam LIDAR (e.g., for localization of vehicle with high precision); long range ultra-high resolution cameras (e.g., for traffic sign and traffic light detection); long range RADARs (e.g., for long range obstacle tracking and avoidance); GNSS/INS (e.g., for ultra-high precision localization); and/or any other suitable sensors.
- 16-beam LIDARs e.g., for high fidelity obstacle detection, etc.
- short range RADARs e.g., for blind spot detection, cross traffic alert, emergency braking, etc.
- the sensor system is configured to enable a determination of a pose (and/or any other location and/or orientation parameter(s)) of the agent, which is used to select a context associated with that pose (e.g., in a 1:1 fashion, with multiple options for context, etc.) based on a labeled map.
- the sensor system includes a localization subsystem which determines the pose, wherein the localization subsystem includes any combination of GPS, IMU, LIDAR, camera, and/or other sensors mounted on the vehicle to estimate its current position at any given time.
- the sensor system further preferably enables the determination of information (e.g., location, motion, etc.) of objects and/or features in the environment of the agent, such as, but not limited to: dynamic objects, static objects, road infrastructure, environmental conditions (e.g., rain, snow, lighting conditions, etc.), and/or any other suitable information.
- information e.g., location, motion, etc.
- the sensor system can include any other suitable sensors configured to collect any suitable sensor information.
- system 100 can include any other suitable components or combination of components.
- the method 200 includes receiving a set of inputs S 210 and determining a context associated with an autonomous agent based on the set of inputs S 220 . Additionally or alternatively, the method 200 can include any or all of: labeling a map S 205 ; selecting a learning module (context-specific learning module) based on the context S 230 ; defining an action space based on the learning module S 240 ; selecting an action from the action space S 250 ; planning a trajectory based on the action S 260 ; and/or any other suitable processes.
- a learning module context-specific learning module
- the method preferably implements and/or interfaces with a system implementing hybrid decision making as described above, but can additionally or alternatively implement and/or interface with methods implementing any other suitable decision making.
- the method 200 functions to determining a context associated with an autonomous agent and thereby enable context-aware decision making of the autonomous agent. Additionally or alternatively, the method 200 can function to enable selection of an action to be performed by the autonomous agent, the generation of a trajectory to be traveled by the autonomous agent, and/or perform any other suitable function(s).
- the method 200 is preferably performed with a system 100 as described above, but can additionally or alternatively be performed with any suitable system.
- the method 200 is preferably performed throughout the duration of the route being traveled by the autonomous agent and based on a map (e.g., continuously checking for a context change), but can additionally or alternatively be performed at any or all of: a predetermined frequency (e.g., constant frequency), in response to a trigger, at a set of intervals (e.g., random intervals), once, and/or at any other suitable times.
- a predetermined frequency e.g., constant frequency
- intervals e.g., random intervals
- the method 200 can optionally include labeling a map S 205 , which functions to specify the context for the vehicle at any location of the autonomous agent along a route (e.g., predetermined route). Additionally or alternatively, labeling the map can function to specify the location of one or more static objects along a route of the agent; a location and/or other information of lane boundaries and/or other road features (e.g., information about current lane, lane boundaries, lane lines, etc.); the locations of starting points and vehicle destinations; a set of parameters (e.g., speed limit, target vehicle speed, dimensions, etc.) associated with locations of the vehicle; and/or any other suitable information.
- a map S 205 which functions to specify the context for the vehicle at any location of the autonomous agent along a route (e.g., predetermined route). Additionally or alternatively, labeling the map can function to specify the location of one or more static objects along a route of the agent; a location and/or other information of lane boundaries and/or other road features (e.g.
- the map is preferably a high definition map but can additionally or alternatively be or include any other suitable maps.
- the map is further preferably hand labeled with any or all of the information described above, which can be enabled, for instance, by the small ODD and fixed route approach to preferred variations of the system and method. This functions to produce a highly accurate, safe, and dependable map with which to transition between vehicle contexts.
- ODDs e.g., as shown in FIG.
- the number of potential routes available to the vehicle is significant, thereby causing: more required switches between contexts (e.g., increasing the chance of incorrectly selecting a context); increased requirements for storing and switching between a large number of fully learned models (e.g., resulting in a computationally crippling problem); and/or any other effects.
- the contexts are preferably assigned to one or more particular regions in the map (e.g., hard-coded into the map, soft-coded into the map, etc.), such that a particular context relevant to the agent (e.g., context in which agent is located, context in which the agent is about to be located, context that agent has departed, etc.) can be determined (e.g., with one or more inputs received in S 210 such as pose information of the autonomous agent) in S 220 .
- a particular context relevant to the agent e.g., context in which agent is located, context in which the agent is about to be located, context that agent has departed, etc.
- the contexts are preferably assigned to locations and/or regions within the map.
- Each location and/or region in the map can be assigned any or all of: a single context; multiple contexts (e.g., indicating an intersection of multiple routes, wherein a single context is selected based on additional information such as any or all of the inputs received in S 210 , etc.); no context (e.g., indicating a location and/or region not on a fixed route option for the autonomous agent); and/or any combination of contexts.
- the particular context(s) assigned to the location and/or region are preferably determined based on the static environment at that location and/or within that region, such as any or all of: features of the roadway within that region (e.g., number of lanes, highway vs.
- residential road one-way vs. two-way, dirt and/or gravel vs. asphalt, curvature, shoulder vs. no shoulder, etc.
- landmarks and/or features within that region e.g., parking lot, roundabout, etc.
- a type of zone associated with that location and/or region e.g., school zone, construction zone, hospital zone, residential zone, etc.
- a type of dynamic objects encountered at the location and/or region e.g., pedestrians, bicycles, vehicles, animals, etc.
- traffic parameters associated with that location and/or region e.g., speed limit, traffic sign types, height limits for semi trucks, etc.
- any other environmental information e.g., speed limit, traffic sign types, height limits for semi trucks, etc.
- the map is a high-definition map with hard-coded contexts.
- the map is a geo-location file with semantic annotations of context for variations points and/or areas.
- the time of day at which a route is being taken and/or one or more dynamic (e.g., temporal) features can be taken into account, such as any or all of: traffic patterns (e.g., at the time that a fixed route is scheduled to and/or most likely to take place); weather conditions; lighting conditions; time-specific zone information (e.g., times at which school zone restrictions are enforced); and/or any other suitable information.
- traffic patterns e.g., at the time that a fixed route is scheduled to and/or most likely to take place
- weather conditions e.g., at the time that a fixed route is scheduled to and/or most likely to take place
- lighting conditions e.g., time-specific zone information
- time-specific zone information e.g., times at which school zone restrictions are enforced
- dynamic objects e.g., surrounding vehicles, pedestrians, animals, moving objects, etc.
- non-permanent objects or environments e.g., construction sites
- a fallback motion planner can be triggered.
- maps can be dynamically and/or iteratively produced to account for any or all of the features and/or objects.
- the context assignments can be determined based on route information (e.g., fixed route information), such as a set of fixed routes prescribed for the autonomous agent to take.
- the route information can include any or all of: a starting location of the route, a destination of the route, a directionality of the autonomous agent along the route, and/or any other information.
- the contexts assigned to the map are preferably selected based on the vehicle's progression along the route and the contexts that the vehicle would sequentially encounter in doing so. In specific examples, for instance, an intersection at which the agent is planned to pass straight through in a first fixed route may be assigned a different context than the same intersection at which the agent is planned to turn right at in a second fixed route.
- any or all of the contexts can be determined independently of a route and/or a fixed route.
- one or more contexts identified in S 220 can be determined based on a map without assigned and/or prescribed contexts, such as in variations in which the context is identified with one or more programmed processes and/or machine learning processes (e.g., as described in S 220 ) based on a set of inputs (e.g., those described in S 210 ).
- a context is identified with a context identifier module (e.g., as part of the agent's planning module, as part of the computing system, etc.), such as that shown in FIG.
- the context identifier module can receive inputs from the map such as road infrastructure information (e.g., location of, size of, parameters associated with, etc.) and/or static environment features, which can individually and/or collectively include, but is not limited to, any or all of: road signs, lane lines, buildings, railroad tracks, bus routes, and/or other infrastructure information.
- road infrastructure information e.g., location of, size of, parameters associated with, etc.
- static environment features which can individually and/or collectively include, but is not limited to, any or all of: road signs, lane lines, buildings, railroad tracks, bus routes, and/or other infrastructure information.
- the map can optionally include (e.g., assign, prescribe, etc.) one or more transition zones which are arranged between different contexts, and can indicate, for instance, a change in context (e.g., along a fixed route, along a dynamically determined route, etc.), thereby enabling a switching of contexts to occur smoothly (e.g., by defining an action space).
- Assigning transition zones can function, for instance, to define an action space subsequently in the method which smoothly transitions the vehicle from one context to the next (e.g., preventing the availability of certain actions, prescribing that the agent maintain his or her lane, preventing a turn, etc.).
- the transition zones can be any or all of: overlapping with (e.g., partially overlapping with, fully overlapping with, etc.) one or more contexts; non-overlapping with one or more contexts; and/or any combination of overlapping and non-overlapping. Additionally or alternatively, the transition zones can be contexts themselves; the method can be performed in absence of labeled transition zones (e.g., by anticipating the subsequent context); and/or be otherwise performed.
- S 205 includes hand labeling a high definition map to prescribe a set of contexts, further preferably a series of contexts, for at least a set of fixed routes available to an autonomous agent.
- S 205 further preferably includes hand labeling a set of static objects and/or road features associated with the routes.
- each region in the map is assigned a single context (e.g., for all routes, for a specific route, etc.). Additionally or alternatively, each region in the map can be assigned multiple contexts (e.g., for overlapping routes, to be selected from, etc.).
- S 205 includes labeling a map without prescribed contexts, wherein the context is determined subsequently in the method 200 (e.g., with one or more machine learning models).
- the map is labeled with road features and/or landmarks, which are subsequently used (e.g., with sensor information) to determine a context.
- S 205 can be performed in another suitable way or the method 200 can be performed in absence of S 205 .
- the method 200 includes receiving a set of inputs S 210 , which functions to receive information with which to determine a context for the autonomous agent. Additionally or alternatively, S 210 can function to receive information with which to perform any suitable processes of the method (e.g., determine an action and/or behavior, determine a trajectory of the agent, etc.). Additionally or alternatively, S 210 can include determining (e.g., calculating) any or all of the set of inputs, combining inputs (e.g., in one or more sensor fusion processes), processing and/or preprocessing the set of inputs, and/or any other suitable processes.
- S 210 can include determining (e.g., calculating) any or all of the set of inputs, combining inputs (e.g., in one or more sensor fusion processes), processing and/or preprocessing the set of inputs, and/or any other suitable processes.
- S 210 is preferably performed throughout the method 200 , such as any or all of: continuously, at a predetermined frequency, at random intervals, prior to each of a set of processes of the method 200 , and/or at any other suitable times.
- S 210 can additionally or alternatively be performed in response to a trigger (e.g., based on the map, based on sensor information, etc.), at random intervals, and/or at any other suitable time(s) during the method 200 .
- the set of inputs received in S 210 preferably includes sensor information collected at a sensor subsystem of the autonomous agent, such as any or all of: a sensor system onboard the autonomous agent, a sensor system remote from the autonomous agent, and/or a sensor system in communication with the autonomous agent and/or a computing system (e.g., onboard computing system, remote computing system, etc.) of the autonomous agent.
- the sensor information can be collected from any other suitable sensor(s) and/or combination of sensors, S 210 can be performed in absence of collecting sensor inputs, and/or S 210 can be performed in any other suitable way(s).
- the sensor information can additionally or alternatively include motion information and/or other dynamic information associated with the autonomous agent, such as, but not limited to, any or all of: velocity/speed, acceleration, and/or any other suitable information.
- the set of inputs received in S 210 further preferably includes the map and/or any information determined from (e.g., determined based on, derived from, included in, etc.) the map, such as any or all of the information described above in S 205 .
- this includes one or more contexts (and/or transition zones) selected based on (e.g., predetermined/assigned to) a region/location of the autonomous agent (e.g., as determined based on sensor information as described above).
- the map information includes any or all of: road infrastructure information and/or other static environment information, route information, and/or any other suitable information.
- Information associated with the map can optionally be determined based on other information received in S 210 , such as any or all of the sensor information received at one or more sensor systems.
- location information e.g., current pose, current position, current geographical location, etc.
- this information is a particular context selected (e.g., assigned to) based on the location of (e.g., a region including the location, a stretch of road on which the agent is located, a particular intersection in which the agent is located, etc.) the autonomous agent.
- the location of the autonomous agent in comparison with the map includes information associated with the road infrastructure (e.g., road signs, lane lines, buildings, etc.), which is used, preferably along with other inputs (e.g., static environment features, static object information, autonomous agent vehicle state, dynamic environment features, etc.) to determine a context (e.g., using one or more learning-based models, using a pattern recognition and/or classification model, etc.) for the autonomous agent.
- the road infrastructure e.g., road signs, lane lines, buildings, etc.
- other inputs e.g., static environment features, static object information, autonomous agent vehicle state, dynamic environment features, etc.
- a context e.g., using one or more learning-based models, using a pattern recognition and/or classification model, etc.
- the set of inputs can optionally include a route and/or associated route information (e.g., route identifier of a fixed route, agent's progression through route, etc.) assigned to and/or being traversed by the agent (e.g., fixed route selected for the agent, dynamic route being traveled by the agent, predicted route for the agent, etc.), which can function for instance, to select information from one or more maps and/or to select a particular map (e.g., a route-specific map).
- the map information selected based on route can include, but is not limited to, any or all of: a current context, a future context (e.g., next context in a fixed route), a transition zone, and/or any other suitable information from a map.
- the set of inputs S 210 can additionally or alternatively include any other suitable information, such as, but not limited to, any or all of: a state (e.g., operational state, driving state, etc.) of the autonomous agent, a trajectory of the agent, a set of control commands for the agent, historical information associated with the agent and/or an environment of the agent, predicted information associated with the agent and/or the environment (e.g., predicted trajectories of dynamic objects), and/or any other suitable information and/or inputs.
- a state e.g., operational state, driving state, etc.
- a trajectory of the agent e.g., a trajectory of the agent
- a set of control commands for the agent e.g., historical information associated with the agent and/or an environment of the agent
- predicted information associated with the agent and/or the environment e.g., predicted trajectories of dynamic objects
- S 210 includes receiving a map specifying a set of assigned contexts for an agent; optionally a route (e.g., fixed route) of the agent; and sensor information from a set of sensors onboard the autonomous agent, wherein the sensor information includes at least a pose of the autonomous agent, wherein the pose and optionally the route are used to select a context for the agent based on the map. Additionally or alternatively, S 210 can include receiving any other suitable inputs.
- S 210 includes receiving a selected route for the vehicle, a high definition hand-labeled map specifying a context for each portion of the route (and optionally other routes), and sensor information including at least location information associated with the vehicle and optionally any or all of: motion information, object information (e.g., dynamic object information, static object information, etc.), and/or any other suitable information.
- a selected route for the vehicle e.g., a high definition hand-labeled map specifying a context for each portion of the route (and optionally other routes)
- sensor information including at least location information associated with the vehicle and optionally any or all of: motion information, object information (e.g., dynamic object information, static object information, etc.), and/or any other suitable information.
- S 210 includes receiving a map including information associated with road infrastructure, such as the road infrastructure along one or more routes of the agent (e.g., a fixed route of the agent) and optionally any or all of: other map information (e.g., speed limit information, traffic laws, etc.); a state of the agent, equivalently referred to herein as an ego state (e.g., as determined by a sensor subsystem; pose, velocity, and acceleration; etc.); static environment features and/or information; dynamic environment features and/or information; sensor information; and/or any other suitable information, wherein the context is determined with one or more models and/or algorithms (e.g., convolutional neural networks [CNNs], recurrent neural networks [RNNs], support-vector machines [SVMs], etc.).
- CNNs convolutional neural networks
- RNNs recurrent neural networks
- SVMs support-vector machines
- a context identifier module which includes one or more deep learning models, receives as input a map and optionally route information for the agent, a state of the agent, static environment features, and dynamic environment features, with which the context identifier module determines a context for the agent.
- S 210 includes receiving a set of multiple possible contexts for the agent based on a map, wherein a context of the set of multiple contexts is determined based on other inputs received in S 210 .
- S 210 can include any other suitable processes.
- the method 200 includes determining a context associated with the autonomous agent S 220 , which functions to specify the context in which the autonomous agent is operating, and can further function to: select a learning module based on the context (e.g., according to a 1:1 mapping), define and/or a limit a set of behaviors or actions available to the agent, specify particular parameters (e.g., creep distance) associated with the behaviors and/or actions, reduce and/or minimize the amount of data required to training the learning modules, and/or perform any other suitable function(s).
- select a learning module based on the context (e.g., according to a 1:1 mapping), define and/or a limit a set of behaviors or actions available to the agent, specify particular parameters (e.g., creep distance) associated with the behaviors and/or actions, reduce and/or minimize the amount of data required to training the learning modules, and/or perform any other suitable function(s).
- S 220 is preferably performed in response to (e.g., after, based on, etc.) S 210 , but can additionally or alternatively be performed as part of S 210 and/or concurrently with S 210 , in place of S 210 , in absence of S 210 , multiple times throughout the method, and/or at any other time(s) during the method 200 . Further additionally or alternatively, the method 200 can be performed in absence of S 220 .
- a context refers to a high level driving environment of the agent, which can inform and restrict the vehicle's decision at any given time and/or range of times.
- the context can include and/or define and/or be determined based on any or all of: a region type of the vehicle (e.g., residential, non-residential, highway, school, commercial, parking lot, etc.); a lane feature and/or other infrastructure feature of the road the vehicle is traversing (e.g., number of lanes, one-way road, two-way road, intersection, two-way stop and/or intersection, three-way stop and/or intersection, four-way stop and/or intersection, lanes in a roundabout, etc.); a proximity to one or more static objects and/or environmental features (e.g., particular building, body of water, railroad track, parking lot, shoulder, region in which the agent can pull over/pull off to the side of a road, etc.); a proximity a parameter associated with the location (e.g., speed limit, speed limit above a
- contexts can include, but are not limited to, any or all of: a two-way, two-lane residential road (e.g., in which the agent cannot change contexts due to road geometry as shown in FIG. 4 ); a two-way, two-lane non-residential road; a multi-lane highway (e.g., in which the agent can learn it is less likely to see pedestrians); a one-way, single-lane road; a one-way, two-lane road; a one-way road with “n” number (e.g., 1, 2, 3, 4, 5, 6, greater than 6, etc.) of lanes; a two-way road with “n” number (e.g., 1, 2, 3, 4, 5, 6, greater than 6, etc.) of lanes; a single lane road in a parking lot; a single lane road with a yellow boundary on the side; a multi-lane fast moving road (e.g., having a speed above a predetermined threshold); an on ramp of a highway; an off-ramp of a highway
- S 220 is preferably performed based on a set of inputs received in S 210 , but can additionally or alternatively be performed based on any other suitable information.
- S 220 is preferably performed by reading a map (e.g., as described above) to determine the context assigned to a point and/or area corresponding to the location of the agent (e.g., pose) and/or a route of the agent.
- each point and/or region can be any or all of: associated with at most 1 context (in a 1:1 mapping), associated with at most 1 context per route (e.g., wherein a fixed route assignment of the agent is used to select the proper context), associated with multiple contexts (e.g., which are ranked, prioritized, selected from based on other inputs received in S 210 ), and/or otherwise associated.
- the context assignments are hard-coded into the map.
- the context assignments can be soft-coded and/or otherwise assigned.
- S 220 can be performed with any number of algorithms, models (e.g., machine learning models, deep learning models, supervised learning models, unsupervised learning models, semi-supervised learning models, statistical models, pattern recognition models, etc.), finite state machines (FSMs), processes (e.g., traditionally programmed process), decision trees, and/or equations.
- models e.g., machine learning models, deep learning models, supervised learning models, unsupervised learning models, semi-supervised learning models, statistical models, pattern recognition models, etc.
- FSMs finite state machines
- processes e.g., traditionally programmed process
- decision trees e.g., decision trees, and/or equations.
- one or more machine learning models such as, but not limited to: one or more neural networks (e.g., CNNs, RNNs, etc.); SVMs; and/or any other suitable models, are implemented to determine a context (e.g., as a context identifier module of FIG. 13 ) of the agent based on any or all of the inputs in S 210 .
- a context e.g., as a context identifier module of FIG. 13
- the transition is preferably determined (e.g., prescribed) based on the map (e.g., as indicated as a transition zone, as indicated through distinct adjacent contexts in the map, as indicated through distinct sequential contexts in a fixed route, etc.), but can additionally or alternatively be dynamically determined, determined with a trajectory planner, determined based on sensor information, and/or otherwise determined.
- Examples of scenarios for a second context include, but are not limited to, any or all of: lane changing; merging; overtaking a slow moving vehicle; and/or any other scenarios.
- the context triggers the selection of a model and/or algorithm (e.g., a highly-tuned, context-aware custom inverse reinforcement learning (IRL) algorithm), which makes high-level scenario selection and calls a scenario-specific learning module (e.g., as described below) to select an action of the vehicle.
- a model and/or algorithm e.g., a highly-tuned, context-aware custom inverse reinforcement learning (IRL) algorithm
- INL context-aware custom inverse reinforcement learning
- any other suitable algorithms or processes for selecting a scenario can be implemented, an action can be selected in absence of a scenario, a context can be used to select another parameter, and/or S 220 can be otherwise performed.
- S 220 includes selecting a context based on a location and/or orientation of the vehicle (e.g., pose), a labeled map, and optionally any or all of the other information received in S 210 , wherein the context informs how the remaining processes of the method are performed.
- the context then triggers the selection of a particular scenario (e.g., based on a context-specific IRL algorithm).
- a labeled map indicates a sequential series of contexts for each of set of routes (e.g., C1 to C2 to C3 for Route 1; C1′ to C2′ to C3′ to C4′ to C5′ for Route 2; etc.), wherein in an event that the routes overlap in a particular section and/or point (e.g., intersection), a route assignment (e.g., Route 1 vs. Route 2) and/or directionality of the agent (e.g., West vs. East) can be used to select the proper context.
- the map can optionally further include transition zones (e.g., having a non-zero size, having a size of zero and indicating an immediate transition, etc.) between adjacent regions of different context(s) and/or any other information.
- a labeled map assigns a context to each of a set of regions of a labeled map, wherein a location of the agent within the region (and optionally a route assignment and/or directionality of the agent) determines the context for the agent.
- the map can optionally further include transition zones (e.g., having a non-zero size, having a size of zero and indicating an immediate transition, etc.) between adjacent regions of different context(s) and/or any other information.
- S 220 can include any other suitable processes and/or be performed in any other suitable ways.
- S 230 is preferably performed in response to (e.g., after, based on, etc.) S 220 , but can additionally or alternatively be performed as part of S 220 and/or concurrently with S 220 , in place of S 220 , in absence of S 220 , in response to S 210 , multiple times throughout the method, and/or at any other time(s) during the method 200 . Further additionally or alternatively, the method 200 can be performed in absence of S 230 .
- S 230 preferably includes selecting a learning module (equivalently referred to herein as a context-aware learning agent or a deep decision network) which includes a set of machine learning (e.g., deep learning) models and/or algorithms, wherein the learning module is trained based on data associated with that particular context.
- a learning module (equivalently referred to herein as a context-aware learning agent or a deep decision network) which includes a set of machine learning (e.g., deep learning) models and/or algorithms, wherein the learning module is trained based on data associated with that particular context. This functions to divide a large amount of data from all possible contexts into a set of manageable amounts, which cover all or nearly all of the situations the agent would encounter in that context.
- S 230 can optionally include receiving a set of inputs, which can include any or all of the set of inputs described above, a different and/or additional set of inputs, and/or any other suitable inputs.
- S 230 includes receiving any or all of: the inputs described above; the context and/or scenario of the agent; the set of vehicles and/or other dynamic objects surrounding the vehicle, the predicted paths (e.g., where will it be in lane and in which lane, etc.) of the dynamic objects, static objects surrounding the agent; uncertainty values (e.g., of the predicted paths); routing information associated with the agent; and/or any other suitable inputs.
- an environmental representation of the agent is received with the context at a deep decision network selected based on the context.
- the environmental representation (referred to as a full environmental representation) includes a latent space representation of a set of inputs (e.g., as described in S 210 , additional or alternative to those described in S 210 , etc.), the set of inputs including any or all of: a state of the agent (equivalently referred to herein as an ego vehicle state), one or more maps, routing information (e.g., a selected fixed route, parameters associated with a selected fixed route, etc.), dynamic object information/features, static object information/features, and/or any other suitable information.
- routing information e.g., a selected fixed route, parameters associated with a selected fixed route, etc.
- dynamic object information/features e.g., static object information/features, and/or any other suitable information.
- One or more models can optionally be used to process any or all of these inputs (e.g., to determine a latent space representation, to determine another output, to simplify the input(s) to the deep decision network, etc.).
- a first neural network e.g., one or more RNNs, one or more CNNs, a combination of RNNs and CNNs, Order Independent Representation in FIG.
- the first neural network is a CNN including a set of one or more convolution and/or pooling layers. Additionally or alternatively, any other suitable models can be used.
- This representation can optionally be combined with (e.g., appended with in a vector and/or matrix) other environmental information/inputs (e.g., route information, map, agent/ego pose, etc.) and/or outputs from other models, such as a second neural network (e.g., an RNN, a CNN, Ego RNN in FIG.
- a second neural network e.g., an RNN, a CNN, Ego RNN in FIG.
- the latent space network is preferably in the form of and/or includes one or more autoencoders (with one or more encoders, code, and decoders), but can additionally or alternatively include any or all of: other unsupervised learning models, supervised learning models, semi-supervised learning models, and/or any other suitable neural networks and/or models.
- the latent space network is a fully-connected feedforward neural network.
- the output of the latent space representation which is preferably a low-dimensional features vector (but can additionally or alternatively be any other output) is preferably then used as an input to the learning module.
- any other suitable models and/or algorithms can be implemented, the input and/or outputs can be different for these models and/or algorithms, and/or any suitable model architecture can be implemented.
- the learning module is preferably in the form of a machine learning model, further preferably in the form of one or more neural networks and/or models (e.g., convolutional neural network [CNN], inverse reinforcement learning [IRL] model, reinforcement learning [RL] model, imitation learning [IL] model, etc.) trained for a particular context and/or contexts, but can additionally or alternatively include any other suitable models, algorithms, decision trees, lookup tables, and/or other tools.
- CNN convolutional neural network
- RNL reinforcement learning
- RL reinforcement learning
- IL imitation learning
- each of the learning modules is a neural network, further preferably a deep Q-learning network (e.g., IRL algorithm/network), wherein the number of layers (e.g., hidden layers) of the neural network can vary for different contexts and/or actions (e.g., between 3-8 layers, 3 or less layers, 8 or more layers, between 2 and 10 layers, between 1 and 15 layers, etc.).
- a deep Q-learning network e.g., IRL algorithm/network
- any other suitable networks, algorithms, and/or models can be used in the learning module(s), such as, but not limited to, any or all of: policy gradient methods, finite state machines [FSMs], probabilistic methods (e.g., Partially Observable Markov Decision Process [POMDP]), imitation learning [IL], RL or variations of IRL, and/or any other suitable models and/or networks and/or algorithms.
- FSMs finite state machines
- POMDP Partially Observable Markov Decision Process
- IL imitation learning
- RL RL or variations of IRL
- Each of the learning modules is preferably the same type of neural network (e.g., with different numbers of layers) and/or algorithm and/or model, but can alternatively be different.
- Each of the learning modules is preferably trained based on data occurring within the particular context type or context types associated with the learning module and optionally further trained based on data occurring within one or more fixed routes which pass through the context.
- a single learning module applies to a particular context type, wherein the single learning module is trained based on different versions of that context.
- a single learning module applies to a particular context within a particular route, wherein the single learning module is trained based on data associated with that particular context in the particular fixed route.
- the learning module(s) can be trained on any suitable data.
- Each of the learning modules is further preferably trained with inverse reinforcement learning, which functions to determine a reward function and/or an optimal driving policy for each of the context-aware learning agents.
- the output of this training is further preferably a compact fully-connected network model that represents the reward function and an optimal policy for each learning module. Additionally or alternatively, the learning modules can be otherwise suitably trained and/or implemented.
- S 230 includes selecting a context-aware learning agent (equivalently referred to herein as a context-aware learning module) based on a determined context of the agent, wherein a single context-aware learning agent is assigned to each context.
- the context-aware learning agent is preferably trained with an inverse reinforcement learning technique, but can additionally or alternatively be otherwise trained.
- S 230 includes selecting from multiple context-aware learning agents assigned to and/or available to a particular context, wherein the particular context-aware learning agent is selected based on any or all of: machine learning, a decision tree, statistical methods, an algorithm, and/or with any other suitable tool(s).
- any other suitable learning modules can be selected, used, and/or trained.
- the method 200 can include defining an action space based on the learning module S 240 , which functions to define a set of actions (equivalently referred to herein as behaviors) available to the agent in light of the vehicle's context and/or environment. Additionally or alternatively, S 240 can function to minimize a number of available actions to the agent as informed by the context, which functions to simplify the process (e.g., reduce the time, prevent selection of an incompatible action, etc.) required to select an action for the vehicle.
- the method 200 can optionally additionally or alternatively include selecting an action from the action space S 250 , which functions to determine a next behavior (e.g., switching and/or transitioning to a different behavior than current behavior, maintaining a current behavior, etc.) of the vehicle.
- S 240 and/or S 250 are preferably performed in response to (e.g., after, based on, etc.) S 230 , but can additionally or alternatively be performed as part of S 230 and/or concurrently with S 230 , in place of S 230 , in absence of S 230 , in response to S 220 and/or S 210 , multiple times throughout the method, and/or at any other time(s) during the method 200 . Further additionally or alternatively, the method 200 can be performed in absence of S 240 and/or S 250 .
- the action space and/or action is preferably produced as an output (e.g., intermediate output, final output, etc.) of the learning module; additionally or alternatively, the learning module can produce any other suitable outputs.
- the learning module can produce any other suitable outputs.
- a determination of the context and processing based on this context e.g., the specific learning module
- the actions can include, but are not limited to, any or all of: maintaining a lane, changing lanes, turning (e.g., turning right, turning left, performing a U-turn, etc.), merging, creeping, following a vehicle in front of the agent, parking in a lot, pulling over, nudging, passing a vehicle, and/or any other suitable actions such as usual driving actions for human-operated and/or autonomous vehicles.
- Each action is preferably associated with a set of parameters, which are determined based on the particular context of the agent and optionally any other suitable inputs (e.g., sensor information).
- any other suitable inputs e.g., sensor information.
- the extra information and restriction from the context type can reduce the amount of data that is needed to train the different learning approaches and better tune the agent to a specific context to increase accuracy and confidence.
- an output layer of each deep decision network is a softmax layer where the number of output nodes is the number of available actions. Additionally or alternatively, an action space and/or available actions can be determined in any other suitable way(s).
- a multi-lane highway context produces a corresponding action space including: maintaining speed, lane change left, and lane change right.
- a different context such as a residential road produces actions such as those in the highway context and additional actions such as stop, yield, creep, left turn, and right turn.
- an output layer e.g., linear output layer
- an embedding e.g., a vector, a vector of real numbers, etc.
- the embedding could be matched to stored embeddings associated with particular actions (e.g., at a lookup table).
- a length and/or angle of an embedding vector produced by an output layer can be used to match it to a vector associated with a particular action.
- Selecting an action in S 250 can be performed by the context-aware learning agent, performed with another model and/or algorithm and/or process, determined based on other information (e.g., any or all of the set of inputs from S 210 , based on the particular route, based on a next context in the map, etc.), and/or otherwise determined.
- other information e.g., any or all of the set of inputs from S 210 , based on the particular route, based on a next context in the map, etc.
- the action is produced as an output (e.g., single output, multiple outputs, etc.) of the context-aware learning agent.
- the action can be determined based on a state machine or other rule-based method for choosing an action based on context.
- the context of the agent is determined from a map to be a one-lane residential road (e.g., in which the agent cannot change contexts due to road geometry as shown in FIG. 4 ).
- a set of actions determined for this context can include, for instance: maintaining speed, creeping, left turning, right turning, and yielding.
- creep distance refers to the distance the agent should creep forward with extra caution (e.g., before deciding to merge).
- humans tend to creep at a stop sign or before merging on a highway to cautiously gauge any oncoming traffic and pace the speed of the vehicle to merge without collisions or annoyance to road users.
- the value of this parameter is different.
- the creep distance is 2 meters
- the creep distance is 17 meters.
- the context of the agent is determined to be a multi-lane highway in which the agent can learn (e.g., in the learning module) it is less likely to see pedestrians.
- the actions of the action space can include, for instance: lane swap left, lane swap right, maintain speed, and stop.
- the method 200 can include planning a trajectory based on the action S 260 , which functions to enable the agent to perform the selected action (e.g., as described above).
- the trajectory preferably specifies the set of locations and associated speeds for the agent to be at in order to perform the selected action.
- the trajectory is preferably generated based on one of a set of trajectory learning modules (e.g., different than the learning module described above, separate from but including the same and/or a similar architecture as the learning modules described above, etc.), but can additionally or alternatively be generated with any other suitable tools, programmed or learned.
- S 260 can optionally additionally or alternatively include any or all of: validating the trajectory, implementing a fallback mechanism, operating the vehicle according to a trajectory, determining control commands with which to operate the vehicle based on a trajectory, and/or any other suitable output.
- the method includes: receiving a set of inputs S 210 , the set of inputs including at least map (e.g., high definition hand-labeled map, map labeled in an automated fashion, map labeled both manually and in an automated fashion, etc.), a vehicle location (e.g., pose), and optionally a route planned for the vehicle; determining a context prescribed by the map based on the location of the agent; selecting a learning module including a neural network based on the context; defining an action space including a set of actions available to the agent with the learning module; and selecting an action from the action space (e.g., with the learning module). Additionally or alternatively, the method 200 can include any other suitable processes (e.g., determining a trajectory based on the action).
- map e.g., high definition hand-labeled map, map labeled in an automated fashion, map labeled both manually and in an automated fashion, etc.
- a vehicle location e.g., pose
- a route planned for the vehicle
- the method 200 includes: receiving a set of inputs, wherein the set of inputs includes a hand labeled, high definition map prescribing a set of contexts, further preferably a series of contexts, for at least a fixed route of the autonomous agent, wherein the set of inputs further includes sensor information from a set of sensors onboard the autonomous agent, wherein the sensor information includes at least a pose of the autonomous agent, wherein the pose and optionally the route are used to select a context for the agent based on the map, and optionally any other suitable inputs; selecting a context based on a location and/or orientation of the vehicle (e.g., pose), the labeled map, and optionally any or all of the other information received, wherein the context informs how the remaining processes of the method are performed; selecting a context-aware learning agent based on the context, wherein a single context-aware learning agent is assigned to each context and trained (e.g., with an inverse reinforcement learning model); defining an action space and selecting
- the set of inputs includes receiving a route as shown in FIG. 7A ; determining a first context shown in FIG. 7B based on the route and the map, wherein from the start of the trip, the vehicle is located in a parking lot type context with a single lane of traffic and expected heavy foot traffic.
- This foot traffic is usually localized to sidewalk however in this context pedestrian cutoff events are highly probable.
- the learning agent is optimized to be significantly more aware of pedestrians and their actions and as such the reward function is tuned to achieve this.
- the available actions to this agent are: maintaining a predetermined speed (e.g., speed bounded by any leading dynamic objects in agent's path and the speed limit of the current road); yielding behavior relative to a set of dynamic objects (e.g., which may have a precedence over the agent); staying stopped in the current location; and providing a stopping location which the ego vehicle must stop by.
- a predetermined speed e.g., speed bounded by any leading dynamic objects in agent's path and the speed limit of the current road
- yielding behavior relative to a set of dynamic objects e.g., which may have a precedence over the agent
- staying stopped in the current location and providing a stopping location which the ego vehicle must stop by.
- the second context that the vehicle encounters along this route shown in FIG. 7C , is the context of a single lane residential road. This context is quite large including handling single lane traffic lights and stop sign intersection, pedestrian's crossing and right-hand turns.
- the decisions required to handle this context are very similar to that of the first context, the main difference being what the algorithm is trained on and that the reward function is tuned for two different sets of behaviors.
- the final context encountered by the agent shown in FIG. 7D , is a multi-lane residential context, which includes everything that the single lane context needs to handle but in multiple lanes. Thus it needs to understand how to switch lanes, and how to handle turning on multi lane intersection. To handle this, in addition to the actions from the second context, it also contains the following two actions in its action space: changing to the left lane (when safe to do so) and changing to the right lane (when safe to do so).
- the method includes: receiving a set of inputs S 210 , the set of inputs including at least map (e.g., high definition hand-labeled map, map labeled in an automated fashion, map labeled both manually and in an automated fashion, etc.), a vehicle location (e.g., pose), and optionally a route planned for the vehicle; determining a context for the agent with a context identifier module based on the inputs; selecting a learning module including a neural network based on the context; defining an action space including a set of actions available to the agent with the learning module; and selecting an action from the action space (e.g., with the learning module). Additionally or alternatively, the method 200 can include any other suitable processes (e.g., determining a trajectory based on the action).
- map e.g., high definition hand-labeled map, map labeled in an automated fashion, map labeled both manually and in an automated fashion, etc.
- a vehicle location e.g., pose
- the method 200 can include any other suitable processes and/or be performed in any suitable way(s).
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Automation & Control Theory (AREA)
- Medical Informatics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Mathematical Physics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Analytical Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Mechanical Engineering (AREA)
- Transportation (AREA)
- Human Computer Interaction (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Algebra (AREA)
- Probability & Statistics with Applications (AREA)
Abstract
A system for context-aware decision making of an autonomous agent includes a computing system having a context selector and a map. A method for context-aware decision making of an autonomous agent includes receiving a set of inputs, determining a context associated with an autonomous agent based on the set of inputs, and optionally any or all of: labeling a map; selecting a learning module (context-specific learning module) based on the context; defining an action space based on the learning module; selecting an action from the action space; planning a trajectory based on the action S260; and/or any other suitable processes.
Description
- This application is a continuation of U.S. application Ser. No. 17/846,870, filed 22 Jun. 2022, which is a continuation of U.S. application Ser. No. 17/584,062, filed 25 Jan. 2022, which is a continuation of U.S. application Ser. No. 17/332,839, filed 27 May 2021, which is a continuation of U.S. application Ser. No. 17/306,014, filed 3 May 2021, which is a continuation of U.S. application Ser. No. 17/116,810, filed 9 Dec. 2020, which claims the benefit of U.S. Provisional Application Ser. No. 63/035,401, filed 5 Jun. 2020, and U.S. Provisional Application Ser. No. 63/055,756, filed 23 Jul. 2020, each of which is incorporated in its entirety by this reference.
- This invention relates generally to the autonomous vehicle field, and more specifically to a new and useful system and method for context-aware decision making of an autonomous agent in the autonomous vehicle field.
- In autonomous and semi-autonomous control of automotive vehicles, conventional systems and methods for decision making can be classified as one of two approaches: classical approaches relying on traditional programming principles and machine learning based approaches. However, each of these approaches comes with its limitations. Further, a large percentage of current autonomous vehicle systems and methods attempt to drive in various different environments, which makes either the classical approaches extremely involved (and most likely impossible) or the machine learning based approaches lacking explainability (and therefore causing safety concerns).
- Thus, there is a need in the autonomous vehicle field to create an improved and useful system and method for decision making.
-
FIG. 1 is a schematic of an autonomous vehicle system for context selection. -
FIG. 2 is a schematic of a method for autonomous vehicle context selection. -
FIG. 3 depicts a variation of an integration of the system for context selection (context selector) within an autonomous vehicle architecture. -
FIG. 4 depicts a variation of a set of contexts. -
FIG. 5 depicts a variation of a vehicle executing a creep behavior within a particular context. -
FIG. 6 depicts a schematic of a variation of context switching for fixed routes in comparison to the route options in a geo-fenced operational design domain (ODD). -
FIGS. 7A-7D depict an example of context selection along a route. -
FIG. 8 depicts a schematic variation of an overall system of the autonomous agent. -
FIG. 9 depicts a schematic variation of context-aware decision making and trajectory planning. -
FIGS. 10A-10B depict a variation of a use case of an autonomous vehicle in fixed-route deliveries and a schematic of fixed routes driven by the vehicles. -
FIG. 11 depicts a variation of an integration of a context selector in a trajectory generation architecture. -
FIG. 12 depicts a variation of a context selector in which a context is selected based on a map and a location and/or orientation of the vehicle. -
FIG. 13 depicts a variation of a context identifier module which determines a context for the vehicle based on a set of inputs and one or more models and/or algorithms. -
FIG. 14 depicts a variation of a map indicating a sequential series of contexts for each of a set of example routes. -
FIG. 15 depicts a variation of map indicating an example set of context region assignments. -
FIG. 16 depicts a variation of a context selector and set of learned deep decision networks. - The following description of the preferred embodiments of the invention is not intended to limit the invention to these preferred embodiments, but rather to enable any person skilled in the art to make and use this invention.
- As shown in
FIG. 1 , asystem 100 for context-aware decision making of an autonomous agent includes a computing system having a context selector and a map. Additionally or alternatively, the system can include and/or interface with any or all of: an autonomous agent (equivalently referred to herein as an autonomous vehicle and/or an ego vehicle); a vehicle control system; a sensor system; and/or any other suitable components or combination of components. - As shown in
FIG. 2 , amethod 200 for context-aware decision making of an autonomous agent includes receiving a set of inputs S210 and determining a context associated with an autonomous agent based on the set of inputs S220. Additionally or alternatively, themethod 200 can include any or all of: labeling a map S205; selecting a learning module (context-specific learning module) based on the context S230; defining an action space based on the learning module S240; selecting an action from the action space S250; planning a trajectory based on the action S260; and/or any other suitable processes. Themethod 200 can be performed with a system as described above and/or any other suitable system. - The system and method for context-aware decision making of an autonomous agent can confer several benefits over current systems and methods.
- First, in preferred variations, the system and/or method confers the benefit of decision making through a hybrid approach of deep learning and rule-based processes, achieving explainable artificial intelligence (AI).
- This can in turn confer the benefit of achieving a safe and scalable solution for autonomy by any or all of: decomposing learned models into micro-modules with intended functionality restricted to very explainable tasks; building rule-based fallback and validation systems around the micro-modules to guarantee safety, which enables validation of the performance and underlying properties of each of these sub-modules; and/or any other suitable processes.
- Second, in preferred variations, the system and/or method further confers the benefit of reducing the amount of data required to traverse, validate, and/or add a new route by implementing a small, limited-route ODD including a small number of contexts which can be overly defined and/or described, thereby eliminating and/or reducing a number of edge cases encountered by the vehicle. Equivalently, the system and/or method can reduce the need for a large amount of data due to acute confinement of problem while maintaining all the benefits of learning systems (e.g., maintaining an extremely low margin of error in decisions, enabling human-style driving decisions, continued driving progress, etc.). This can enable an optimal selection of learning modules (e.g., deep learning models) and/or training of the learning modules based on low required data and or minimal edge cases; in some examples, for instance, inverse reinforcement learning algorithms (which conventionally require diverse and significant amounts of data to be trained) can be leveraged, which effectively replicate human driving. In specific examples, significantly less data (e.g., 50 times less, 100 times less, 1000 times less, etc.) is required than geofenced approaches to validate a route.
- Third, in some variations (e.g., as shown in
FIGS. 10A-10B ), the system and/or method confers the benefit of enabling supply chain growth in short haul logistics (e.g., B2B trucking) applications, by enabling fixed route autonomous delivery of goods between locations. - Fourth, in preferred variations, the system and/or method further preferably confer the benefit of enabling determination and awareness of a context of an autonomous vehicle, which can confer the subsequent benefits of: achieving smooth switching of contexts (e.g., with minimal validation and testing, based on a predetermined series of contexts specified for particular fixed route, etc.); training models based on a specific context (therefore reducing the training required for each model), such as optimizing for different cost functions and/or parameters in different contexts; selecting safe and appropriate behaviors/actions for the vehicle in light of the context; and/or can confer any other benefit.
- Fifth, in some variations, the system and/or method confers the benefit of overcoming the shortcomings of conventional systems and methods, which can include, for instance, any or all of: an inability to adapt to unexpected situations; overgeneralization, which often leads to conservative behavior; hard-to-tune (e.g., hard to manually tune) parameters; constraint monitoring and adaptation of hyper parameters based on environmental changes; unsolvability even in small ODDs; and/or any other shortcomings of conventional systems and methods.
- Additionally or alternatively, the system and method can confer any other benefit.
- The
system 100 functions to enable context selection and context-aware decision making for a vehicle and includes: a computing system having a context selector and a map. Additionally or alternatively, the system can include and/or interface with any or all of: an autonomous agent; a vehicle control system; a sensor system; and/or any other suitable components or combination of components. - The
system 100 is preferably configured to implement and/or interface with a system which implements a hybrid architecture of decision making (e.g., as shown inFIG. 3 , as shown inFIG. 8 , as shown inFIG. 9 , as shown inFIG. 11 , etc.), the hybrid architecture implementing both classical, rule-based approaches and machine learning approaches, which is enabled by a small ODD, fixed route driving framework. This functions to maintain explainability of the vehicle's decision making while enabling the vehicle to drive with human-like driving behavior on routes validated with minimal training data. - In a first set of variations, as shown in
FIGS. 10-10B , thesystem 100 is implemented in autonomous short-haul (e.g., between 5 and 400 miles) B2B fixed-route applications. In these variations, the autonomous agents preferably receive inventory from sorting centers, but can additionally or alternatively receive inventory for parcel hubs and/or warehouses. The agent then preferably delivers the inventory to and/or between any or all of: sorting centers, micro-fulfillment centers, distribution centers, retail stores, and local delivery centers. Additionally or alternatively, the agents can interface with residences (e.g., customer homes), and/or any other suitable locations/facilities. - The
system 100 includes a computing system, which functions to select a context associated with the vehicle. Additionally or alternatively, the computing system can function to perform any or all of: route planning of the vehicle at a planning module (e.g., generating a trajectory); localization of the vehicle and/or surrounding objects at a localization module; path prediction of the vehicle and/or objects surrounding the vehicle at a prediction module; storage of information; and/or any other suitable functions. - The computing system is preferably designed to offer a centralized and parallel computing model which enables high concurrency of task execution, low latency, and high throughput. The adaptive communication capabilities of the framework allows for high data throughput while the user-level scheduler with resource awareness enables the centralized computing model to perform at the highest level.
- To enable this, the computing system is preferably designed at least partially in a modular format including a set of modular computing components, referred to herein as learning modules (equivalently referred to herein as learning agents or learning models), each associated with predefined inputs and outputs. Each computing component contains a specific algorithm module built to process a set of data inputs and generate a set of outputs. The computing system can optionally include a middleware framework, which extracts dependencies from these components and links them all together (e.g., with a topological ordering process such as a directed acyclic graph, etc.). At runtime, for instance, the framework takes the predefined components described above and combines them with fused data from the sensors to create lightweight user-level tasks. Each task is then scheduled based on resource availability and task priorities and executed as optimized threads.
- Additionally or alternatively, the system and/or computing system can be otherwise configured and/or designed.
- The computing system includes an onboard computing system onboard (e.g., integrated within) the autonomous agent.
- In preferred variations, the autonomous agent includes an autonomous vehicle that is preferably a fully autonomous vehicle and/or able to be operated as a fully autonomous vehicle, but can additionally or alternatively be any semi-autonomous or fully autonomous vehicle, a teleoperated vehicle, and/or any other suitable vehicle. The autonomous vehicle is preferably an automobile (e.g., car, driverless car, bus, shuttle, taxi, ride-share vehicle, truck, semi-truck, etc.). Additionally or alternatively, the autonomous vehicle can include any or all of: a watercraft (e.g., boat, water taxi, etc.), aerial vehicle (e.g., plane, helicopter, drone, etc.), terrestrial vehicle (e.g., 2-wheeled vehicle, bike, motorcycle, scooter, etc.), and/or any other suitable vehicle and/or transportation device, autonomous machine, autonomous device, autonomous robot, and/or any other suitable device.
- The computing system can additionally or alternatively include a remote computing system offboard the autonomous agent, such as a cloud computing system. The remote computing system is preferably in communication with the onboard computing system (e.g., to collect information from the onboard computing system, to provide updated models to the onboard computing system, etc.), but can additionally or alternatively be in communication with any other suitable components.
- The computing system preferably includes active and redundant subsystems, but can additionally or alternatively include any other suitable subsystems.
- The computing system preferably includes a context selector, which functions to select a context associated with the vehicle. The context selector is further preferably part of a planning module of the computing system, which can additionally include any or all of: a set of learning modules (e.g., deep learning models); a trajectory generator; a trajectory validator; and/or any other suitable components. Additionally or alternatively, the context selector can be independent from a planning module, a planning module can include any other suitable components, and/or the computing system can be otherwise configured.
- The computing system further preferably includes a processing system, which functions to process the inputs received at the computing system. The processing system preferably includes a set of central processing units (CPUs) and a set of graphical processing units (GPUs), but can additionally or alternatively include any other components or combination of components (e.g., processors, microprocessors, system-on-a-chip (SoC) components, etc.).
- The computing system can optionally further include any or all of: memory, storage, and/or any other suitable components.
- In addition to the planning module, the computing system can include and/or interface with any or all of: a localization module, prediction module, perception module, and/or any other suitable modules for operation of the autonomous agent.
- The computing system (e.g., onboard computing system) is preferably in communication with (e.g., in wireless communication with, in wired communication with, coupled to, physically coupled to, electrically coupled to, etc.) a vehicle control system, which functions to execute commands determined by the computing system.
- The computing system includes and/or interfaces with a map, which functions to at least partially enable the determination of a context associated with the autonomous agent. The map is preferably a high definition, hand-labeled map as described below, which prescribes the context of the autonomous agent based on its location and/or position within the map, but can additionally or alternatively include any other map (e.g., map labeled in an automated fashion, map labeled through both manual and automated processes, etc.) and/or combination of maps.
- The
system 100 preferably includes and/or interfaces with a sensor system (equivalently referred to herein as a sensor subsystem), which functions to enable any or all of: a localization of the autonomous agent (e.g., within a map), a detection of surrounding objects (e.g., dynamic objects, static objects, etc.) of the autonomous agent, and/or any other suitable function. - The sensor system can include any or all of: cameras (e.g., 360-degree coverage cameras, ultra-high resolution cameras, etc.), light detection and ranging (LiDAR) sensors, radio detection and ranging (RADAR) sensors, motion sensors (e.g., accelerometers, gyroscopes, inertial measurement units [IMUs], speedometers, etc.), location sensors (e.g., Global Navigation Satellite System [GNSS] sensors, Inertial Navigation System [INS] sensors, Global Positioning System [GPS] sensors, any combination, etc.), ultrasonic sensors, and/or any suitable sensors.
- In a set of variations, the sensor system includes: 16-beam LIDARs (e.g., for high fidelity obstacle detection, etc.); short range RADARs (e.g., for blind spot detection, cross traffic alert, emergency braking, etc.); ultrasonic sensors (e.g., for park assist, collision avoidance, etc.); 360-degree coverage cameras (e.g., for surround view for pedestrian/cyclist/urban obstacle detection and avoidance, etc.); 128-beam LIDAR (e.g., for localization of vehicle with high precision); long range ultra-high resolution cameras (e.g., for traffic sign and traffic light detection); long range RADARs (e.g., for long range obstacle tracking and avoidance); GNSS/INS (e.g., for ultra-high precision localization); and/or any other suitable sensors.
- In preferred variations, for instance, the sensor system is configured to enable a determination of a pose (and/or any other location and/or orientation parameter(s)) of the agent, which is used to select a context associated with that pose (e.g., in a 1:1 fashion, with multiple options for context, etc.) based on a labeled map. In specific examples, the sensor system includes a localization subsystem which determines the pose, wherein the localization subsystem includes any combination of GPS, IMU, LIDAR, camera, and/or other sensors mounted on the vehicle to estimate its current position at any given time. The sensor system further preferably enables the determination of information (e.g., location, motion, etc.) of objects and/or features in the environment of the agent, such as, but not limited to: dynamic objects, static objects, road infrastructure, environmental conditions (e.g., rain, snow, lighting conditions, etc.), and/or any other suitable information.
- Additionally or alternatively, the sensor system can include any other suitable sensors configured to collect any suitable sensor information.
- Further additionally or alternatively, the
system 100 can include any other suitable components or combination of components. - As shown in
FIG. 2 , themethod 200 includes receiving a set of inputs S210 and determining a context associated with an autonomous agent based on the set of inputs S220. Additionally or alternatively, themethod 200 can include any or all of: labeling a map S205; selecting a learning module (context-specific learning module) based on the context S230; defining an action space based on the learning module S240; selecting an action from the action space S250; planning a trajectory based on the action S260; and/or any other suitable processes. - The method preferably implements and/or interfaces with a system implementing hybrid decision making as described above, but can additionally or alternatively implement and/or interface with methods implementing any other suitable decision making.
- The
method 200 functions to determining a context associated with an autonomous agent and thereby enable context-aware decision making of the autonomous agent. Additionally or alternatively, themethod 200 can function to enable selection of an action to be performed by the autonomous agent, the generation of a trajectory to be traveled by the autonomous agent, and/or perform any other suitable function(s). - The
method 200 is preferably performed with asystem 100 as described above, but can additionally or alternatively be performed with any suitable system. - The
method 200 is preferably performed throughout the duration of the route being traveled by the autonomous agent and based on a map (e.g., continuously checking for a context change), but can additionally or alternatively be performed at any or all of: a predetermined frequency (e.g., constant frequency), in response to a trigger, at a set of intervals (e.g., random intervals), once, and/or at any other suitable times. - The
method 200 can optionally include labeling a map S205, which functions to specify the context for the vehicle at any location of the autonomous agent along a route (e.g., predetermined route). Additionally or alternatively, labeling the map can function to specify the location of one or more static objects along a route of the agent; a location and/or other information of lane boundaries and/or other road features (e.g., information about current lane, lane boundaries, lane lines, etc.); the locations of starting points and vehicle destinations; a set of parameters (e.g., speed limit, target vehicle speed, dimensions, etc.) associated with locations of the vehicle; and/or any other suitable information. - The map is preferably a high definition map but can additionally or alternatively be or include any other suitable maps. The map is further preferably hand labeled with any or all of the information described above, which can be enabled, for instance, by the small ODD and fixed route approach to preferred variations of the system and method. This functions to produce a highly accurate, safe, and dependable map with which to transition between vehicle contexts. In variations of conventional systems and methods implementing geofenced ODDs (e.g., as shown in
FIG. 6 ), for instance, the number of potential routes available to the vehicle is significant, thereby causing: more required switches between contexts (e.g., increasing the chance of incorrectly selecting a context); increased requirements for storing and switching between a large number of fully learned models (e.g., resulting in a computationally crippling problem); and/or any other effects. - The contexts are preferably assigned to one or more particular regions in the map (e.g., hard-coded into the map, soft-coded into the map, etc.), such that a particular context relevant to the agent (e.g., context in which agent is located, context in which the agent is about to be located, context that agent has departed, etc.) can be determined (e.g., with one or more inputs received in S210 such as pose information of the autonomous agent) in S220.
- The contexts are preferably assigned to locations and/or regions within the map. Each location and/or region in the map can be assigned any or all of: a single context; multiple contexts (e.g., indicating an intersection of multiple routes, wherein a single context is selected based on additional information such as any or all of the inputs received in S210, etc.); no context (e.g., indicating a location and/or region not on a fixed route option for the autonomous agent); and/or any combination of contexts. The particular context(s) assigned to the location and/or region are preferably determined based on the static environment at that location and/or within that region, such as any or all of: features of the roadway within that region (e.g., number of lanes, highway vs. residential road, one-way vs. two-way, dirt and/or gravel vs. asphalt, curvature, shoulder vs. no shoulder, etc.); landmarks and/or features within that region (e.g., parking lot, roundabout, etc.); a type of zone associated with that location and/or region (e.g., school zone, construction zone, hospital zone, residential zone, etc.); a type of dynamic objects encountered at the location and/or region (e.g., pedestrians, bicycles, vehicles, animals, etc.); traffic parameters associated with that location and/or region (e.g., speed limit, traffic sign types, height limits for semi trucks, etc.); and/or any other environmental information.
- In a first set of variations, the map is a high-definition map with hard-coded contexts. In specific example, the map is a geo-location file with semantic annotations of context for variations points and/or areas.
- Additionally or alternatively, the time of day at which a route is being taken and/or one or more dynamic (e.g., temporal) features can be taken into account, such as any or all of: traffic patterns (e.g., at the time that a fixed route is scheduled to and/or most likely to take place); weather conditions; lighting conditions; time-specific zone information (e.g., times at which school zone restrictions are enforced); and/or any other suitable information.
- In some variations, dynamic objects (e.g., surrounding vehicles, pedestrians, animals, moving objects, etc.) and/or non-permanent objects or environments (e.g., construction sites) are preferably accounted for with a sensor system (rather than the map), wherein in an event that the context is not recognizable and/or otherwise affected based on the dynamic object (as calculated in an uncertainty estimate), a fallback motion planner can be triggered. Additionally or alternatively, maps can be dynamically and/or iteratively produced to account for any or all of the features and/or objects.
- Additionally or alternatively, the context assignments can be determined based on route information (e.g., fixed route information), such as a set of fixed routes prescribed for the autonomous agent to take. The route information can include any or all of: a starting location of the route, a destination of the route, a directionality of the autonomous agent along the route, and/or any other information. In variations involving fixed routes, for instance, the contexts assigned to the map are preferably selected based on the vehicle's progression along the route and the contexts that the vehicle would sequentially encounter in doing so. In specific examples, for instance, an intersection at which the agent is planned to pass straight through in a first fixed route may be assigned a different context than the same intersection at which the agent is planned to turn right at in a second fixed route.
- Additionally or alternatively, any or all of the contexts can be determined independently of a route and/or a fixed route.
- Additionally or alternatively, one or more contexts identified in S220 can be determined based on a map without assigned and/or prescribed contexts, such as in variations in which the context is identified with one or more programmed processes and/or machine learning processes (e.g., as described in S220) based on a set of inputs (e.g., those described in S210). In some variations, for instance, a context is identified with a context identifier module (e.g., as part of the agent's planning module, as part of the computing system, etc.), such as that shown in
FIG. 13 , wherein the context identifier module can receive inputs from the map such as road infrastructure information (e.g., location of, size of, parameters associated with, etc.) and/or static environment features, which can individually and/or collectively include, but is not limited to, any or all of: road signs, lane lines, buildings, railroad tracks, bus routes, and/or other infrastructure information. - The map can optionally include (e.g., assign, prescribe, etc.) one or more transition zones which are arranged between different contexts, and can indicate, for instance, a change in context (e.g., along a fixed route, along a dynamically determined route, etc.), thereby enabling a switching of contexts to occur smoothly (e.g., by defining an action space). Assigning transition zones can function, for instance, to define an action space subsequently in the method which smoothly transitions the vehicle from one context to the next (e.g., preventing the availability of certain actions, prescribing that the agent maintain his or her lane, preventing a turn, etc.). The transition zones can be any or all of: overlapping with (e.g., partially overlapping with, fully overlapping with, etc.) one or more contexts; non-overlapping with one or more contexts; and/or any combination of overlapping and non-overlapping. Additionally or alternatively, the transition zones can be contexts themselves; the method can be performed in absence of labeled transition zones (e.g., by anticipating the subsequent context); and/or be otherwise performed.
- In a first variation, S205 includes hand labeling a high definition map to prescribe a set of contexts, further preferably a series of contexts, for at least a set of fixed routes available to an autonomous agent. S205 further preferably includes hand labeling a set of static objects and/or road features associated with the routes. In specific examples, each region in the map is assigned a single context (e.g., for all routes, for a specific route, etc.). Additionally or alternatively, each region in the map can be assigned multiple contexts (e.g., for overlapping routes, to be selected from, etc.).
- In a second variation, S205 includes labeling a map without prescribed contexts, wherein the context is determined subsequently in the method 200 (e.g., with one or more machine learning models). In specific examples, the map is labeled with road features and/or landmarks, which are subsequently used (e.g., with sensor information) to determine a context.
- Additionally or alternatively, S205 can be performed in another suitable way or the
method 200 can be performed in absence of S205. - The
method 200 includes receiving a set of inputs S210, which functions to receive information with which to determine a context for the autonomous agent. Additionally or alternatively, S210 can function to receive information with which to perform any suitable processes of the method (e.g., determine an action and/or behavior, determine a trajectory of the agent, etc.). Additionally or alternatively, S210 can include determining (e.g., calculating) any or all of the set of inputs, combining inputs (e.g., in one or more sensor fusion processes), processing and/or preprocessing the set of inputs, and/or any other suitable processes. - S210 is preferably performed throughout the
method 200, such as any or all of: continuously, at a predetermined frequency, at random intervals, prior to each of a set of processes of themethod 200, and/or at any other suitable times. S210 can additionally or alternatively be performed in response to a trigger (e.g., based on the map, based on sensor information, etc.), at random intervals, and/or at any other suitable time(s) during themethod 200. - The set of inputs received in S210 preferably includes sensor information collected at a sensor subsystem of the autonomous agent, such as any or all of: a sensor system onboard the autonomous agent, a sensor system remote from the autonomous agent, and/or a sensor system in communication with the autonomous agent and/or a computing system (e.g., onboard computing system, remote computing system, etc.) of the autonomous agent. Additionally or alternatively, the sensor information can be collected from any other suitable sensor(s) and/or combination of sensors, S210 can be performed in absence of collecting sensor inputs, and/or S210 can be performed in any other suitable way(s).
- The sensor information preferably includes location information associated with the autonomous agent, such as any or all of: position, orientation (e.g., heading angle), pose, geographical location (e.g., using global positioning system [GPS] coordinates, using other coordinates, etc.), location within a map, and/or any other suitable location information. In preferred variations, for instance, S210 includes receiving pose information from a localization module of the sensor subsystem, wherein the localization module includes any or all of: GPS sensors, IMUs, LIDAR sensors, cameras, and/or any other sensors (e.g., as described above). Additionally or alternatively, any other sensor information can be received from any suitable sensors.
- The sensor information can additionally or alternatively include motion information and/or other dynamic information associated with the autonomous agent, such as, but not limited to, any or all of: velocity/speed, acceleration, and/or any other suitable information.
- The sensor information further preferably includes location information and/or motion information associated with one or more dynamic objects in an environment of the autonomous agent, such as any or all of the location information described above, location information relative to the autonomous agent, motion information of the dynamic objects, predicted information (e.g., predicted trajectory), historical information (e.g., historical trajectory), and/or any other suitable information. The dynamic objects can include, but are not limited to, any or all of: other vehicles (e.g., autonomous vehicles, non-autonomous vehicles, 4-wheeled vehicles, 2-wheeled vehicles such as bicycles, etc.), pedestrians (e.g., walking, running, rollerblading, skateboarding, etc.), animals, and/or any other moving objects (e.g., ball rolling across street, rolling shopping cart, etc.). Additionally or alternatively, the sensor information can include any other information associated with one or more dynamic objects, such as the size of the dynamic objects, an identification of the type of object, other suitable information, and/or the information collected in S210 can be collected in absence of dynamic object information.
- The sensor information can optionally include location information and/or other information associated with one or more static objects (e.g., stationary pedestrians, road infrastructure, construction site and/or construction equipment, barricade(s), traffic cone(s), parked vehicles, etc.) in an environment of the autonomous agent, such as any or all of the information described above (e.g., identification of object type, etc.). Additionally or alternatively, the sensor information can include any other information associated with one or more static objects and/or the information collected in S210 can be collected in absence of static object information.
- The set of inputs received in S210 further preferably includes the map and/or any information determined from (e.g., determined based on, derived from, included in, etc.) the map, such as any or all of the information described above in S205. In some variations, this includes one or more contexts (and/or transition zones) selected based on (e.g., predetermined/assigned to) a region/location of the autonomous agent (e.g., as determined based on sensor information as described above). In additional or alternative variations, the map information includes any or all of: road infrastructure information and/or other static environment information, route information, and/or any other suitable information.
- Information associated with the map can optionally be determined based on other information received in S210, such as any or all of the sensor information received at one or more sensor systems. For instance, location information (e.g., current pose, current position, current geographical location, etc.) associated with the autonomous agent can be used to locate the agent within the map (e.g., determine its position within a hand labeled map), which is used to determine the map information relevant to the autonomous agent. In a first set of variations, this information is a particular context selected (e.g., assigned to) based on the location of (e.g., a region including the location, a stretch of road on which the agent is located, a particular intersection in which the agent is located, etc.) the autonomous agent. In a second set of variations, the location of the autonomous agent in comparison with the map includes information associated with the road infrastructure (e.g., road signs, lane lines, buildings, etc.), which is used, preferably along with other inputs (e.g., static environment features, static object information, autonomous agent vehicle state, dynamic environment features, etc.) to determine a context (e.g., using one or more learning-based models, using a pattern recognition and/or classification model, etc.) for the autonomous agent.
- Additionally or alternatively, the map information can include any other information (e.g., a set of possible contexts, a set of parameters and/or weights for an algorithm and/or model, etc.) and/or be received in any other way, such as, but not limited to, any or all of: independently of other information received in S210, concurrently with other information received in S210, prior to other information received in S210, subsequent to other information received in S210, multiple times in S210, and/or at any other suitable time(s).
- The set of inputs can optionally include a route and/or associated route information (e.g., route identifier of a fixed route, agent's progression through route, etc.) assigned to and/or being traversed by the agent (e.g., fixed route selected for the agent, dynamic route being traveled by the agent, predicted route for the agent, etc.), which can function for instance, to select information from one or more maps and/or to select a particular map (e.g., a route-specific map). The map information selected based on route can include, but is not limited to, any or all of: a current context, a future context (e.g., next context in a fixed route), a transition zone, and/or any other suitable information from a map. In variations in which a context is selected based on a map, route information can be used to select the appropriate context for the particular route in regions in which multiple routes are overlapping (e.g., at an intersection which multiple routes pass through). The route information (e.g., previous contexts of route, historical information, fixed route identifier, destination, starting point, directionality of route, etc.) can be used, for instance, to select the proper context from multiple context options. Additionally or alternatively, the route information can be used in other processes of the method 200 (e.g., in defining an action space based on the context, in selecting a behavior from the action space, in determining the agent's trajectory, etc.), any other suitable information can be determined based on route information, the set of inputs can be collected independently of and/or in absence of route information, and/or S210 can be performed in any other suitable ways.
- The set of inputs S210 can additionally or alternatively include any other suitable information, such as, but not limited to, any or all of: a state (e.g., operational state, driving state, etc.) of the autonomous agent, a trajectory of the agent, a set of control commands for the agent, historical information associated with the agent and/or an environment of the agent, predicted information associated with the agent and/or the environment (e.g., predicted trajectories of dynamic objects), and/or any other suitable information and/or inputs.
- In a first set of variations (e.g., as shown in
FIG. 12 ), S210 includes receiving a map specifying a set of assigned contexts for an agent; optionally a route (e.g., fixed route) of the agent; and sensor information from a set of sensors onboard the autonomous agent, wherein the sensor information includes at least a pose of the autonomous agent, wherein the pose and optionally the route are used to select a context for the agent based on the map. Additionally or alternatively, S210 can include receiving any other suitable inputs. - In a set of specific examples, S210 includes receiving a selected route for the vehicle, a high definition hand-labeled map specifying a context for each portion of the route (and optionally other routes), and sensor information including at least location information associated with the vehicle and optionally any or all of: motion information, object information (e.g., dynamic object information, static object information, etc.), and/or any other suitable information.
- In a second set of variations, S210 includes receiving a map including information associated with road infrastructure, such as the road infrastructure along one or more routes of the agent (e.g., a fixed route of the agent) and optionally any or all of: other map information (e.g., speed limit information, traffic laws, etc.); a state of the agent, equivalently referred to herein as an ego state (e.g., as determined by a sensor subsystem; pose, velocity, and acceleration; etc.); static environment features and/or information; dynamic environment features and/or information; sensor information; and/or any other suitable information, wherein the context is determined with one or more models and/or algorithms (e.g., convolutional neural networks [CNNs], recurrent neural networks [RNNs], support-vector machines [SVMs], etc.).
- In a set of specific examples, a context identifier module, which includes one or more deep learning models, receives as input a map and optionally route information for the agent, a state of the agent, static environment features, and dynamic environment features, with which the context identifier module determines a context for the agent.
- In a third set of variations, S210 includes receiving a set of multiple possible contexts for the agent based on a map, wherein a context of the set of multiple contexts is determined based on other inputs received in S210.
- Additionally or alternatively, S210 can include any other suitable processes.
- 4.3 Method—Determining a Context Associated with the Autonomous Agent Based on the Set of Inputs S220
- The
method 200 includes determining a context associated with the autonomous agent S220, which functions to specify the context in which the autonomous agent is operating, and can further function to: select a learning module based on the context (e.g., according to a 1:1 mapping), define and/or a limit a set of behaviors or actions available to the agent, specify particular parameters (e.g., creep distance) associated with the behaviors and/or actions, reduce and/or minimize the amount of data required to training the learning modules, and/or perform any other suitable function(s). - S220 is preferably performed in response to (e.g., after, based on, etc.) S210, but can additionally or alternatively be performed as part of S210 and/or concurrently with S210, in place of S210, in absence of S210, multiple times throughout the method, and/or at any other time(s) during the
method 200. Further additionally or alternatively, themethod 200 can be performed in absence of S220. - A context refers to a high level driving environment of the agent, which can inform and restrict the vehicle's decision at any given time and/or range of times. The context can include and/or define and/or be determined based on any or all of: a region type of the vehicle (e.g., residential, non-residential, highway, school, commercial, parking lot, etc.); a lane feature and/or other infrastructure feature of the road the vehicle is traversing (e.g., number of lanes, one-way road, two-way road, intersection, two-way stop and/or intersection, three-way stop and/or intersection, four-way stop and/or intersection, lanes in a roundabout, etc.); a proximity to one or more static objects and/or environmental features (e.g., particular building, body of water, railroad track, parking lot, shoulder, region in which the agent can pull over/pull off to the side of a road, etc.); a proximity a parameter associated with the location (e.g., speed limit, speed limit above a predetermined threshold, speed limit below a predetermined threshold, etc.); road markings (e.g., yellow lane, white lane, dotted lane line, solid lane line, etc.); and/or any other suitable information.
- Examples of contexts can include, but are not limited to, any or all of: a two-way, two-lane residential road (e.g., in which the agent cannot change contexts due to road geometry as shown in
FIG. 4 ); a two-way, two-lane non-residential road; a multi-lane highway (e.g., in which the agent can learn it is less likely to see pedestrians); a one-way, single-lane road; a one-way, two-lane road; a one-way road with “n” number (e.g., 1, 2, 3, 4, 5, 6, greater than 6, etc.) of lanes; a two-way road with “n” number (e.g., 1, 2, 3, 4, 5, 6, greater than 6, etc.) of lanes; a single lane road in a parking lot; a single lane road with a yellow boundary on the side; a multi-lane fast moving road (e.g., having a speed above a predetermined threshold); an on ramp of a highway; an off-ramp of a highway; regions connecting to roads (e.g., parking lot, driveway, etc.); and/or any other suitable contexts. - S220 is preferably performed based on a set of inputs received in S210, but can additionally or alternatively be performed based on any other suitable information.
- S220 is preferably performed by reading a map (e.g., as described above) to determine the context assigned to a point and/or area corresponding to the location of the agent (e.g., pose) and/or a route of the agent. As described previously, each point and/or region can be any or all of: associated with at most 1 context (in a 1:1 mapping), associated with at most 1 context per route (e.g., wherein a fixed route assignment of the agent is used to select the proper context), associated with multiple contexts (e.g., which are ranked, prioritized, selected from based on other inputs received in S210), and/or otherwise associated. In specific examples, the context assignments are hard-coded into the map. Alternatively, the context assignments can be soft-coded and/or otherwise assigned.
- Additionally or alternatively, S220 can be performed with any number of algorithms, models (e.g., machine learning models, deep learning models, supervised learning models, unsupervised learning models, semi-supervised learning models, statistical models, pattern recognition models, etc.), finite state machines (FSMs), processes (e.g., traditionally programmed process), decision trees, and/or equations.
- In some variations, for instance, one or more machine learning models, such as, but not limited to: one or more neural networks (e.g., CNNs, RNNs, etc.); SVMs; and/or any other suitable models, are implemented to determine a context (e.g., as a context identifier module of
FIG. 13 ) of the agent based on any or all of the inputs in S210. - Determining a context can optionally include switching between contexts, which preferably includes determining a transition between a current context and a future context. The transition can be in the form of any or all of: a transition zone (e.g., as described above) prescribed in the map; a change in contexts; a transition action/behavior (e.g., lane changing, merging, exiting a highway, etc.); a transition trajectory (e.g., trajectory taken by the vehicle to change from a 1st lane into a 2nd lane, etc.); and/or any can prescribe any other motion for the vehicle. The transition is preferably determined (e.g., prescribed) based on the map (e.g., as indicated as a transition zone, as indicated through distinct adjacent contexts in the map, as indicated through distinct sequential contexts in a fixed route, etc.), but can additionally or alternatively be dynamically determined, determined with a trajectory planner, determined based on sensor information, and/or otherwise determined.
- S220 can optionally include selecting a scenario based on the context, which functions to further specify the context, such as based on any or all of the information described above (e.g., speed limit, sensor information of objects surrounding vehicle, etc.). Examples of scenarios for a first context (e.g., a two-way residential road) include, but are not limited to, any or all of: a right turn opportunity; an addition of a right turn lane; a stop sign; a traffic light; a yield sign; a crosswalk; a speed bump; and/or any other scenarios. Examples of scenarios for a second context (e.g., a multi-lane highway) include, but are not limited to, any or all of: lane changing; merging; overtaking a slow moving vehicle; and/or any other scenarios. In some variations, for instance, the context triggers the selection of a model and/or algorithm (e.g., a highly-tuned, context-aware custom inverse reinforcement learning (IRL) algorithm), which makes high-level scenario selection and calls a scenario-specific learning module (e.g., as described below) to select an action of the vehicle. Additionally or alternatively, any other suitable algorithms or processes for selecting a scenario can be implemented, an action can be selected in absence of a scenario, a context can be used to select another parameter, and/or S220 can be otherwise performed.
- In a first set of variations, S220 includes selecting a context based on a location and/or orientation of the vehicle (e.g., pose), a labeled map, and optionally any or all of the other information received in S210, wherein the context informs how the remaining processes of the method are performed. Optionally, the context then triggers the selection of a particular scenario (e.g., based on a context-specific IRL algorithm).
- In a set of specific examples (e.g., as shown in
FIG. 14 ), a labeled map indicates a sequential series of contexts for each of set of routes (e.g., C1 to C2 to C3 forRoute 1; C1′ to C2′ to C3′ to C4′ to C5′ forRoute 2; etc.), wherein in an event that the routes overlap in a particular section and/or point (e.g., intersection), a route assignment (e.g.,Route 1 vs. Route 2) and/or directionality of the agent (e.g., West vs. East) can be used to select the proper context. The map can optionally further include transition zones (e.g., having a non-zero size, having a size of zero and indicating an immediate transition, etc.) between adjacent regions of different context(s) and/or any other information. - In an additional or alternative set of specific examples (e.g., as shown in
FIG. 15 ), a labeled map assigns a context to each of a set of regions of a labeled map, wherein a location of the agent within the region (and optionally a route assignment and/or directionality of the agent) determines the context for the agent. The map can optionally further include transition zones (e.g., having a non-zero size, having a size of zero and indicating an immediate transition, etc.) between adjacent regions of different context(s) and/or any other information. - In a second set of variations, S220 includes identifying a context associated with the agent with a context identifier module, the context identifier module including one or more trained models (e.g., machine learning model(s), deep learning model(s), etc.), which receives as input any or all of the information received in S210.
- Additionally or alternatively, S220 can include any other suitable processes and/or be performed in any other suitable ways.
- 4.4 Method—Selecting a Learning Module Based on the Context and/or Scenario S230
- The
method 200 can include selecting a learning module based on the context and/or scenario S230, which functions to enable an action (equivalently referred to as a behavior) to be determined for the agent which takes into account the particular context (and optionally scenario) of the vehicle. S230 can additionally or alternatively function to define an action space available to the agent, inform a trajectory of the agent as determined by a trajectory planner, eliminate one or more actions from consideration by the agent (e.g., minimize a number of available actions to an agent), and/or can perform any other suitable functions. - S230 is preferably performed in response to (e.g., after, based on, etc.) S220, but can additionally or alternatively be performed as part of S220 and/or concurrently with S220, in place of S220, in absence of S220, in response to S210, multiple times throughout the method, and/or at any other time(s) during the
method 200. Further additionally or alternatively, themethod 200 can be performed in absence of S230. - S230 preferably includes selecting a learning module (equivalently referred to herein as a context-aware learning agent or a deep decision network) which includes a set of machine learning (e.g., deep learning) models and/or algorithms, wherein the learning module is trained based on data associated with that particular context. This functions to divide a large amount of data from all possible contexts into a set of manageable amounts, which cover all or nearly all of the situations the agent would encounter in that context.
- Each context is preferably mapped in a 1:1 fashion to a learning module. Additionally or alternatively, a context can be associated with multiple learning modules (e.g., where results from multiple modules are aggregated, where a single learning module is then selected, etc.); a learning module can be associated with multiple contexts; and/or the contexts and learning modules can be otherwise mapped.
- S230 can optionally include receiving a set of inputs, which can include any or all of the set of inputs described above, a different and/or additional set of inputs, and/or any other suitable inputs. In a set of variations, for instance, S230 includes receiving any or all of: the inputs described above; the context and/or scenario of the agent; the set of vehicles and/or other dynamic objects surrounding the vehicle, the predicted paths (e.g., where will it be in lane and in which lane, etc.) of the dynamic objects, static objects surrounding the agent; uncertainty values (e.g., of the predicted paths); routing information associated with the agent; and/or any other suitable inputs.
- In some variations (e.g., as shown in
FIG. 16 ), an environmental representation of the agent is received with the context at a deep decision network selected based on the context. In an example of this shown inFIG. 16 , the environmental representation (referred to as a full environmental representation) includes a latent space representation of a set of inputs (e.g., as described in S210, additional or alternative to those described in S210, etc.), the set of inputs including any or all of: a state of the agent (equivalently referred to herein as an ego vehicle state), one or more maps, routing information (e.g., a selected fixed route, parameters associated with a selected fixed route, etc.), dynamic object information/features, static object information/features, and/or any other suitable information. One or more models (e.g., machine learning models, deep learning models, RNNs, etc.) and/or processes and/or algorithms can optionally be used to process any or all of these inputs (e.g., to determine a latent space representation, to determine another output, to simplify the input(s) to the deep decision network, etc.). In specific examples (e.g., as shown inFIG. 16 ), for instance, a first neural network (e.g., one or more RNNs, one or more CNNs, a combination of RNNs and CNNs, Order Independent Representation inFIG. 16 , etc.) is used process object features (e.g., dynamic, static, both, etc.) in the environment of the agent, which can function, for instance to produce an intermediate representation (e.g., abstraction) of object features. In specific examples, the first neural network is a CNN including a set of one or more convolution and/or pooling layers. Additionally or alternatively, any other suitable models can be used. This representation can optionally be combined with (e.g., appended with in a vector and/or matrix) other environmental information/inputs (e.g., route information, map, agent/ego pose, etc.) and/or outputs from other models, such as a second neural network (e.g., an RNN, a CNN, Ego RNN inFIG. 16 , etc.) which processes the ego vehicle state. Any or all of this information can then be fed into a latent space network (e.g., RNN, CNN, etc.), which is used to determine the environmental representation based on the outputs of these other networks and optionally map information, routing information, and/or any other suitable inputs. The latent space network is preferably in the form of and/or includes one or more autoencoders (with one or more encoders, code, and decoders), but can additionally or alternatively include any or all of: other unsupervised learning models, supervised learning models, semi-supervised learning models, and/or any other suitable neural networks and/or models. In specific examples, the latent space network is a fully-connected feedforward neural network. The output of the latent space representation, which is preferably a low-dimensional features vector (but can additionally or alternatively be any other output) is preferably then used as an input to the learning module. - Additionally or alternatively, any other suitable models and/or algorithms can be implemented, the input and/or outputs can be different for these models and/or algorithms, and/or any suitable model architecture can be implemented.
- The learning module is preferably in the form of a machine learning model, further preferably in the form of one or more neural networks and/or models (e.g., convolutional neural network [CNN], inverse reinforcement learning [IRL] model, reinforcement learning [RL] model, imitation learning [IL] model, etc.) trained for a particular context and/or contexts, but can additionally or alternatively include any other suitable models, algorithms, decision trees, lookup tables, and/or other tools.
- In preferred variations, each of the learning modules is a neural network, further preferably a deep Q-learning network (e.g., IRL algorithm/network), wherein the number of layers (e.g., hidden layers) of the neural network can vary for different contexts and/or actions (e.g., between 3-8 layers, 3 or less layers, 8 or more layers, between 2 and 10 layers, between 1 and 15 layers, etc.). Additionally or alternatively, any other suitable networks, algorithms, and/or models can be used in the learning module(s), such as, but not limited to, any or all of: policy gradient methods, finite state machines [FSMs], probabilistic methods (e.g., Partially Observable Markov Decision Process [POMDP]), imitation learning [IL], RL or variations of IRL, and/or any other suitable models and/or networks and/or algorithms. Each of the learning modules is preferably the same type of neural network (e.g., with different numbers of layers) and/or algorithm and/or model, but can alternatively be different.
- Each of the learning modules is preferably trained based on data occurring within the particular context type or context types associated with the learning module and optionally further trained based on data occurring within one or more fixed routes which pass through the context. In some variations, for instance, a single learning module applies to a particular context type, wherein the single learning module is trained based on different versions of that context. In other variations, a single learning module applies to a particular context within a particular route, wherein the single learning module is trained based on data associated with that particular context in the particular fixed route. Additionally or alternatively, the learning module(s) can be trained on any suitable data.
- Each of the learning modules is further preferably trained with inverse reinforcement learning, which functions to determine a reward function and/or an optimal driving policy for each of the context-aware learning agents. The output of this training is further preferably a compact fully-connected network model that represents the reward function and an optimal policy for each learning module. Additionally or alternatively, the learning modules can be otherwise suitably trained and/or implemented.
- In a first variation, S230 includes selecting a context-aware learning agent (equivalently referred to herein as a context-aware learning module) based on a determined context of the agent, wherein a single context-aware learning agent is assigned to each context. The context-aware learning agent is preferably trained with an inverse reinforcement learning technique, but can additionally or alternatively be otherwise trained.
- In a second variation, S230 includes selecting from multiple context-aware learning agents assigned to and/or available to a particular context, wherein the particular context-aware learning agent is selected based on any or all of: machine learning, a decision tree, statistical methods, an algorithm, and/or with any other suitable tool(s).
- Additionally or alternatively, any other suitable learning modules can be selected, used, and/or trained.
- 4.5 Method—Defining an Action Space Based on the Learning Module S240 and Selecting an Action from the Action Space S250
- The
method 200 can include defining an action space based on the learning module S240, which functions to define a set of actions (equivalently referred to herein as behaviors) available to the agent in light of the vehicle's context and/or environment. Additionally or alternatively, S240 can function to minimize a number of available actions to the agent as informed by the context, which functions to simplify the process (e.g., reduce the time, prevent selection of an incompatible action, etc.) required to select an action for the vehicle. Themethod 200 can optionally additionally or alternatively include selecting an action from the action space S250, which functions to determine a next behavior (e.g., switching and/or transitioning to a different behavior than current behavior, maintaining a current behavior, etc.) of the vehicle. - S240 and/or S250 are preferably performed in response to (e.g., after, based on, etc.) S230, but can additionally or alternatively be performed as part of S230 and/or concurrently with S230, in place of S230, in absence of S230, in response to S220 and/or S210, multiple times throughout the method, and/or at any other time(s) during the
method 200. Further additionally or alternatively, themethod 200 can be performed in absence of S240 and/or S250. - The action space and/or action is preferably produced as an output (e.g., intermediate output, final output, etc.) of the learning module; additionally or alternatively, the learning module can produce any other suitable outputs. In preferred variations, a determination of the context and processing based on this context (e.g., the specific learning module) allows the action space to be relatively small (e.g., relative to all available actions).
- The actions can include, but are not limited to, any or all of: maintaining a lane, changing lanes, turning (e.g., turning right, turning left, performing a U-turn, etc.), merging, creeping, following a vehicle in front of the agent, parking in a lot, pulling over, nudging, passing a vehicle, and/or any other suitable actions such as usual driving actions for human-operated and/or autonomous vehicles.
- Each action is preferably associated with a set of parameters, which are determined based on the particular context of the agent and optionally any other suitable inputs (e.g., sensor information). This highlights a benefit of this architecture, which enables various parameter values to be associated with an action, wherein the context specifies the particular value, thereby enabling the action learned for different contexts to be different yet predictable. In contrast, in conventional methods where the method is entirely programmed, for instance, one would need to either generalize the parameter (e.g., creep distance) to have an overly conservative value or program multiple values for different cases; and in methods including only learning based approaches, this would lead to an oversimplification of the action across cases, which could result in unpredictable agent behavior at times (e.g., robotic behavior, the production of an infeasible trajectory, etc.).
- For preferred variations of this method, the extra information and restriction from the context type can reduce the amount of data that is needed to train the different learning approaches and better tune the agent to a specific context to increase accuracy and confidence.
- In preferred variations, an output layer of each deep decision network is a softmax layer where the number of output nodes is the number of available actions. Additionally or alternatively, an action space and/or available actions can be determined in any other suitable way(s).
- In a specific example, a multi-lane highway context produces a corresponding action space including: maintaining speed, lane change left, and lane change right. In contrast, a different context such as a residential road produces actions such as those in the highway context and additional actions such as stop, yield, creep, left turn, and right turn.
- In additional or alternative variations, an output layer (e.g., linear output layer) can be used to generate an embedding (e.g., a vector, a vector of real numbers, etc.) for the action, wherein the embedding could be matched to stored embeddings associated with particular actions (e.g., at a lookup table). In specific examples, for instance, a length and/or angle of an embedding vector produced by an output layer can be used to match it to a vector associated with a particular action.
- Selecting an action in S250 can be performed by the context-aware learning agent, performed with another model and/or algorithm and/or process, determined based on other information (e.g., any or all of the set of inputs from S210, based on the particular route, based on a next context in the map, etc.), and/or otherwise determined.
- In preferred variations, the action is produced as an output (e.g., single output, multiple outputs, etc.) of the context-aware learning agent.
- In additional or alternative variations, the action can be determined based on a state machine or other rule-based method for choosing an action based on context.
- In a first variation, the context of the agent is determined from a map to be a one-lane residential road (e.g., in which the agent cannot change contexts due to road geometry as shown in
FIG. 4 ). A set of actions determined for this context can include, for instance: maintaining speed, creeping, left turning, right turning, and yielding. For creeping (e.g., as shown inFIG. 5 ), a major parameter is creep distance, which refers to the distance the agent should creep forward with extra caution (e.g., before deciding to merge). For instance, humans tend to creep at a stop sign or before merging on a highway to cautiously gauge any oncoming traffic and pace the speed of the vehicle to merge without collisions or annoyance to road users. Depending on the particular context and optionally action, the value of this parameter is different. In specific examples (e.g., as shown inFIG. 5 ), for the context of a parking lot and the action of turning right and/or stopping at a stop sign, the creep distance is 2 meters, whereas for the context of a multi lane highway and the action of merging, the creep distance is 17 meters. - In a second variation, the context of the agent is determined to be a multi-lane highway in which the agent can learn (e.g., in the learning module) it is less likely to see pedestrians. The actions of the action space can include, for instance: lane swap left, lane swap right, maintain speed, and stop.
- The
method 200 can include planning a trajectory based on the action S260, which functions to enable the agent to perform the selected action (e.g., as described above). - The trajectory preferably specifies the set of locations and associated speeds for the agent to be at in order to perform the selected action. The trajectory is preferably generated based on one of a set of trajectory learning modules (e.g., different than the learning module described above, separate from but including the same and/or a similar architecture as the learning modules described above, etc.), but can additionally or alternatively be generated with any other suitable tools, programmed or learned.
- S260 can optionally additionally or alternatively include any or all of: validating the trajectory, implementing a fallback mechanism, operating the vehicle according to a trajectory, determining control commands with which to operate the vehicle based on a trajectory, and/or any other suitable output.
- In a first variation of the
method 200, the method includes: receiving a set of inputs S210, the set of inputs including at least map (e.g., high definition hand-labeled map, map labeled in an automated fashion, map labeled both manually and in an automated fashion, etc.), a vehicle location (e.g., pose), and optionally a route planned for the vehicle; determining a context prescribed by the map based on the location of the agent; selecting a learning module including a neural network based on the context; defining an action space including a set of actions available to the agent with the learning module; and selecting an action from the action space (e.g., with the learning module). Additionally or alternatively, themethod 200 can include any other suitable processes (e.g., determining a trajectory based on the action). - In a specific example, the
method 200 includes: receiving a set of inputs, wherein the set of inputs includes a hand labeled, high definition map prescribing a set of contexts, further preferably a series of contexts, for at least a fixed route of the autonomous agent, wherein the set of inputs further includes sensor information from a set of sensors onboard the autonomous agent, wherein the sensor information includes at least a pose of the autonomous agent, wherein the pose and optionally the route are used to select a context for the agent based on the map, and optionally any other suitable inputs; selecting a context based on a location and/or orientation of the vehicle (e.g., pose), the labeled map, and optionally any or all of the other information received, wherein the context informs how the remaining processes of the method are performed; selecting a context-aware learning agent based on the context, wherein a single context-aware learning agent is assigned to each context and trained (e.g., with an inverse reinforcement learning model); defining an action space and selecting an action based on the learning module; and determining a trajectory for the vehicle based on the action space and/or action. - In a specific implementation as shown in
FIGS. 7A-7D , the set of inputs includes receiving a route as shown inFIG. 7A ; determining a first context shown inFIG. 7B based on the route and the map, wherein from the start of the trip, the vehicle is located in a parking lot type context with a single lane of traffic and expected heavy foot traffic. This foot traffic is usually localized to sidewalk however in this context pedestrian cutoff events are highly probable. To handle this context, the learning agent is optimized to be significantly more aware of pedestrians and their actions and as such the reward function is tuned to achieve this. The available actions to this agent are: maintaining a predetermined speed (e.g., speed bounded by any leading dynamic objects in agent's path and the speed limit of the current road); yielding behavior relative to a set of dynamic objects (e.g., which may have a precedence over the agent); staying stopped in the current location; and providing a stopping location which the ego vehicle must stop by. The second context that the vehicle encounters along this route, shown inFIG. 7C , is the context of a single lane residential road. This context is quite large including handling single lane traffic lights and stop sign intersection, pedestrian's crossing and right-hand turns. The decisions required to handle this context are very similar to that of the first context, the main difference being what the algorithm is trained on and that the reward function is tuned for two different sets of behaviors. The final context encountered by the agent, shown inFIG. 7D , is a multi-lane residential context, which includes everything that the single lane context needs to handle but in multiple lanes. Thus it needs to understand how to switch lanes, and how to handle turning on multi lane intersection. To handle this, in addition to the actions from the second context, it also contains the following two actions in its action space: changing to the left lane (when safe to do so) and changing to the right lane (when safe to do so). - In a second variation of the
method 200, the method includes: receiving a set of inputs S210, the set of inputs including at least map (e.g., high definition hand-labeled map, map labeled in an automated fashion, map labeled both manually and in an automated fashion, etc.), a vehicle location (e.g., pose), and optionally a route planned for the vehicle; determining a context for the agent with a context identifier module based on the inputs; selecting a learning module including a neural network based on the context; defining an action space including a set of actions available to the agent with the learning module; and selecting an action from the action space (e.g., with the learning module). Additionally or alternatively, themethod 200 can include any other suitable processes (e.g., determining a trajectory based on the action). - Additionally or alternatively, the
method 200 can include any other suitable processes and/or be performed in any suitable way(s). - Although omitted for conciseness, the preferred embodiments include every combination and permutation of the various system components and the various method processes, wherein the method processes can be performed in any suitable order, sequentially or concurrently.
- As a person skilled in the art will recognize from the previous detailed description and from the figures and claims, modifications and changes can be made to the preferred embodiments of the invention without departing from the scope of this invention defined in the following claims.
Claims (20)
1. A method for operation of an autonomous vehicle along a route, the method comprising:
collecting a set of sensor inputs from a set of sensors onboard the autonomous vehicle;
processing the set of sensor inputs to determine a location associated with the autonomous vehicle;
with a deep neural network, limiting a set of available actions for the vehicle based on the location to produce a limited set of available actions;
selecting an action from the limited set of available actions; and
operating the autonomous vehicle along the route based on a trajectory which executes the selected action.
2. The method of claim 1 , wherein the limited set of available actions is further determined based on a predetermined map, wherein the predetermined map includes a set of labels corresponding to each of a set of locations, the set of locations comprising the location.
3. The method of claim 1 , wherein each of the set of routes is fixed.
4. The method of claim 3 , wherein the limited set of available actions is further determined based on a map, wherein the map includes the set of routes, wherein at least a portion of routes in the set of routes are overlapping relative to each other.
5. The method of claim 4 , wherein the limited set of available actions is further determined based on an orientation of the autonomous vehicle, the orientation of the vehicle determined based on processing the set of sensor inputs.
6. The method of claim 3 , wherein each of the set of routes is limited to a distance of less than 100 miles.
7. The method of claim 6 , wherein each of the set of routes is limited to a distance of less than 20 miles.
8. The method of claim 6 , wherein each of the set of routes is arranged between a distribution center and a retail store.
9. The method of claim 1 , wherein the deep neural network comprises a recurrent neural network.
10. The method of claim 1 , wherein the limited set of available actions is further determined based on a set of static environmental features.
11. The method of claim 10 , wherein the limited set of available actions is further determined based on a set of dynamic environmental features.
12. A method for operation of an autonomous vehicle along a fixed route, the method comprising:
receiving a set of sensor inputs from a set of sensors onboard the autonomous vehicle;
processing the set of sensor inputs to determine a location of the autonomous vehicle, wherein the location of the autonomous vehicle is along the fixed route, the fixed route arranged between a distribution center and a retail store;
with a neural network, determining a set of available actions for the vehicle based on the location;
selecting an action from the set of available actions; and
operating the autonomous vehicle along the fixed route based on a trajectory which executes the selected action.
13. The method of claim 12 , wherein the set of available actions for the autonomous vehicle is further determined based on the fixed route.
14. The method of claim 13 , wherein the set of available actions for the autonomous vehicle is further determined based on a map.
15. The method of claim 14 , wherein the map comprises the fixed route.
16. The method of claim 12 , wherein the neural network comprises a deep neural network.
17. The method of claim 16 , wherein the deep neural network comprises a recurrent neural network.
18. The method of claim 12 , further comprising determining the trajectory based on a second neural network, the second neural network separate and distinct from the first neural network.
19. The method of claim 12 , wherein each of the set of routes is limited to a distance of less than 100 miles.
20. The method of claim 19 , wherein each of the set of routes is limited to a distance of less than 20 miles.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/858,024 US11505208B1 (en) | 2020-06-05 | 2022-07-05 | Method and system for context-aware decision making of an autonomous agent |
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063035401P | 2020-06-05 | 2020-06-05 | |
US202063055756P | 2020-07-23 | 2020-07-23 | |
US17/116,810 US11034364B1 (en) | 2020-06-05 | 2020-12-09 | Method and system for context-aware decision making of an autonomous agent |
US17/306,014 US11267485B2 (en) | 2020-06-05 | 2021-05-03 | Method and system for context-aware decision making of an autonomous agent |
US17/332,839 US11260882B2 (en) | 2020-06-05 | 2021-05-27 | Method and system for context-aware decision making of an autonomous agent |
US17/584,062 US11396307B2 (en) | 2020-06-05 | 2022-01-25 | Method and system for context-aware decision making of an autonomous agent |
US17/846,870 US11745758B2 (en) | 2020-06-05 | 2022-06-22 | Method and system for context-aware decision making of an autonomous agent |
US17/858,024 US11505208B1 (en) | 2020-06-05 | 2022-07-05 | Method and system for context-aware decision making of an autonomous agent |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/846,870 Continuation US11745758B2 (en) | 2020-06-05 | 2022-06-22 | Method and system for context-aware decision making of an autonomous agent |
Publications (2)
Publication Number | Publication Date |
---|---|
US20220348220A1 true US20220348220A1 (en) | 2022-11-03 |
US11505208B1 US11505208B1 (en) | 2022-11-22 |
Family
ID=76320930
Family Applications (10)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/116,810 Active US11034364B1 (en) | 2020-06-05 | 2020-12-09 | Method and system for context-aware decision making of an autonomous agent |
US17/306,014 Active US11267485B2 (en) | 2020-06-05 | 2021-05-03 | Method and system for context-aware decision making of an autonomous agent |
US17/332,839 Active US11260882B2 (en) | 2020-06-05 | 2021-05-27 | Method and system for context-aware decision making of an autonomous agent |
US17/332,748 Active US11254332B2 (en) | 2020-06-05 | 2021-05-27 | Method and system for context-aware decision making of an autonomous agent |
US17/584,062 Active US11396307B2 (en) | 2020-06-05 | 2022-01-25 | Method and system for context-aware decision making of an autonomous agent |
US17/681,543 Active US11505207B2 (en) | 2020-06-05 | 2022-02-25 | Method and system for context-aware decision making of an autonomous agent |
US17/846,870 Active US11745758B2 (en) | 2020-06-05 | 2022-06-22 | Method and system for context-aware decision making of an autonomous agent |
US17/851,321 Active US11661078B2 (en) | 2020-06-05 | 2022-06-28 | Method and system for context-aware decision making of an autonomous agent |
US17/858,024 Active US11505208B1 (en) | 2020-06-05 | 2022-07-05 | Method and system for context-aware decision making of an autonomous agent |
US18/225,319 Pending US20230365153A1 (en) | 2020-06-05 | 2023-07-24 | Method and system for context-aware decision making of an autonomous agent |
Family Applications Before (8)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/116,810 Active US11034364B1 (en) | 2020-06-05 | 2020-12-09 | Method and system for context-aware decision making of an autonomous agent |
US17/306,014 Active US11267485B2 (en) | 2020-06-05 | 2021-05-03 | Method and system for context-aware decision making of an autonomous agent |
US17/332,839 Active US11260882B2 (en) | 2020-06-05 | 2021-05-27 | Method and system for context-aware decision making of an autonomous agent |
US17/332,748 Active US11254332B2 (en) | 2020-06-05 | 2021-05-27 | Method and system for context-aware decision making of an autonomous agent |
US17/584,062 Active US11396307B2 (en) | 2020-06-05 | 2022-01-25 | Method and system for context-aware decision making of an autonomous agent |
US17/681,543 Active US11505207B2 (en) | 2020-06-05 | 2022-02-25 | Method and system for context-aware decision making of an autonomous agent |
US17/846,870 Active US11745758B2 (en) | 2020-06-05 | 2022-06-22 | Method and system for context-aware decision making of an autonomous agent |
US17/851,321 Active US11661078B2 (en) | 2020-06-05 | 2022-06-28 | Method and system for context-aware decision making of an autonomous agent |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US18/225,319 Pending US20230365153A1 (en) | 2020-06-05 | 2023-07-24 | Method and system for context-aware decision making of an autonomous agent |
Country Status (5)
Country | Link |
---|---|
US (10) | US11034364B1 (en) |
EP (1) | EP4162337A4 (en) |
JP (1) | JP7538892B2 (en) |
CA (1) | CA3181067A1 (en) |
WO (1) | WO2021247075A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220004191A1 (en) * | 2020-07-01 | 2022-01-06 | Wuhan University Of Technology | Usv formation path-following method based on deep reinforcement learning |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10384855B2 (en) | 2014-02-07 | 2019-08-20 | Yeti Coolers, Llc | Insulating device and method for forming insulating device |
CA3072401A1 (en) * | 2017-08-10 | 2019-02-14 | Nissan Motor Co., Ltd. | Traffic lane information management method, running control method, and traffic lane information management device |
JP7199545B2 (en) | 2018-07-20 | 2023-01-05 | メイ モビリティー,インコーポレイテッド | A Multi-view System and Method for Action Policy Selection by Autonomous Agents |
US11511576B2 (en) * | 2020-01-24 | 2022-11-29 | Ford Global Technologies, Llc | Remote trailer maneuver assist system |
KR20210124603A (en) * | 2020-04-06 | 2021-10-15 | 현대자동차주식회사 | Apparatus for controlling autonomous driving of a vehicle, system having the same and method thereof |
CA3180994A1 (en) | 2020-06-05 | 2021-12-09 | Gautam Narang | Method and system for data-driven and modular decision making and trajectory generation of an autonomous agent |
JP2023533225A (en) | 2020-07-01 | 2023-08-02 | メイ モビリティー,インコーポレイテッド | Methods and systems for dynamically curating autonomous vehicle policies |
KR20220060404A (en) * | 2020-11-04 | 2022-05-11 | 현대자동차주식회사 | Method and apparatus for generating test case for dynamic verification of autonomous driving system |
JP2024512980A (en) * | 2021-04-02 | 2024-03-21 | メイ モビリティー,インコーポレイテッド | Method and system for operating autonomous agents with incomplete environmental information |
US20230020503A1 (en) * | 2021-07-08 | 2023-01-19 | Ford Global Technologies, Llc | Machine control |
CN114407915B (en) * | 2021-12-14 | 2024-08-06 | 高德软件有限公司 | Processing method, device and storage medium for operation design domain ODD |
CA3240409A1 (en) | 2021-12-16 | 2023-06-22 | Apeksha Kumavat | Method and system for addressing failure in an autonomous agent |
US12037011B2 (en) | 2021-12-16 | 2024-07-16 | Gatik Ai Inc. | Method and system for expanding the operational design domain of an autonomous agent |
US11541910B1 (en) * | 2022-01-07 | 2023-01-03 | Plusai, Inc. | Methods and apparatus for navigation of an autonomous vehicle based on a location of the autonomous vehicle relative to shouldered objects |
US20230303123A1 (en) * | 2022-03-22 | 2023-09-28 | Qualcomm Incorporated | Model hyperparameter adjustment using vehicle driving context classification |
JP7441258B2 (en) * | 2022-03-25 | 2024-02-29 | 本田技研工業株式会社 | Control device |
US11840257B2 (en) * | 2022-03-25 | 2023-12-12 | Embark Trucks Inc. | Lane change determination for vehicle on shoulder |
US11644842B1 (en) * | 2022-06-23 | 2023-05-09 | Motional Ad Llc | Semi-closed loop rollouts for data augmentation |
CN115432008A (en) * | 2022-09-30 | 2022-12-06 | 重庆长安汽车股份有限公司 | Optimization-multitask-based automatic driving decision planning method, electronic equipment and computer storage medium |
WO2024129832A1 (en) | 2022-12-13 | 2024-06-20 | May Mobility, Inc. | Method and system for assessing and mitigating risks encounterable by an autonomous vehicle |
DE102023108635A1 (en) | 2023-04-04 | 2024-10-10 | Dr. Ing. H.C. F. Porsche Aktiengesellschaft | Method and system for creating a context-based recommendation for at least one route for a moving entity |
KR102675973B1 (en) * | 2023-12-15 | 2024-06-19 | 연세대학교 산학협력단 | Method for performing tasks according to capeam model including context-aware planning module and environment-aware memory module and ai agent using the same |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170345181A1 (en) * | 2016-05-27 | 2017-11-30 | Beijing Kuangshi Technology Co., Ltd. | Video monitoring method and video monitoring system |
US20210304123A1 (en) * | 2020-03-26 | 2021-09-30 | Fractal Analytics Private Limited | System and method for identifying object information in image or video data |
Family Cites Families (66)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6292830B1 (en) * | 1997-08-08 | 2001-09-18 | Iterations Llc | System for optimizing interaction among agents acting on multiple levels |
US6278965B1 (en) * | 1998-06-04 | 2001-08-21 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | Real-time surface traffic adviser |
US7230545B2 (en) | 2003-11-07 | 2007-06-12 | Nattel Group, Inc. | Automobile communication and registry system |
US8024112B2 (en) * | 2005-09-29 | 2011-09-20 | Microsoft Corporation | Methods for predicting destinations from partial trajectories employing open-and closed-world modeling methods |
US7831391B2 (en) | 2007-06-12 | 2010-11-09 | Palo Alto Research Center Incorporated | Using segmented cones for fast, conservative assessment of collision risk |
US8612107B2 (en) | 2008-06-10 | 2013-12-17 | The Regents Of The University Of Michigan | Method, control apparatus and powertrain system controller for real-time, self-learning control based on individual operating style |
US20100131148A1 (en) | 2008-11-26 | 2010-05-27 | Jaime Camhi | System and method for estimated driver intention for driver assistance system control |
FR2947231B1 (en) * | 2009-06-30 | 2013-03-29 | Valeo Vision | METHOD FOR PREDICTIVELY DETERMINING THE ROAD SITUATIONS OF A VEHICLE |
US8175617B2 (en) | 2009-10-28 | 2012-05-08 | Digimarc Corporation | Sensor-based mobile search, related methods and systems |
US9679258B2 (en) | 2013-10-08 | 2017-06-13 | Google Inc. | Methods and apparatus for reinforcement learning |
JP2015089801A (en) * | 2013-11-07 | 2015-05-11 | 株式会社デンソー | Operation control device |
US9507346B1 (en) | 2015-11-04 | 2016-11-29 | Zoox, Inc. | Teleoperation system and method for trajectory modification of autonomous vehicles |
US9910441B2 (en) | 2015-11-04 | 2018-03-06 | Zoox, Inc. | Adaptive autonomous vehicle planner logic |
KR102137213B1 (en) | 2015-11-16 | 2020-08-13 | 삼성전자 주식회사 | Apparatus and method for traning model for autonomous driving, autonomous driving apparatus |
WO2017197018A2 (en) * | 2016-05-10 | 2017-11-16 | Trustees Of Tufts College | Systems and methods enabling online one-shot learning and generalization by intelligent systems of task-relevant features and transfer to a cohort of intelligent systems |
US10495753B2 (en) * | 2016-09-19 | 2019-12-03 | Nec Corporation | Video to radar |
WO2018078413A1 (en) * | 2016-10-31 | 2018-05-03 | Toyota Motor Europe | Driving assistance method and system |
WO2018102425A1 (en) | 2016-12-02 | 2018-06-07 | Starsky Robotics, Inc. | Vehicle control system and method of use |
KR20180094725A (en) | 2017-02-16 | 2018-08-24 | 삼성전자주식회사 | Control method and control apparatus of car for automatic driving and learning method for automatic driving |
US10095234B2 (en) * | 2017-03-07 | 2018-10-09 | nuTonomy Inc. | Planning for unknown objects by an autonomous vehicle |
JP6838241B2 (en) * | 2017-06-01 | 2021-03-03 | 日立Astemo株式会社 | Mobile behavior prediction device |
US10559140B2 (en) | 2017-06-16 | 2020-02-11 | Uatc, Llc | Systems and methods to obtain feedback in response to autonomous vehicle failure events |
US10752246B2 (en) * | 2017-07-01 | 2020-08-25 | Tusimple, Inc. | System and method for adaptive cruise control with proximate vehicle detection |
US10883844B2 (en) | 2017-07-27 | 2021-01-05 | Waymo Llc | Neural networks for vehicle trajectory planning |
US10520940B2 (en) * | 2017-08-14 | 2019-12-31 | GM Global Technology Operations LLC | Autonomous operation using deep spatio-temporal learning |
US11430071B2 (en) * | 2017-08-16 | 2022-08-30 | Mobileye Vision Technologies Ltd. | Navigation based on liability constraints |
US10807599B2 (en) | 2017-09-18 | 2020-10-20 | Baidu Usa Llc | Driving scenario based lane guidelines for path planning of autonomous driving vehicles |
EP3467717A1 (en) * | 2017-10-04 | 2019-04-10 | Prowler.io Limited | Machine learning system |
US10866588B2 (en) | 2017-10-16 | 2020-12-15 | Toyota Research Institute, Inc. | System and method for leveraging end-to-end driving models for improving driving task modules |
WO2019088989A1 (en) | 2017-10-31 | 2019-05-09 | Nissan North America, Inc. | Reinforcement and model learning for vehicle operation |
US20190146508A1 (en) | 2017-11-14 | 2019-05-16 | Uber Technologies, Inc. | Dynamic vehicle routing using annotated maps and profiles |
US10345811B2 (en) | 2017-11-14 | 2019-07-09 | GM Global Technology Operations LLC | Method and apparatus for scenario generation and parametric sweeps for the development and evaluation of autonomous driving systems |
US10859384B2 (en) | 2017-11-15 | 2020-12-08 | Uatc, Llc | Lightweight vehicle localization systems and methods |
US20190147255A1 (en) | 2017-11-15 | 2019-05-16 | Uber Technologies, Inc. | Systems and Methods for Generating Sparse Geographic Data for Autonomous Vehicles |
EP3727980A4 (en) * | 2017-12-18 | 2021-08-04 | PlusAI Corp | Method and system for personalized motion planning in autonomous driving vehicles |
US11639183B2 (en) * | 2018-01-17 | 2023-05-02 | Mitsubishi Electric Corporation | Driving control device, driving control method, and computer readable medium |
US10836379B2 (en) | 2018-03-23 | 2020-11-17 | Sf Motors, Inc. | Multi-network-based path generation for vehicle parking |
US11099558B2 (en) * | 2018-03-27 | 2021-08-24 | Nvidia Corporation | Remote operation of vehicles using immersive virtual reality environments |
US10906536B2 (en) | 2018-04-11 | 2021-02-02 | Aurora Innovation, Inc. | Control of autonomous vehicle based on determined yaw parameter(s) of additional vehicle |
US10860022B2 (en) * | 2018-04-11 | 2020-12-08 | GM Global Technology Operations LLC | Method and apparatus for automatical rule learning for autonomous driving |
US11454970B2 (en) | 2018-05-21 | 2022-09-27 | Cummins Inc. | Adjustment of autonomous vehicle control authority |
US20190361454A1 (en) | 2018-05-24 | 2019-11-28 | GM Global Technology Operations LLC | Control systems, control methods and controllers for an autonomous vehicle |
EP3579174A1 (en) * | 2018-06-08 | 2019-12-11 | Hexagon Technology Center GmbH | Mobile vehicles in manufacturing |
US11054829B2 (en) * | 2018-07-17 | 2021-07-06 | Baidu Usa Llc | Multimodal motion planning framework for autonomous driving vehicles |
US10678245B2 (en) | 2018-07-27 | 2020-06-09 | GM Global Technology Operations LLC | Systems and methods for predicting entity behavior |
US10845815B2 (en) | 2018-07-27 | 2020-11-24 | GM Global Technology Operations LLC | Systems, methods and controllers for an autonomous vehicle that implement autonomous driver agents and driving policy learners for generating and improving policies based on collective driving experiences of the autonomous driver agents |
US10981564B2 (en) | 2018-08-17 | 2021-04-20 | Ford Global Technologies, Llc | Vehicle path planning |
US10882522B2 (en) * | 2018-09-13 | 2021-01-05 | Toyota Research Institute, Inc. | Systems and methods for agent tracking |
US11479243B2 (en) | 2018-09-14 | 2022-10-25 | Honda Motor Co., Ltd. | Uncertainty prediction based deep learning |
CN112334370B (en) | 2018-10-26 | 2024-04-30 | 深圳市大疆创新科技有限公司 | Automated vehicle actions for lane departure warning and associated systems and methods |
JP7279721B2 (en) * | 2018-10-30 | 2023-05-23 | 日本電気株式会社 | Object recognition device, object recognition method and object recognition program |
US11480972B2 (en) | 2018-11-13 | 2022-10-25 | Qualcomm Incorporated | Hybrid reinforcement learning for autonomous driving |
US20220001858A1 (en) | 2018-11-13 | 2022-01-06 | Nec Corporation | Dangerous scene prediction device, dangerous scene prediction method, and dangerous scene prediction program |
US11137762B2 (en) | 2018-11-30 | 2021-10-05 | Baidu Usa Llc | Real time decision making for autonomous driving vehicles |
US10814871B2 (en) | 2018-12-26 | 2020-10-27 | Gm Cruise Holdings Llc | Computing system for assigning maneuver labels to autonomous vehicle sensor data |
US10311336B1 (en) * | 2019-01-22 | 2019-06-04 | StradVision, Inc. | Method and device of neural network operations using a grid generator for converting modes according to classes of areas to satisfy level 4 of autonomous vehicles |
US11521396B1 (en) * | 2019-02-02 | 2022-12-06 | Uatc, Llc | Probabilistic prediction of dynamic object behavior for autonomous vehicles |
US11157784B2 (en) | 2019-05-08 | 2021-10-26 | GM Global Technology Operations LLC | Explainable learning system and methods for autonomous driving |
US20200363800A1 (en) * | 2019-05-13 | 2020-11-19 | Great Wall Motor Company Limited | Decision Making Methods and Systems for Automated Vehicle |
US11294387B2 (en) | 2019-06-17 | 2022-04-05 | Toyota Research Institute, Inc. | Systems and methods for training a vehicle to autonomously drive a route |
US11562203B2 (en) | 2019-12-30 | 2023-01-24 | Servicenow Canada Inc. | Method of and server for training a machine learning algorithm for estimating uncertainty of a sequence of models |
CN111061277B (en) * | 2019-12-31 | 2022-04-05 | 歌尔股份有限公司 | Unmanned vehicle global path planning method and device |
US20210256466A1 (en) | 2020-02-14 | 2021-08-19 | Zoox, Inc. | Mobile delivery vehicle management and routing |
US11460857B1 (en) * | 2020-02-21 | 2022-10-04 | Zoox, Inc. | Object or person attribute characterization |
US20210117760A1 (en) | 2020-06-02 | 2021-04-22 | Intel Corporation | Methods and apparatus to obtain well-calibrated uncertainty in deep neural networks |
JP2023553980A (en) | 2020-12-14 | 2023-12-26 | メイ モビリティー,インコーポレイテッド | Autonomous vehicle safety platform system and method |
-
2020
- 2020-12-09 CA CA3181067A patent/CA3181067A1/en active Pending
- 2020-12-09 WO PCT/US2020/064058 patent/WO2021247075A1/en active Application Filing
- 2020-12-09 EP EP20938767.9A patent/EP4162337A4/en active Pending
- 2020-12-09 JP JP2022574685A patent/JP7538892B2/en active Active
- 2020-12-09 US US17/116,810 patent/US11034364B1/en active Active
-
2021
- 2021-05-03 US US17/306,014 patent/US11267485B2/en active Active
- 2021-05-27 US US17/332,839 patent/US11260882B2/en active Active
- 2021-05-27 US US17/332,748 patent/US11254332B2/en active Active
-
2022
- 2022-01-25 US US17/584,062 patent/US11396307B2/en active Active
- 2022-02-25 US US17/681,543 patent/US11505207B2/en active Active
- 2022-06-22 US US17/846,870 patent/US11745758B2/en active Active
- 2022-06-28 US US17/851,321 patent/US11661078B2/en active Active
- 2022-07-05 US US17/858,024 patent/US11505208B1/en active Active
-
2023
- 2023-07-24 US US18/225,319 patent/US20230365153A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170345181A1 (en) * | 2016-05-27 | 2017-11-30 | Beijing Kuangshi Technology Co., Ltd. | Video monitoring method and video monitoring system |
US20210304123A1 (en) * | 2020-03-26 | 2021-09-30 | Fractal Analytics Private Limited | System and method for identifying object information in image or video data |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220004191A1 (en) * | 2020-07-01 | 2022-01-06 | Wuhan University Of Technology | Usv formation path-following method based on deep reinforcement learning |
US11914376B2 (en) * | 2020-07-01 | 2024-02-27 | Wuhan University Of Technology | USV formation path-following method based on deep reinforcement learning |
Also Published As
Publication number | Publication date |
---|---|
US11661078B2 (en) | 2023-05-30 |
US11505208B1 (en) | 2022-11-22 |
US11396307B2 (en) | 2022-07-26 |
US11267485B2 (en) | 2022-03-08 |
US20220315042A1 (en) | 2022-10-06 |
CA3181067A1 (en) | 2021-12-09 |
US11254332B2 (en) | 2022-02-22 |
US20220332346A1 (en) | 2022-10-20 |
US20220234610A1 (en) | 2022-07-28 |
US11260882B2 (en) | 2022-03-01 |
US11034364B1 (en) | 2021-06-15 |
US20230365153A1 (en) | 2023-11-16 |
US11505207B2 (en) | 2022-11-22 |
US20210380129A1 (en) | 2021-12-09 |
JP2023528077A (en) | 2023-07-03 |
US11745758B2 (en) | 2023-09-05 |
JP7538892B2 (en) | 2024-08-22 |
US20210380128A1 (en) | 2021-12-09 |
EP4162337A4 (en) | 2024-07-03 |
WO2021247075A1 (en) | 2021-12-09 |
US20220144306A1 (en) | 2022-05-12 |
US20210380130A1 (en) | 2021-12-09 |
EP4162337A1 (en) | 2023-04-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11505208B1 (en) | Method and system for context-aware decision making of an autonomous agent | |
US12012121B2 (en) | Method and system for data-driven and modular decision making and trajectory generation of an autonomous agent | |
US11320827B2 (en) | Method and system for deterministic trajectory selection based on uncertainty estimation for an autonomous agent | |
US20240217558A1 (en) | Goal-based Motion Forecasting |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
AS | Assignment |
Owner name: GATIK AI INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NARANG, GAUTAM;KUMAVAT, APEKSHA;NARANG, ARJUN;AND OTHERS;REEL/FRAME:060503/0283 Effective date: 20210104 |
|
FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
CC | Certificate of correction |