[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2024193208A1 - Deep learning-based signal light recognition method and apparatus - Google Patents

Deep learning-based signal light recognition method and apparatus Download PDF

Info

Publication number
WO2024193208A1
WO2024193208A1 PCT/CN2024/072682 CN2024072682W WO2024193208A1 WO 2024193208 A1 WO2024193208 A1 WO 2024193208A1 CN 2024072682 W CN2024072682 W CN 2024072682W WO 2024193208 A1 WO2024193208 A1 WO 2024193208A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal light
vehicle
preset
light source
light
Prior art date
Application number
PCT/CN2024/072682
Other languages
French (fr)
Chinese (zh)
Inventor
陈安猛
陈远鹏
胡文博
冷静
宣经纬
薛鹏
张军良
张文海
Original Assignee
合众新能源汽车股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 合众新能源汽车股份有限公司 filed Critical 合众新能源汽车股份有限公司
Publication of WO2024193208A1 publication Critical patent/WO2024193208A1/en

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60KARRANGEMENT OR MOUNTING OF PROPULSION UNITS OR OF TRANSMISSIONS IN VEHICLES; ARRANGEMENT OR MOUNTING OF PLURAL DIVERSE PRIME-MOVERS IN VEHICLES; AUXILIARY DRIVES FOR VEHICLES; INSTRUMENTATION OR DASHBOARDS FOR VEHICLES; ARRANGEMENTS IN CONNECTION WITH COOLING, AIR INTAKE, GAS EXHAUST OR FUEL SUPPLY OF PROPULSION UNITS IN VEHICLES
    • B60K31/00Vehicle fittings, acting on a single sub-unit only, for automatically controlling vehicle speed, i.e. preventing speed from exceeding an arbitrarily established velocity or maintaining speed at a particular velocity, as selected by the vehicle operator
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W40/00Estimation or calculation of non-directly measurable driving parameters for road vehicle drive control systems not related to the control of a particular sub unit, e.g. by using mathematical models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • G06V20/58Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
    • G06V20/584Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads of vehicle lights or traffic lights
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60WCONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE OR DIFFERENT FUNCTION; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TO THE CONTROL OF A PARTICULAR SUB-UNIT
    • B60W2552/00Input parameters relating to infrastructure
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02BCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO BUILDINGS, e.g. HOUSING, HOUSE APPLIANCES OR RELATED END-USER APPLICATIONS
    • Y02B20/00Energy efficient lighting technologies, e.g. halogen lamps or gas discharge lamps
    • Y02B20/40Control techniques providing energy savings, e.g. smart controller or presence detection

Definitions

  • the present invention relates to the field of autonomous driving technology, and in particular to a signal light recognition method and device based on deep learning.
  • a vehicle In an autonomous driving scenario, a vehicle needs to promptly identify environmental information around the road, such as zebra crossings, traffic lights, etc.
  • the vehicle can ensure driving safety by performing autonomous driving based on the identified environmental information.
  • the recognition of traffic lights is particularly important for the safety of autonomous driving.
  • the signal light recognition method used is completed through 2D target detection.
  • 2D target detection due to the different models and sizes of signal lights on the road, it is difficult to obtain the distance between the signal light and the vehicle through scale information through 2D target detection. Without accurate distance information, in the application of autonomous driving scenarios, the lack of distance information will cause some decision errors and lags.
  • the present invention provides a traffic light recognition method and device based on deep learning.
  • the main purpose is to achieve a scenario where an autonomous driving vehicle passes through an intersection with traffic lights, predict the distance information between the traffic light and the vehicle, and assist in decision-making for subsequent planning of the vehicle.
  • the present invention proposes the following solutions:
  • the present invention provides a signal light recognition method based on deep learning, the method comprising:
  • a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified;
  • a signal light queue corresponding to the signal light to be identified is formed using preset rules and added to the preset signal light queue set; wherein the signal light queue stores the signal light to be identified within a preset time range.
  • the speed of the vehicle is adjusted using a preset speed change rule so as to pass through the intersection that the vehicle is preparing to pass.
  • the present invention provides a signal light recognition device based on deep learning, the device comprising:
  • a first acquisition unit configured to acquire a target image at a preset time interval, wherein the target image includes at least one signal light to be identified;
  • a prediction unit configured to obtain, based on the target image, a lamp panel position, a light source position, a light source type, and a light source depth of the signal light to be identified by using a preset deep neural network prediction model;
  • a forming unit configured to form a signal light queue corresponding to the signal light to be identified by using a preset rule based on the light panel position, light source position, light source type and light source depth of the signal light to be identified, and add the signal light queue to a preset signal light queue set; wherein the signal light queue stores the light panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle within a preset time range corresponding to the signal light to be identified;
  • a first determining unit is used to determine a signal light queue corresponding to a target signal light from the preset signal light queue set based on the destination information and current position information of the vehicle, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass;
  • An adjustment unit is used to adjust the speed of the vehicle using a preset speed change rule based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light, so as to pass the intersection that the vehicle is preparing to pass.
  • a storage medium includes a stored program, wherein when the program is running, the device where the storage medium is located is controlled to execute the traffic light recognition method based on deep learning described in the first aspect.
  • an electronic device comprising a memory, a processor, and a computer program stored in the memory and operable on the processor.
  • a computer program when the processor executes the program, implements all or part of the steps of the signal light recognition device based on deep learning as described in the second aspect.
  • the signal light recognition method and device based on deep learning provided by the present invention are because the currently used signal light recognition method is completed through 2D target detection.
  • the lack of distance information will cause some decision errors and lags.
  • the present invention obtains a target image at a preset time interval, wherein the target image contains at least one signal light to be identified; based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source category and light source depth of the signal light to be identified; based on the lamp panel position, light source position, light source category and light source depth of the signal light to be identified, a signal light queue corresponding to the signal light to be identified is formed using a preset rule and added to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source category and distance information between the signal light to be identified and the vehicle within a preset time range; based on the destination information and current position information of the vehicle, a signal light queue corresponding to a target signal light is determined from the preset signal light queue set, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass; based on the distance information between the target signal light and the
  • the present invention applies a combination of depth estimation and target detection methods to predict a traffic light with depth information.
  • traffic lights can be discovered earlier and used as an auxiliary distance reference in scenarios where high-precision maps or GPS are unavailable, thereby enhancing the robustness of the autonomous driving system and avoiding some shortcomings of 2D traffic light recognition.
  • the lack of distance information can cause some decision-making errors and lags.
  • FIG1 shows a flow chart of a signal light recognition method based on deep learning provided by an embodiment of the present invention
  • FIG2 shows a flow chart of another signal light recognition method based on deep learning provided by an embodiment of the present invention
  • FIG3 shows a block diagram of a signal light recognition device based on deep learning provided by an embodiment of the present invention
  • FIG4 shows a block diagram of another signal light recognition device based on deep learning provided by an embodiment of the present invention.
  • FIG5 shows an application scenario diagram corresponding to another signal light recognition device based on deep learning provided by an embodiment of the present invention.
  • the current method for signal light recognition is accomplished through 2D target detection.
  • 2D target detection due to the different types and sizes of signal lights on the road, it is difficult to obtain the distance between the signal light and the vehicle through scale information through 2D target detection. Without accurate distance information, the lack of distance information in autonomous driving scenarios will cause some decision errors and lags.
  • the inventors thought of combining depth estimation and target detection methods to predict a signal light with depth information. In an era when telephoto lenses are standard, signal lights can be discovered earlier, and can be used as an auxiliary distance reference in scenarios where high-precision maps or GPS are unavailable, thereby enhancing the robustness of the autonomous driving system.
  • an embodiment of the present invention provides a signal light recognition method based on deep learning.
  • the method can predict the distance information between the signal light and the vehicle when the autonomous driving vehicle passes through an intersection with a signal light, and assist in decision-making for the subsequent planning of the vehicle.
  • the specific execution steps are shown in FIG1, including:
  • the target image contains at least one signal light to be identified.
  • Traffic lights are a category of traffic safety products. They are an important tool to strengthen road traffic management, reduce the occurrence of traffic accidents, improve road use efficiency, and improve traffic conditions. They are suitable for crosses, T-shaped intersections, etc., and are controlled by road traffic signal controllers to guide vehicles and pedestrians to pass safely and orderly. Traffic lights are composed of red lights, green lights, and yellow lights. Red lights indicate that passage is prohibited, green lights indicate that passage is allowed, and yellow lights indicate slow travel or warning. The "Road Traffic Law Implementation Regulations" divide traffic lights into: motor vehicle lights, non-motor vehicle lights, pedestrian crossing lights, lane lights, direction indicator lights, flashing warning lights, and road and railway level crossing lights.
  • This embodiment first needs to acquire the target image at a preset time interval, wherein the preset time interval can be every second, every 2 seconds, etc., and is set according to actual needs.
  • the preset time interval can be every second, every 2 seconds, etc., and is set according to actual needs.
  • This embodiment does not make specific limitations; since the intersection is usually cross-shaped or T-shaped, there may be 3, 2, etc. traffic lights in different directions in the image obtained by the vehicle, but before executing the next step, the image obtained by the vehicle must contain at least one traffic light to be identified.
  • a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified.
  • the target image can be obtained, and then the lamp panel position, light source position, light source category and light source depth of the signal light to be identified are obtained by using a preset deep neural network prediction model; wherein the preset deep neural network prediction model is pre-designed, and includes an input image module, a feature extraction module, a feature fusion module, a depth prediction module, a signal light panel frame prediction module, a ROI extraction module, a light source regression classification module, etc., which are not specifically limited in this embodiment;
  • the input image module obtains the target image as the input data of the preset deep neural network prediction model, extracts features through the feature extraction module, and then fuses the extracted features through the feature fusion module.
  • the depth of the light source is obtained through the depth prediction module based on the fused information;
  • the light panel frame information is obtained through the signal light panel frame prediction module based on the fused information, and the light panel frame information includes the position of the light panel;
  • the ROI extraction module matches the light panel frame information with the features obtained through the feature extraction module, and extracts ROI features:
  • the light source position and light source category are obtained through the light source regression classification module.
  • the light source category is classified according to different light source colors, images, etc., for the purpose of indicating whether the vehicle is moving forward or turning, etc. This embodiment does not make specific limitations.
  • a signal light queue corresponding to the signal light to be identified is formed using preset rules and added to the preset signal light queue set.
  • the traffic light queue saves the lamp panel position, light source position, light source type and distance information between the traffic light to be identified and the vehicle corresponding to the traffic light to be identified within a preset time range; the distance information between the traffic light to be identified and the vehicle is obtained by the light source depth, which can be the numerical value of the light source depth, or the distance corrected by combining the light source depth with the light source scale information.
  • a signal light panel corresponds to a signal light queue, that is, a signal light queue includes all the information of a signal light, such as: light panel information, light source related information (light source position, light source type and light source depth), etc.
  • the order of adding the formed traffic light queues to the preset traffic light queue set can be set as follows: the queue formed by the traffic light directly in front of the vehicle is located at the first one of the preset traffic light queues, followed by the queue corresponding to the traffic light on the left side of the intersection, and the queue corresponding to the traffic light on the right side of the intersection; or the traffic light at the intersection to be passed can be determined first according to the destination and current position of the vehicle, and then the queue formed by the traffic light is located at the first one of the preset traffic light queue set; this implementation does not make specific limitations.
  • the target signal light is a signal light set at the intersection where the vehicle is to pass;
  • the preset signal light queue set can be obtained from step 103. It is necessary to determine the intersection that the vehicle is going to pass through based on the destination set in the vehicle's automatic driving system and the current position information determined. When the intersection is determined, the light panel position of the signal light at the intersection can be obtained, and then the signal light queue corresponding to the light panel position can be searched from the preset signal light queue. The signal light queue found is formed by the signal light at the intersection that the vehicle is going to pass through. The signal light is the target signal light, and the signal light queue is the signal light queue corresponding to the target signal light.
  • the vehicle speed is adjusted using a preset speed change rule.
  • the adjusting of the vehicle speed is used to pass through an intersection that the vehicle is preparing to pass.
  • the signal light queue corresponding to the target signal light can be obtained, and the distance information and light source type of the target signal light and the vehicle that are pre-stored can be obtained from the signal light queue, and the vehicle speed can be adjusted using the preset speed change rule.
  • the preset speed change rule can set different levels of speed according to the different light source types and the different distances between the target signal light and the vehicle. For example, when the distance between the target signal light and the vehicle is in the first distance range (far away), regardless of the light source type, the vehicle speed can be reduced to the first level of speed. When the distance between the target signal light and the vehicle gradually decreases and reaches the second distance range (neither far nor near), the color of the target signal light is further determined.
  • the vehicle speed is reduced to the second level of speed so that the subsequent vehicle can stop smoothly and quickly when it reaches the stop line.
  • the target signal light may still be red when the vehicle reaches the stop line.
  • the vehicle can maintain the first level of speed. It should be noted that the first level speed is faster than the second level speed.
  • the present invention provides a signal light recognition method based on deep learning.
  • the present invention obtains a target image at a preset time interval, wherein the target image contains at least one signal light to be recognized; based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source category and light source depth of the signal light to be recognized; based on the lamp panel position, light source position, light source category and light source depth of the signal light to be recognized, a signal light queue corresponding to the signal light to be recognized is formed using a preset rule and added to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source category and distance information between the signal light to be recognized and the vehicle within a preset time range; based on the destination information and current position information of the vehicle, a signal light queue corresponding to a target signal light is determined from the preset signal light queue set, wherein the target signal light
  • the present invention combines depth estimation and target detection methods. Predicting a traffic light with depth information can detect traffic lights earlier in the era when telephoto lenses are standard, and can serve as an auxiliary distance reference in scenarios where high-precision maps or GPS are unavailable, enhancing the robustness of the autonomous driving system, and avoiding some shortcomings of 2D traffic light recognition, such as the lack of distance information in autonomous driving scene applications, which can cause some decision errors and lags. Further, as a refinement and extension of the embodiment shown in Figure 1, an embodiment of the present invention also provides another method for traffic light recognition based on deep learning, as shown in Figure 2, and its specific steps are as follows:
  • the target image contains at least one signal light to be identified
  • a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified.
  • step 102 is combined with the description of step 102 in the above method, and the same contents are not repeated here.
  • the target image can be obtained from step 201, and then based on the target image, the resnet-18 network in the preset deep neural network prediction model is used as the backbone network to perform feature extraction to obtain the target feature; based on the target feature, the PAN-FPN structure in the preset deep neural network prediction model is used to perform feature fusion to obtain target feature fusion information; based on the target feature fusion information, the convolution layer and the depth prediction layer in the preset deep neural network prediction model are used to obtain the light source depth of the signal light to be identified; based on the target feature fusion information, the full connection technology of the neural network is used to perform automatic regression to obtain the target frame position of the light panel; the target frame position of the light panel is matched with the target feature, and the ROI feature is extracted according to the target frame position of the light panel; based on the ROI feature, target frame regression and category classification are performed to obtain the light source position and the light source category of the signal light to be identified, wherein the light source category is one or more of a red light,
  • the box at the traffic light contains the light panel information.
  • the preset deep neural network prediction model detects the box at the traffic light, and then obtains the features of the area within the box ROI corresponding to the traffic light to identify the internal light source, including its category and position.
  • a signal light queue corresponding to the signal light to be identified is formed using preset rules, and added to the preset signal light queue set.
  • step 103 is combined with the description of step 103 in the above method, and the same contents are not repeated here.
  • the signal light queue stores information on the lamp panel position, light source position, light source type, and distance between the signal light to be identified and the vehicle corresponding to the signal light to be identified within a preset time range.
  • the lamp panel position, light source position, light source type and light source depth of the signal light to be identified can be obtained, and then the scale information corresponding to the signal light to be identified is obtained.
  • the method for obtaining the scale information is to obtain the approximate distance through the image size (mainly width and height) of the target light source and the prior knowledge of the width and height information and the actual target size; the light source depth corresponding to the signal light to be identified and the scale information are fused to obtain the distance information between the signal light to be identified and the vehicle.
  • the light source distance output by the model must conform to the relationship between the image size of the light source and the actual distance. Because the size of traffic lights has national standards, the image size and the actual size are related and there will not be too much error.
  • the scale information is used as a priori knowledge to correct the distance between the signal light to be identified and the vehicle; based on the lamp panel position, the light source position, the light source type and the distance information between the signal light to be identified and the vehicle, a signal light queue corresponding to the signal light to be identified is formed; and the signal light queue corresponding to the signal light to be identified is added to the preset signal light queue set in the chronological order of formation of the signal light queue.
  • the on-board camera can be used to obtain the lane line information and road arrow information on the lane where the vehicle is located, as shown in FIG5 , and is used to locate the current position of the subsequent vehicle.
  • the lane line information and road arrow information can be obtained from step 204, that is, an image of the road condition ahead can be obtained, as shown in FIG5 , including lane lines, road arrows and traffic lights, etc.;
  • the current position information of the vehicle can be obtained by approximating the position of the center line of the vehicle through the center of the image, and the current position information includes: the lateral position and direction of the vehicle, and the relationship with the lane lines and road arrows is shown in FIG5 .
  • step 104 is combined with the description of step 104 in the above method, and the same contents are not repeated here.
  • the target traffic light is a traffic light set at the intersection through which the vehicle is to pass.
  • the vehicle speed is adjusted using a preset speed change rule.
  • step 105 This step is combined with the description of step 105 in the above method, and the same contents are not repeated here.
  • the adjusting of the vehicle speed is used to pass through an intersection that the vehicle is preparing to pass.
  • the vehicle distance is maintained and the vehicle in front is followed through the intersection where the target signal light is located; after the vehicle passes the intersection where the target signal light is located, the preset signal light queue set is cleared.
  • the present invention provides a signal light recognition method based on deep learning.
  • the present invention mainly targets the scene where an autonomous driving vehicle passes through an intersection with a signal light, applies a combination of depth estimation and target detection methods, and uses an end-to-end deep neural network architecture to predict a signal light with depth information, and then predicts the distance information between the signal light and the vehicle, and assists in the subsequent planning of the vehicle. Decision-making can be avoided in some shortcomings of 2D signal light recognition, such as the lack of distance information in the application of autonomous driving scenarios, which will cause some decision errors and lags.
  • signal lights can be found earlier, and can be used as auxiliary distance references in scenarios where high-precision maps or GPS are unavailable, thereby enhancing the robustness of the autonomous driving system.
  • the distance between the vehicle and the signal light can be obtained in real time through high-precision maps and GPS information, but this requires that the high-precision map be accurate and updated in a timely manner, and that the GPS error is small. Both of these conditions will fail in reality.
  • an embodiment of the present invention further provides a signal light recognition device based on deep learning, which is used to implement the method shown in FIG. 1 above.
  • This device embodiment corresponds to the aforementioned method embodiment.
  • this device embodiment will not repeat the details of the aforementioned method embodiment one by one, but it should be clear that the device in this embodiment can correspond to all the contents of the aforementioned method embodiment.
  • the device includes:
  • a first acquisition unit 31 is used to acquire a target image at a preset time interval, wherein the target image contains at least one signal light to be identified;
  • a prediction unit 32 configured to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified by using a preset deep neural network prediction model based on the target image obtained from the acquisition unit 31;
  • the forming unit 33 is used to form a signal light queue corresponding to the signal light to be identified based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified obtained from the prediction unit 32 using a preset rule, and add the signal light queue to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle within a preset time range.
  • a first determining unit 34 is used to determine, based on the destination information and current position information of the vehicle, a signal light queue corresponding to a target signal light from the preset signal light queue set obtained by the forming unit 33, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass;
  • the adjustment unit 35 is used to adjust the speed of the vehicle using a preset speed change rule based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light obtained from the determination unit 34, so as to pass through the intersection that the vehicle is preparing to pass.
  • an embodiment of the present invention also provides another signal light recognition device based on deep learning, which is used to implement the method shown in FIG. 2 above.
  • This device embodiment corresponds to the aforementioned method embodiment.
  • this device embodiment will no longer repeat the details of the aforementioned method embodiment one by one, but it should be clear that the device in this embodiment can correspond to all the contents of the aforementioned method embodiment.
  • the device includes:
  • a first acquisition unit 31 is used to acquire a target image at a preset time interval, wherein the target image contains at least one signal light to be identified;
  • a prediction unit 32 configured to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified by using a preset deep neural network prediction model based on the target image obtained from the acquisition unit 31;
  • the forming unit 33 is used to form a signal light queue corresponding to the signal light to be identified based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified obtained from the prediction unit 32 using a preset rule, and add the signal light queue to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle within a preset time range.
  • a first determining unit 34 configured to determine, based on the destination information of the vehicle and the current position information obtained from the second determining unit 37, a signal light queue corresponding to a target signal light from the set of preset signal light queues obtained by the forming unit 33, wherein the target signal light is a signal light provided at the intersection through which the vehicle is to pass;
  • an adjusting unit 35 configured to adjust the speed of the vehicle by using a preset speed change rule based on the distance information between the target signal light and the vehicle and the light source type stored in the signal light queue corresponding to the target signal light obtained from the determining unit 34, so as to pass through the intersection that the vehicle is preparing to pass;
  • a second acquisition unit 36 is used to acquire lane line information and road arrow information of the lane where the vehicle is located;
  • a second determining unit 37 configured to determine the current position information of the vehicle based on the lane line information and the road arrow information obtained from the second acquiring unit 36;
  • the monitoring unit 38 is used to maintain the vehicle distance and follow the vehicle in front through the intersection where the target signal light is located when it is detected that there are other vehicles in front of the lane where the vehicle is located;
  • the deleting unit 39 is used to clear the preset signal light queue set after the vehicle obtained from the monitoring unit 38 passes through the intersection where the target signal light is located.
  • the prediction unit 32 includes:
  • a first feature extraction module 321 is used to extract features based on the target image using the resnet-18 network in the preset deep neural network prediction model as a backbone network to obtain target features;
  • a feature fusion module 322 is used to perform feature fusion based on the target feature obtained from the first feature extraction module 321 using the PAN-FPN structure in the preset deep neural network prediction model to obtain target feature fusion information;
  • a first prediction module 323, configured to obtain the light source depth of the signal light to be identified by using the convolution layer and the depth prediction layer in the preset deep neural network prediction model based on the target feature fusion information obtained from the feature fusion module 322;
  • a regression module 324 is used to automatically regress the target frame position of the lamp panel using a full connection technology of a neural network based on the target feature fusion information obtained from the feature fusion module 322;
  • a second feature extraction module 325 is used to match the lamp panel target frame position obtained from the regression module 324 with the target feature, and extract ROI features according to the lamp panel target frame position;
  • the second prediction module 326 is used to perform target frame regression and category classification based on the ROI features obtained from the second feature extraction module 325 to obtain the light source position and the light source category of the signal light to be identified, wherein the light source category is one or more of a red light, a yellow light, a green light, a left arrow, a right arrow, a forward arrow, a backward arrow and a number.
  • the forming unit 33 includes:
  • a first acquisition module 331 is used to acquire the scale information corresponding to the signal light to be identified
  • a second acquisition module 332 is used to fuse the light source depth corresponding to the signal light to be identified and the scale information obtained from the first acquisition module 331 to obtain the distance information between the signal light to be identified and the vehicle;
  • a forming module 333 configured to form a signal light queue corresponding to the signal light to be identified based on the lamp panel position, the light source position, the light source type and the distance information between the signal light to be identified and the vehicle obtained from the second acquiring module 332;
  • the adding module 334 is used to add the signal light queue corresponding to the signal light to be identified obtained from the forming module 333 to the preset signal light queue set according to the time sequence of the formation of the signal light queue.
  • the adjustment unit 35 includes:
  • a first judgment module 351 is used to judge whether the distance information between the target signal light and the vehicle is less than a preset threshold
  • a first adjustment module 352 configured to adjust the speed of the vehicle to a first preset speed if the distance information between the target signal light and the vehicle obtained from the first judgment module 351 is not less than a preset threshold;
  • a second judgment module 353 is used to judge whether the light source type is a green light if the distance information between the target signal light and the vehicle obtained from the first judgment module 351 is less than a preset threshold;
  • a second adjustment module 354 is configured to adjust the speed of the vehicle to a first preset speed if the light source type obtained from the second judgment module 353 is a green light;
  • a third adjustment module 355, configured to adjust the speed of the vehicle to a second preset speed if the light source type obtained from the second judgment module 353 is not a green light, and monitor whether the vehicle meets a preset parking condition; wherein the preset parking condition is a preset safety distance before the vehicle reaches a stop line or a stationary vehicle ahead, and the second preset threshold is less than the first preset threshold;
  • the speed of the vehicle is adjusted to 0.
  • an embodiment of the present invention also provides a processor, which is used to run a program, wherein the program, when running, executes the deep learning-based traffic light recognition method described in Figures 1-2 above.
  • an embodiment of the present invention also provides a storage medium, which is used to store a computer program, wherein when the computer program is running, it controls the device where the storage medium is located to execute the deep learning-based traffic light recognition method described in Figures 1-2 above.
  • the memory may include non-permanent memory in a computer-readable medium, random access memory (RAM) and/or non-volatile memory in the form of read-only memory (ROM) or flash RAM, and the memory includes at least one memory chip.
  • RAM random access memory
  • ROM read-only memory
  • flash RAM flash random access memory
  • the embodiments of the present application may be provided as methods, systems, or computer program products. Therefore, the present application may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment in combination with software and hardware. Moreover, the present application may adopt the form of a computer program product implemented in one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) that contain computer-usable program code.
  • a computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
  • These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to work in a specific manner, so that the instructions stored in the computer-readable memory produce a manufactured product including an instruction device that implements the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.
  • These computer program instructions may also be loaded onto a computer or other programmable data processing device so that a series of operational steps are executed on the computer or other programmable device to produce a computer-implemented process, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.
  • a computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
  • processors CPU
  • input/output interfaces network interfaces
  • memory volatile and non-volatile memory
  • Memory may include non-permanent storage in a computer-readable medium, random access memory (RAM) and/or non-volatile memory in the form of read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
  • RAM random access memory
  • ROM read-only memory
  • flash RAM flash memory
  • Computer readable media include permanent and non-permanent, removable and non-removable media that can be implemented by any method or technology to store information.
  • Information can be computer readable instructions, data structures, program modules or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disk read-only memory (CD-ROM), digital versatile disk (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media that can be used to store information that can be accessed by a computing device.
  • computer readable media does not include temporary computer readable media (transitory media), such as modulated data signals and carrier waves.
  • the embodiments of the present application may be provided as methods, systems or computer program products. Therefore, the present application may adopt the form of a complete hardware embodiment, a complete software embodiment or an embodiment in combination with software and hardware. Moreover, the present application may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) that contain computer-usable program code.
  • a computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Mechanical Engineering (AREA)
  • Transportation (AREA)
  • Biomedical Technology (AREA)
  • Automation & Control Theory (AREA)
  • Combustion & Propulsion (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Circuit Arrangement For Electric Light Sources In General (AREA)
  • Feedback Control In General (AREA)
  • Traffic Control Systems (AREA)

Abstract

A deep learning-based signal light recognition method and apparatus, relating to the technical field of autonomous driving, for use in signal light recognition in autonomous driving. The main objective is to predict distance information between a signal light and an ego vehicle, and to assist in decision-making for subsequent planning of the vehicle. The main technical solution is as follows: acquiring target images at a preset time interval; on the basis of the target images, obtaining, by using a preset deep neural network prediction model, lamp panel information and light source information of a signal light to be recognized; on the basis of the lamp panel information and the light source information, forming a corresponding signal light queue by using a preset rule, and adding the signal light queue into a preset signal light queue set; on the basis of destination information and current position information of a vehicle, determining the signal light queue corresponding to a target signal light from the preset signal light queue set; and adjusting the speed of the vehicle by using a preset speed change rule on the basis of distance information between the target signal light and the vehicle and a light source category stored in the signal light queue corresponding to the target signal light.

Description

一种基于深度学习的信号灯识别方法及装置A signal light recognition method and device based on deep learning 技术领域Technical Field
本发明涉及自动驾驶技术领域,尤其涉及一种基于深度学习的信号灯识别方法及装置。The present invention relates to the field of autonomous driving technology, and in particular to a signal light recognition method and device based on deep learning.
背景技术Background Art
车辆在自动驾驶场景中,需要及时识别道路周围的环境信息,例如:斑马线,信号灯等,车辆在识别的环境信息的基础上进行自动驾驶来可以保证驾驶的安全性,其中,信号灯的识别对于自动驾驶的安全性尤为重要。In an autonomous driving scenario, a vehicle needs to promptly identify environmental information around the road, such as zebra crossings, traffic lights, etc. The vehicle can ensure driving safety by performing autonomous driving based on the identified environmental information. Among them, the recognition of traffic lights is particularly important for the safety of autonomous driving.
目前,采用的信号灯识别方法是通过2D目标检测方式来完成,但是,由于在行车道路上存在的信号灯型号和尺寸各有不同,通过2D目标检测方式很难通过尺度信息获取信号灯距离本车的距离,没有准确的距离信息,在自动驾驶场景应用中,由于距离信息的缺少,会造成一些决策误差以及滞后。At present, the signal light recognition method used is completed through 2D target detection. However, due to the different models and sizes of signal lights on the road, it is difficult to obtain the distance between the signal light and the vehicle through scale information through 2D target detection. Without accurate distance information, in the application of autonomous driving scenarios, the lack of distance information will cause some decision errors and lags.
发明内容Summary of the invention
鉴于上述问题,本发明提供一种基于深度学习的信号灯识别方法及装置,主要目的是为了实现针对自动驾驶车辆通过有信号灯的路口场景,可预测信号灯和本车的距离信息,并对车辆的后续规划进行辅助决策。In view of the above problems, the present invention provides a traffic light recognition method and device based on deep learning. The main purpose is to achieve a scenario where an autonomous driving vehicle passes through an intersection with traffic lights, predict the distance information between the traffic light and the vehicle, and assist in decision-making for subsequent planning of the vehicle.
为解决上述技术问题,本发明提出以下方案:In order to solve the above technical problems, the present invention proposes the following solutions:
第一方面,本发明提供一种基于深度学习的信号灯识别方法,所述方法包括:In a first aspect, the present invention provides a signal light recognition method based on deep learning, the method comprising:
按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯;Acquire a target image at a preset time interval, wherein the target image contains at least one signal light to be identified;
基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;Based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified;
基于所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时间范围内所述待识别 信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;Based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified, a signal light queue corresponding to the signal light to be identified is formed using preset rules and added to the preset signal light queue set; wherein the signal light queue stores the signal light to be identified within a preset time range. The lamp panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle corresponding to the signal light;
基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;Based on the destination information and current location information of the vehicle, determining a signal light queue corresponding to a target signal light from the preset signal light queue set, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass;
基于所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口。Based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light, the speed of the vehicle is adjusted using a preset speed change rule so as to pass through the intersection that the vehicle is preparing to pass.
第二方面,本发明提供一种基于深度学习的信号灯识别装置,所述装置包括:In a second aspect, the present invention provides a signal light recognition device based on deep learning, the device comprising:
第一获取单元,用于按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯;A first acquisition unit, configured to acquire a target image at a preset time interval, wherein the target image includes at least one signal light to be identified;
预测单元,用于基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;A prediction unit, configured to obtain, based on the target image, a lamp panel position, a light source position, a light source type, and a light source depth of the signal light to be identified by using a preset deep neural network prediction model;
形成单元,用于基于所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;A forming unit, configured to form a signal light queue corresponding to the signal light to be identified by using a preset rule based on the light panel position, light source position, light source type and light source depth of the signal light to be identified, and add the signal light queue to a preset signal light queue set; wherein the signal light queue stores the light panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle within a preset time range corresponding to the signal light to be identified;
第一确定单元,用于基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;A first determining unit is used to determine a signal light queue corresponding to a target signal light from the preset signal light queue set based on the destination information and current position information of the vehicle, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass;
调整单元,用于基于所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口。An adjustment unit is used to adjust the speed of the vehicle using a preset speed change rule based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light, so as to pass the intersection that the vehicle is preparing to pass.
为了实现上述目的,根据本发明的第三方面,提供了一种存储介质,所述存储介质包括存储的程序,其中,在所述程序运行时控制所述存储介质所在设备执行上述第一方面所述基于深度学习的信号灯识别方法。In order to achieve the above-mentioned purpose, according to the third aspect of the present invention, a storage medium is provided, wherein the storage medium includes a stored program, wherein when the program is running, the device where the storage medium is located is controlled to execute the traffic light recognition method based on deep learning described in the first aspect.
为了实现上述目的,根据本发明的第四方面,提供了一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计 算机程序,所述处理器执行所述程序时实现如第二方面所述用于基于深度学习的信号灯识别装置的全部或部分步骤。In order to achieve the above object, according to a fourth aspect of the present invention, there is provided an electronic device, comprising a memory, a processor, and a computer program stored in the memory and operable on the processor. A computer program, when the processor executes the program, implements all or part of the steps of the signal light recognition device based on deep learning as described in the second aspect.
借由上述技术方案,本发明提供的基于深度学习的信号灯识别方法及装置,是由于目前采用的信号灯识别方法是通过2D目标检测方式来完成,但是,由于在行车道路上存在的信号灯型号和尺寸各有不同,通过2D目标检测方式很难通过尺度信息获取信号灯距离本车的距离,没有准确的距离信息,在自动驾驶场景应用中,由于距离信息的缺少,会造成一些决策误差以及滞后。为此,本发明通过按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯;基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;基于所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;基于所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口。本发明应用深度估计和目标检测方法的结合,预测一个带有深度信息的信号灯,在长焦镜头标配的时代,可以更早的发现信号灯,并且在高精地图或GPS不可用场景下作为辅助的距离参考,增强自动驾驶系统鲁棒性,并且可以避免2D信号灯识别的一些缺点,比如在自动驾驶场景应用中,距离信息的缺少,会造成一些决策误差以及滞后。Through the above technical scheme, the signal light recognition method and device based on deep learning provided by the present invention are because the currently used signal light recognition method is completed through 2D target detection. However, due to the different models and sizes of signal lights on the road, it is difficult to obtain the distance between the signal light and the vehicle through scale information through 2D target detection. Without accurate distance information, in the application of autonomous driving scenarios, the lack of distance information will cause some decision errors and lags. To this end, the present invention obtains a target image at a preset time interval, wherein the target image contains at least one signal light to be identified; based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source category and light source depth of the signal light to be identified; based on the lamp panel position, light source position, light source category and light source depth of the signal light to be identified, a signal light queue corresponding to the signal light to be identified is formed using a preset rule and added to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source category and distance information between the signal light to be identified and the vehicle within a preset time range; based on the destination information and current position information of the vehicle, a signal light queue corresponding to a target signal light is determined from the preset signal light queue set, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass; based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light, a preset speed change rule is used to adjust the vehicle speed so as to pass through the intersection through which the vehicle is to pass. The present invention applies a combination of depth estimation and target detection methods to predict a traffic light with depth information. In an era when telephoto lenses are standard, traffic lights can be discovered earlier and used as an auxiliary distance reference in scenarios where high-precision maps or GPS are unavailable, thereby enhancing the robustness of the autonomous driving system and avoiding some shortcomings of 2D traffic light recognition. For example, in autonomous driving scenario applications, the lack of distance information can cause some decision-making errors and lags.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solution of the present invention. In order to more clearly understand the technical means of the present invention, it can be implemented according to the contents of the specification. In order to make the above and other purposes, features and advantages of the present invention more obvious and easy to understand, the specific implementation methods of the present invention are listed below.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目 的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those skilled in the art by reading the following detailed description of the preferred embodiment. The present invention is not intended to be limiting. In addition, the same reference symbols are used to represent the same components throughout the drawings. In the drawings:
图1示出了本发明实施例提供的一种基于深度学习的信号灯识别方法流程图;FIG1 shows a flow chart of a signal light recognition method based on deep learning provided by an embodiment of the present invention;
图2示出了本发明实施例提供的另一种基于深度学习的信号灯识别方法流程图;FIG2 shows a flow chart of another signal light recognition method based on deep learning provided by an embodiment of the present invention;
图3示出了本发明实施例提供的一种基于深度学习的信号灯识别装置的组成框图;FIG3 shows a block diagram of a signal light recognition device based on deep learning provided by an embodiment of the present invention;
图4示出了本发明实施例提供的另一种基于深度学习的信号灯识别装置的组成框图;FIG4 shows a block diagram of another signal light recognition device based on deep learning provided by an embodiment of the present invention;
图5示出了本发明实施例提供的另一种基于深度学习的信号灯识别装置对应的应用场景图。FIG5 shows an application scenario diagram corresponding to another signal light recognition device based on deep learning provided by an embodiment of the present invention.
具体实施方式DETAILED DESCRIPTION
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。The exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although the exemplary embodiments of the present disclosure are shown in the accompanying drawings, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. On the contrary, these embodiments are provided in order to enable a more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.
对于目前采用的信号灯识别方法是通过2D目标检测方式来完成,但是,由于在行车道路上存在的信号灯型号和尺寸各有不同,通过2D目标检测方式很难通过尺度信息获取信号灯距离本车的距离,没有准确的距离信息,在自动驾驶场景应用中,由于距离信息的缺少,会造成一些决策误差以及滞后。针对此问题,发明人想到应用深度估计和目标检测方法的结合,预测一个带有深度信息的信号灯,在长焦镜头标配的时代,可以更早的发现信号灯,并且在高精地图或GPS不可用场景下作为辅助的距离参考,增强自动驾驶系统鲁棒性。The current method for signal light recognition is accomplished through 2D target detection. However, due to the different types and sizes of signal lights on the road, it is difficult to obtain the distance between the signal light and the vehicle through scale information through 2D target detection. Without accurate distance information, the lack of distance information in autonomous driving scenarios will cause some decision errors and lags. To address this problem, the inventors thought of combining depth estimation and target detection methods to predict a signal light with depth information. In an era when telephoto lenses are standard, signal lights can be discovered earlier, and can be used as an auxiliary distance reference in scenarios where high-precision maps or GPS are unavailable, thereby enhancing the robustness of the autonomous driving system.
为此,本发明实施例提供了一种基于深度学习的信号灯识别方法,通过该方法实现针对自动驾驶车辆通过有信号灯的路口场景,可预测信号灯和本车的距离信息,并对车辆的后续规划进行辅助决策,其具体执行步骤如图1所示,包括: To this end, an embodiment of the present invention provides a signal light recognition method based on deep learning. The method can predict the distance information between the signal light and the vehicle when the autonomous driving vehicle passes through an intersection with a signal light, and assist in decision-making for the subsequent planning of the vehicle. The specific execution steps are shown in FIG1, including:
101、按照预设时间间隔获取目标图像。101. Acquire a target image at a preset time interval.
其中,所述目标图像中至少包含一个待识别信号灯。Wherein, the target image contains at least one signal light to be identified.
交通信号灯是交通安全产品中的一个类别,是为了加强道路交通管理,减少交通事故的发生,提高道路使用效率,改善交通状况的一种重要工具。适用于十字、丁字等交叉路口,由道路交通信号控制机控制,指导车辆和行人安全有序地通行。交通信号灯由红灯、绿灯、黄灯组成。红灯表示禁止通行,绿灯表示准许通行,黄灯表示慢行或警示。《道路交通法实施条例》将交通信号灯分为:机动车信号灯、非机动车信号灯、人行横道信号灯、车道信号灯、方向指示信号灯、闪光警告信号灯、道路与铁路平面交叉道口信号灯。Traffic lights are a category of traffic safety products. They are an important tool to strengthen road traffic management, reduce the occurrence of traffic accidents, improve road use efficiency, and improve traffic conditions. They are suitable for crosses, T-shaped intersections, etc., and are controlled by road traffic signal controllers to guide vehicles and pedestrians to pass safely and orderly. Traffic lights are composed of red lights, green lights, and yellow lights. Red lights indicate that passage is prohibited, green lights indicate that passage is allowed, and yellow lights indicate slow travel or warning. The "Road Traffic Law Implementation Regulations" divide traffic lights into: motor vehicle lights, non-motor vehicle lights, pedestrian crossing lights, lane lights, direction indicator lights, flashing warning lights, and road and railway level crossing lights.
在车辆自动驾驶过程中,如果遇到路口,需要识别路口处的信号灯,根据信号灯的指示进行调整车速,以便通过该路口。本实施例首先需要按照预设时间间隔获取目标图像,其中,所述预设时间间隔可以是每秒、每2秒等,根据实际需要进行设置,本实施例不做具体限定;由于路口通常为十字型或者丁字型,车辆获得的图像中可能会存在3个、2个等不同方向的信号灯,但是,在执行下一步之前所述车辆获得的图像至少得包含一个待识别信号灯。During the process of automatic driving of the vehicle, if it encounters an intersection, it is necessary to identify the traffic lights at the intersection and adjust the vehicle speed according to the instructions of the traffic lights in order to pass the intersection. This embodiment first needs to acquire the target image at a preset time interval, wherein the preset time interval can be every second, every 2 seconds, etc., and is set according to actual needs. This embodiment does not make specific limitations; since the intersection is usually cross-shaped or T-shaped, there may be 3, 2, etc. traffic lights in different directions in the image obtained by the vehicle, but before executing the next step, the image obtained by the vehicle must contain at least one traffic light to be identified.
102、基于目标图像,利用预设深度神经网络预测模型得到待识别信号灯的灯盘位置、光源位置、光源类别和光源深度。102. Based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified.
从步骤101可得所述目标图像,然后利用预设深度神经网络预测模型得到待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;其中,所述预设深度神经网络预测模型是预先设计的,其包括输入图像模块、特征提取模块、特征融合模块、深度预测模块、信号灯灯盘框预测模块、ROI提取模块、灯源回归分类模块等,本实施例不做具体限定;From step 101, the target image can be obtained, and then the lamp panel position, light source position, light source category and light source depth of the signal light to be identified are obtained by using a preset deep neural network prediction model; wherein the preset deep neural network prediction model is pre-designed, and includes an input image module, a feature extraction module, a feature fusion module, a depth prediction module, a signal light panel frame prediction module, a ROI extraction module, a light source regression classification module, etc., which are not specifically limited in this embodiment;
所述输入图像模块获取所述目标图像,作为所述预设深度神经网络预测模型的输入数据,经过特征提取模块进行特征提取,然后经过所述特征融合模块将提取到的特征进行融合,基于融合后的信息通过所述深度预测模块得到光源深度;再基于融合后的信息通过所述信号灯灯盘框预测模块得到灯盘框信息,所述灯盘框信息包括灯盘位置;所述ROI提取模块根据所述灯盘框信息和上述经过特征提取模块得到的特征进行匹配,提取到 ROI特征;基于所述ROI特征,经过所述灯源回归分类模块得到光源位置、光源类别。其中,所述光源类别是按照不同灯源颜色、图像等进行分类,目的是用于指示车辆是否前进或者转向等;本实施例不做具体限定。The input image module obtains the target image as the input data of the preset deep neural network prediction model, extracts features through the feature extraction module, and then fuses the extracted features through the feature fusion module. The depth of the light source is obtained through the depth prediction module based on the fused information; the light panel frame information is obtained through the signal light panel frame prediction module based on the fused information, and the light panel frame information includes the position of the light panel; the ROI extraction module matches the light panel frame information with the features obtained through the feature extraction module, and extracts ROI features: Based on the ROI features, the light source position and light source category are obtained through the light source regression classification module. The light source category is classified according to different light source colors, images, etc., for the purpose of indicating whether the vehicle is moving forward or turning, etc. This embodiment does not make specific limitations.
103、基于待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中。103. Based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified, a signal light queue corresponding to the signal light to be identified is formed using preset rules and added to the preset signal light queue set.
其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;所述待识别信号灯与所述车辆的距离信息是由光源深度得到的,可以是光源深度的数值,也可以是将光源深度与光源尺度信息结合进行校正后的距离。Among them, the traffic light queue saves the lamp panel position, light source position, light source type and distance information between the traffic light to be identified and the vehicle corresponding to the traffic light to be identified within a preset time range; the distance information between the traffic light to be identified and the vehicle is obtained by the light source depth, which can be the numerical value of the light source depth, or the distance corrected by combining the light source depth with the light source scale information.
一个信号灯的灯盘对应一个信号灯队列,也就是说,一个信号灯队列包括一个信号灯所有的信息,例如:灯盘信息、光源相关信息(光源位置、光源类别和光源深度)等;A signal light panel corresponds to a signal light queue, that is, a signal light queue includes all the information of a signal light, such as: light panel information, light source related information (light source position, light source type and light source depth), etc.
将形成的信号灯队列加入到预设信号灯队列集合中的顺序可以设置为所述车辆正前方的信号灯形成的队列位于所述预设信号灯队列结合的第一个,依次后续为路口左边信号灯对应的队列、路口右边信号灯对应的队列;也可以根据所述车辆的目的地和当前位置先确定预备要通过的路口处的信号灯,然后将该信号灯形成的队列位于所述预设信号灯队列集合的第一个;本实施不做具体限定。The order of adding the formed traffic light queues to the preset traffic light queue set can be set as follows: the queue formed by the traffic light directly in front of the vehicle is located at the first one of the preset traffic light queues, followed by the queue corresponding to the traffic light on the left side of the intersection, and the queue corresponding to the traffic light on the right side of the intersection; or the traffic light at the intersection to be passed can be determined first according to the destination and current position of the vehicle, and then the queue formed by the traffic light is located at the first one of the preset traffic light queue set; this implementation does not make specific limitations.
104、基于车辆的目的地信息和当前位置信息,从预设信号灯队列集合中确定目标信号灯对应的信号灯队列。104. Based on the destination information and current location information of the vehicle, determine a signal light queue corresponding to the target signal light from a set of preset signal light queues.
其中,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;Wherein, the target signal light is a signal light set at the intersection where the vehicle is to pass;
从步骤103可得所述预设信号灯队列集合,需要根据车辆自动驾驶系统中设定的目的地和判断的当前位置信息去确定所述车辆预备要通过的路口,当路口确定后可以获得该路口处的信号灯的灯盘位置,然后从所述预设信号灯队列中查找该灯盘位置对应的信号灯队列,查找到的信号灯队列就是预备要通过的路口处的信号灯形成的,该信号灯为目标信号灯,该信号灯队列为所述目标信号灯对应的信号灯队列。 The preset signal light queue set can be obtained from step 103. It is necessary to determine the intersection that the vehicle is going to pass through based on the destination set in the vehicle's automatic driving system and the current position information determined. When the intersection is determined, the light panel position of the signal light at the intersection can be obtained, and then the signal light queue corresponding to the light panel position can be searched from the preset signal light queue. The signal light queue found is formed by the signal light at the intersection that the vehicle is going to pass through. The signal light is the target signal light, and the signal light queue is the signal light queue corresponding to the target signal light.
105、基于目标信号灯对应的信号灯队列保存的目标信号灯与车辆的距离信息和光源类别,利用预设变速规则调整车辆的车速。105. Based on the distance information between the target signal light and the vehicle and the light source type stored in the signal light queue corresponding to the target signal light, the vehicle speed is adjusted using a preset speed change rule.
其中,所述调整车辆的车速用于通过车辆预备通过的路口。The adjusting of the vehicle speed is used to pass through an intersection that the vehicle is preparing to pass.
从步骤104可得所述目标信号灯对应的信号灯队列,从该信号灯队列可以得到预先保存的目标信号灯与车辆的距离信息和光源类别,利用预设变速规则调整车辆的车速。其中,所述预设变速规则可以根据光源类别的不同以及目标信号灯与车辆的距离的不同设置不同等级的车速,例如:当目标信号灯与车辆之间的距离位于第一距离范围时(较远),不论光源类别是什么,都可以先将车速降低到第一等级车速,当目标信号灯与车辆之间的距离逐渐拉近时到达第二距离范围时(不远不近),再进一步判断目标信号灯是什么颜色,如果是红灯,说明车辆距离停车线不远,可能车辆到达停车线时目标信号灯依然是红灯,那么将车辆的车速降低至第二等级车速,以便后续车辆达到停车线时能够平稳快速的停车;如果是绿灯,说明车辆距离停车线不远,可能车辆到达停车线时目标信号灯依然是红灯,那么车辆可以保持第一等级车速。需要说明的是第一等级车速要快于第二等级车速。From step 104, the signal light queue corresponding to the target signal light can be obtained, and the distance information and light source type of the target signal light and the vehicle that are pre-stored can be obtained from the signal light queue, and the vehicle speed can be adjusted using the preset speed change rule. The preset speed change rule can set different levels of speed according to the different light source types and the different distances between the target signal light and the vehicle. For example, when the distance between the target signal light and the vehicle is in the first distance range (far away), regardless of the light source type, the vehicle speed can be reduced to the first level of speed. When the distance between the target signal light and the vehicle gradually decreases and reaches the second distance range (neither far nor near), the color of the target signal light is further determined. If it is a red light, it means that the vehicle is not far from the stop line, and the target signal light may still be red when the vehicle reaches the stop line. In this case, the vehicle speed is reduced to the second level of speed so that the subsequent vehicle can stop smoothly and quickly when it reaches the stop line. If it is a green light, it means that the vehicle is not far from the stop line, and the target signal light may still be red when the vehicle reaches the stop line. In this case, the vehicle can maintain the first level of speed. It should be noted that the first level speed is faster than the second level speed.
基于上述图1实施例的实现方式可以看出,本发明提供一种基于深度学习的信号灯识别方法,本发明通过按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯;基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;基于所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;基于所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口。本发明应用深度估计和目标检测方法的结合, 预测一个带有深度信息的信号灯,在长焦镜头标配的时代,可以更早的发现信号灯,并且在高精地图或GPS不可用场景下作为辅助的距离参考,增强自动驾驶系统鲁棒性,并且可以避免2D信号灯识别的一些缺点,比如在自动驾驶场景应用中,距离信息的缺少,会造成一些决策误差以及滞后。进一步的,作为对图1所示实施例的细化及扩展,本发明实施例还提供了另一种基于深度学习的信号灯识别方法,如图2所示,其具体步骤如下:Based on the implementation of the embodiment of FIG. 1 above, it can be seen that the present invention provides a signal light recognition method based on deep learning. The present invention obtains a target image at a preset time interval, wherein the target image contains at least one signal light to be recognized; based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source category and light source depth of the signal light to be recognized; based on the lamp panel position, light source position, light source category and light source depth of the signal light to be recognized, a signal light queue corresponding to the signal light to be recognized is formed using a preset rule and added to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source category and distance information between the signal light to be recognized and the vehicle within a preset time range; based on the destination information and current position information of the vehicle, a signal light queue corresponding to a target signal light is determined from the preset signal light queue set, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass; based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light, a preset speed change rule is used to adjust the vehicle speed so as to pass through the intersection through which the vehicle is to pass. The present invention combines depth estimation and target detection methods. Predicting a traffic light with depth information can detect traffic lights earlier in the era when telephoto lenses are standard, and can serve as an auxiliary distance reference in scenarios where high-precision maps or GPS are unavailable, enhancing the robustness of the autonomous driving system, and avoiding some shortcomings of 2D traffic light recognition, such as the lack of distance information in autonomous driving scene applications, which can cause some decision errors and lags. Further, as a refinement and extension of the embodiment shown in Figure 1, an embodiment of the present invention also provides another method for traffic light recognition based on deep learning, as shown in Figure 2, and its specific steps are as follows:
201、按照预设时间间隔获取目标图像。201. Acquire a target image at a preset time interval.
本步骤结合上述方法中101步骤的描述,在此相同的内容不赘述。This step is combined with the description of step 101 in the above method, and the same contents are not repeated here.
其中,所述目标图像中至少包含一个待识别信号灯;Wherein, the target image contains at least one signal light to be identified;
202、基于目标图像,利用预设深度神经网络预测模型得到待识别信号灯的灯盘位置、光源位置、光源类别和光源深度。202. Based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified.
本步骤结合上述方法中102步骤的描述,在此相同的内容不赘述。This step is combined with the description of step 102 in the above method, and the same contents are not repeated here.
从步骤201可得所述目标图像,然后基于所述目标图像,利用所述预设深度神经网络预测模型中的resnet-18网络作为主干网络进行特征提取,得到目标特征;基于所述目标特征,利用所述预设深度神经网络预测模型中的PAN-FPN结构进行特征融合,得到目标特征融合信息;基于目标特征融合信息,利用所述预设深度神经网络预测模型中的卷积层和深度预测层得到所述待识别信号灯的所述光源深度;基于所述目标特征融合信息,利用神经网络的全连接技术进行自动回归得到灯盘目标框位置;将所述灯盘目标框位置和所述目标特征进行匹配,根据所述灯盘目标框位置提取ROI特征;基于所述ROI特征进行目标框回归和类别的分类,得到所述待识别信号灯的所述光源位置和所述光源类别,其中,所述光源类别是红灯、黄灯、绿灯、向左箭头、向右箭头、向前箭头、向后箭头和数字中一种或几种。如图5所示,信号灯处的框内为灯盘信息,首先,预设深度神经网络预测模型检测得到信号灯处的框,然后获取对应信号灯处的框ROI内区域的特征,进行内部的光源识别,包括其类别和位置。The target image can be obtained from step 201, and then based on the target image, the resnet-18 network in the preset deep neural network prediction model is used as the backbone network to perform feature extraction to obtain the target feature; based on the target feature, the PAN-FPN structure in the preset deep neural network prediction model is used to perform feature fusion to obtain target feature fusion information; based on the target feature fusion information, the convolution layer and the depth prediction layer in the preset deep neural network prediction model are used to obtain the light source depth of the signal light to be identified; based on the target feature fusion information, the full connection technology of the neural network is used to perform automatic regression to obtain the target frame position of the light panel; the target frame position of the light panel is matched with the target feature, and the ROI feature is extracted according to the target frame position of the light panel; based on the ROI feature, target frame regression and category classification are performed to obtain the light source position and the light source category of the signal light to be identified, wherein the light source category is one or more of a red light, a yellow light, a green light, a left arrow, a right arrow, a forward arrow, a backward arrow and a number. As shown in Figure 5, the box at the traffic light contains the light panel information. First, the preset deep neural network prediction model detects the box at the traffic light, and then obtains the features of the area within the box ROI corresponding to the traffic light to identify the internal light source, including its category and position.
203、基于待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中。 203. Based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified, a signal light queue corresponding to the signal light to be identified is formed using preset rules, and added to the preset signal light queue set.
本步骤结合上述方法中103步骤的描述,在此相同的内容不赘述。This step is combined with the description of step 103 in the above method, and the same contents are not repeated here.
其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息。The signal light queue stores information on the lamp panel position, light source position, light source type, and distance between the signal light to be identified and the vehicle corresponding to the signal light to be identified within a preset time range.
从步骤202可得所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,然后再获取所述待识别信号灯对应的尺度信息,获取尺度信息方法是通过目标光源的图像尺寸(主要是宽和高),通过宽高信息与实际目标尺寸的先验知识得到其大致的距离;将所述待识别信号灯对应的所述光源深度和所述尺度信息进行融合得到所述待识别信号灯与所述车辆的距离信息,模型输出的光源距离要符合其光源的图像尺寸与实际距离的关系,因为交通信号灯的大小有国家标准,其图像尺寸和实际尺寸是有联系的,不会有太大的误差,将尺度信息作为一个先验知识,目的是校正所述待识别信号灯与所述车辆的距离;基于所述待识别信号灯对应的所述灯盘位置、所述光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息形成所述待识别信号灯对应的信号灯队列;按照信号灯队列形成的时间先后顺序将所述待识别信号灯对应的信号灯队列加入到预设信号灯队列集合中。From step 202, the lamp panel position, light source position, light source type and light source depth of the signal light to be identified can be obtained, and then the scale information corresponding to the signal light to be identified is obtained. The method for obtaining the scale information is to obtain the approximate distance through the image size (mainly width and height) of the target light source and the prior knowledge of the width and height information and the actual target size; the light source depth corresponding to the signal light to be identified and the scale information are fused to obtain the distance information between the signal light to be identified and the vehicle. The light source distance output by the model must conform to the relationship between the image size of the light source and the actual distance. Because the size of traffic lights has national standards, the image size and the actual size are related and there will not be too much error. The scale information is used as a priori knowledge to correct the distance between the signal light to be identified and the vehicle; based on the lamp panel position, the light source position, the light source type and the distance information between the signal light to be identified and the vehicle, a signal light queue corresponding to the signal light to be identified is formed; and the signal light queue corresponding to the signal light to be identified is added to the preset signal light queue set in the chronological order of formation of the signal light queue.
204、获取车辆所在车道的车道线信息和路面箭头信息。204. Obtain lane line information and road arrow information of the lane where the vehicle is located.
通过车载摄像头可以获取车辆所在车道上的车道线信息和路面箭头信息,如图5所示,用于后续车辆的当前位置的定位。The on-board camera can be used to obtain the lane line information and road arrow information on the lane where the vehicle is located, as shown in FIG5 , and is used to locate the current position of the subsequent vehicle.
205、基于车道线信息和路面箭头信息确定车辆的当前位置信息。205. Determine the current position information of the vehicle based on the lane line information and the road arrow information.
从步骤204可得所述车道线信息和路面箭头信息,即得到前方路况的图像,如图5所示,包括车道线、路面箭头和信号灯等;通过该图像中心近似本车中心线的位置,可以得到车辆的当前位置信息,所述当前位置信息包括:本车的横向位置及方向,与车道线和路面箭头的关系如图5所示。The lane line information and road arrow information can be obtained from step 204, that is, an image of the road condition ahead can be obtained, as shown in FIG5 , including lane lines, road arrows and traffic lights, etc.; the current position information of the vehicle can be obtained by approximating the position of the center line of the vehicle through the center of the image, and the current position information includes: the lateral position and direction of the vehicle, and the relationship with the lane lines and road arrows is shown in FIG5 .
206、基于车辆的目的地信息和当前位置信息,从预设信号灯队列集合中确定目标信号灯对应的信号灯队列。206. Based on the destination information and current position information of the vehicle, determine a signal light queue corresponding to the target signal light from a set of preset signal light queues.
本步骤结合上述方法中104步骤的描述,在此相同的内容不赘述。This step is combined with the description of step 104 in the above method, and the same contents are not repeated here.
其中,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯。 Wherein, the target traffic light is a traffic light set at the intersection through which the vehicle is to pass.
207、基于目标信号灯对应的信号灯队列保存的目标信号灯与车辆的距离信息和光源类别,利用预设变速规则调整车辆的车速。207. Based on the distance information between the target signal light and the vehicle and the light source type stored in the signal light queue corresponding to the target signal light, the vehicle speed is adjusted using a preset speed change rule.
本步骤结合上述方法中105步骤的描述,在此相同的内容不赘述。This step is combined with the description of step 105 in the above method, and the same contents are not repeated here.
其中,所述调整车辆的车速用于通过车辆预备通过的路口。The adjusting of the vehicle speed is used to pass through an intersection that the vehicle is preparing to pass.
判断所述目标信号灯与所述车辆的距离信息是否小于预设阈值;若所述目标信号灯与所述车辆的距离信息不小于预设阈值,则将所述车辆的车速调整为第一预设车速;若所述目标信号灯与所述车辆的距离信息小于预设阈值,则判断所述光源类别是否为绿灯;若所述光源类别为绿灯,则将所述车辆的车速调整为第一预设车速;若所述光源类别不为绿灯,则将所述车辆的车速调整为第二预设车速,且监测所述车辆是否符合预设停车条件;其中,所述预设停车条件为所述车辆到达停止线前或前方静止车辆前的预设安全距离,所述第二预设阈值小于所述第一预设阈值;当所述车辆符合预设停车条件时,则将所述车辆的车速调整为0。Determine whether the distance information between the target signal light and the vehicle is less than a preset threshold; if the distance information between the target signal light and the vehicle is not less than the preset threshold, adjust the speed of the vehicle to a first preset speed; if the distance information between the target signal light and the vehicle is less than the preset threshold, determine whether the light source category is a green light; if the light source category is a green light, adjust the speed of the vehicle to a first preset speed; if the light source category is not a green light, adjust the speed of the vehicle to a second preset speed, and monitor whether the vehicle meets a preset parking condition; wherein the preset parking condition is a preset safety distance before the vehicle reaches a stop line or a stationary vehicle in front, and the second preset threshold is less than the first preset threshold; when the vehicle meets the preset parking condition, adjust the speed of the vehicle to 0.
进一步的,在本发明的另一优选实施例中,还可以当监测到所述车辆所在车道的前方存在其他车辆时,则保持车距,跟随前方车辆通过所述目标信号灯所在路口;当所述车辆通过所述目标信号灯所在路口之后,将所述预设信号灯队列集合进行清空。Furthermore, in another preferred embodiment of the present invention, when it is detected that there are other vehicles in front of the lane where the vehicle is located, the vehicle distance is maintained and the vehicle in front is followed through the intersection where the target signal light is located; after the vehicle passes the intersection where the target signal light is located, the preset signal light queue set is cleared.
基于上述图2的实现方式可以看出,本发明提供一种基于深度学习的信号灯识别方法,本发明主要针对自动驾驶车辆通过有信号灯的路口场景,应用深度估计和目标检测方法的结合,利用端到端的深度神经网络架构预测一个带有深度信息的信号灯,进而预测信号灯和本车的距离信息,并对车辆的后续规划进行辅助决策,可以避免2D信号灯识别的一些缺点,比如在自动驾驶场景应用中,距离信息的缺少,会造成一些决策误差以及滞后。而且在长焦镜头标配的时代,可以更早的发现信号灯,并且在高精地图或GPS不可用场景下作为辅助的距离参考,增强自动驾驶系统鲁棒性。可以通过高精度图和GPS信息,实时获取本车与信号灯的距离,但这要求高精地图要准确和更新及时,还有保证GPS误差小,这两个条件在现实中都会遇到失效的情况。Based on the implementation of FIG. 2 above, it can be seen that the present invention provides a signal light recognition method based on deep learning. The present invention mainly targets the scene where an autonomous driving vehicle passes through an intersection with a signal light, applies a combination of depth estimation and target detection methods, and uses an end-to-end deep neural network architecture to predict a signal light with depth information, and then predicts the distance information between the signal light and the vehicle, and assists in the subsequent planning of the vehicle. Decision-making can be avoided in some shortcomings of 2D signal light recognition, such as the lack of distance information in the application of autonomous driving scenarios, which will cause some decision errors and lags. Moreover, in the era of standard telephoto lenses, signal lights can be found earlier, and can be used as auxiliary distance references in scenarios where high-precision maps or GPS are unavailable, thereby enhancing the robustness of the autonomous driving system. The distance between the vehicle and the signal light can be obtained in real time through high-precision maps and GPS information, but this requires that the high-precision map be accurate and updated in a timely manner, and that the GPS error is small. Both of these conditions will fail in reality.
进一步的,作为对上述图1所示方法的实现,本发明实施例还提供了一种基于深度学习的信号灯识别装置,用于对上述图1所示的方法进行实 现。该装置实施例与前述方法实施例对应,为便于阅读,本装置实施例不再对前述方法实施例中的细节内容进行逐一赘述,但应当明确,本实施例中的装置能够对应实现前述方法实施例中的全部内容。如图3所示,该装置包括:Furthermore, as an implementation of the method shown in FIG. 1 above, an embodiment of the present invention further provides a signal light recognition device based on deep learning, which is used to implement the method shown in FIG. 1 above. This device embodiment corresponds to the aforementioned method embodiment. For ease of reading, this device embodiment will not repeat the details of the aforementioned method embodiment one by one, but it should be clear that the device in this embodiment can correspond to all the contents of the aforementioned method embodiment. As shown in Figure 3, the device includes:
第一获取单元31,用于按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯;A first acquisition unit 31 is used to acquire a target image at a preset time interval, wherein the target image contains at least one signal light to be identified;
预测单元32,用于基于从所述获取单元31得到的所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;A prediction unit 32, configured to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified by using a preset deep neural network prediction model based on the target image obtained from the acquisition unit 31;
形成单元33,用于基于从所述预测单元32得到的所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;The forming unit 33 is used to form a signal light queue corresponding to the signal light to be identified based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified obtained from the prediction unit 32 using a preset rule, and add the signal light queue to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle within a preset time range.
第一确定单元34,用于基于所述车辆的目的地信息和当前位置信息,从根据所述形成单元33得到的所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;A first determining unit 34 is used to determine, based on the destination information and current position information of the vehicle, a signal light queue corresponding to a target signal light from the preset signal light queue set obtained by the forming unit 33, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass;
调整单元35,用于基于从所述确定单元34得到的所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口。The adjustment unit 35 is used to adjust the speed of the vehicle using a preset speed change rule based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light obtained from the determination unit 34, so as to pass through the intersection that the vehicle is preparing to pass.
进一步的,作为对上述图2所示方法的实现,本发明实施例还提供了另一种基于深度学习的信号灯识别装置,用于对上述图2所示的方法进行实现。该装置实施例与前述方法实施例对应,为便于阅读,本装置实施例不再对前述方法实施例中的细节内容进行逐一赘述,但应当明确,本实施例中的装置能够对应实现前述方法实施例中的全部内容。如图4所示,该装置包括:Furthermore, as an implementation of the method shown in FIG. 2 above, an embodiment of the present invention also provides another signal light recognition device based on deep learning, which is used to implement the method shown in FIG. 2 above. This device embodiment corresponds to the aforementioned method embodiment. For ease of reading, this device embodiment will no longer repeat the details of the aforementioned method embodiment one by one, but it should be clear that the device in this embodiment can correspond to all the contents of the aforementioned method embodiment. As shown in FIG. 4, the device includes:
第一获取单元31,用于按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯; A first acquisition unit 31 is used to acquire a target image at a preset time interval, wherein the target image contains at least one signal light to be identified;
预测单元32,用于基于从所述获取单元31得到的所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;A prediction unit 32, configured to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified by using a preset deep neural network prediction model based on the target image obtained from the acquisition unit 31;
形成单元33,用于基于从所述预测单元32得到的所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;The forming unit 33 is used to form a signal light queue corresponding to the signal light to be identified based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified obtained from the prediction unit 32 using a preset rule, and add the signal light queue to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle within a preset time range.
第一确定单元34,用于基于所述车辆的目的地信息和从所述第二确定单元37得到的当前位置信息,从根据所述形成单元33得到的所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;a first determining unit 34, configured to determine, based on the destination information of the vehicle and the current position information obtained from the second determining unit 37, a signal light queue corresponding to a target signal light from the set of preset signal light queues obtained by the forming unit 33, wherein the target signal light is a signal light provided at the intersection through which the vehicle is to pass;
调整单元35,用于基于从所述确定单元34得到的所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口;an adjusting unit 35, configured to adjust the speed of the vehicle by using a preset speed change rule based on the distance information between the target signal light and the vehicle and the light source type stored in the signal light queue corresponding to the target signal light obtained from the determining unit 34, so as to pass through the intersection that the vehicle is preparing to pass;
第二获取单元36,用于获取所述车辆所在车道的车道线信息和路面箭头信息;A second acquisition unit 36 is used to acquire lane line information and road arrow information of the lane where the vehicle is located;
第二确定单元37,用于基于从所述第二获取单元36得到的所述车道线信息和所述路面箭头信息确定所述车辆的当前位置信息;A second determining unit 37, configured to determine the current position information of the vehicle based on the lane line information and the road arrow information obtained from the second acquiring unit 36;
监测单元38,用于当监测到所述车辆所在车道的前方存在其他车辆时,则保持车距,跟随前方车辆通过所述目标信号灯所在路口;The monitoring unit 38 is used to maintain the vehicle distance and follow the vehicle in front through the intersection where the target signal light is located when it is detected that there are other vehicles in front of the lane where the vehicle is located;
删除单元39,用于当从所述监测单元38得到的所述车辆通过所述目标信号灯所在路口之后,将所述预设信号灯队列集合进行清空。The deleting unit 39 is used to clear the preset signal light queue set after the vehicle obtained from the monitoring unit 38 passes through the intersection where the target signal light is located.
进一步的,所述预测单元32,包括:Furthermore, the prediction unit 32 includes:
第一特征提取模块321,用于基于所述目标图像,利用所述预设深度神经网络预测模型中的resnet-18网络作为主干网络进行特征提取,得到目标特征; A first feature extraction module 321 is used to extract features based on the target image using the resnet-18 network in the preset deep neural network prediction model as a backbone network to obtain target features;
特征融合模块322,用于基于从所述第一特征提取模块321得到的所述目标特征,利用所述预设深度神经网络预测模型中的PAN-FPN结构进行特征融合,得到目标特征融合信息;A feature fusion module 322 is used to perform feature fusion based on the target feature obtained from the first feature extraction module 321 using the PAN-FPN structure in the preset deep neural network prediction model to obtain target feature fusion information;
第一预测模块323,用于基于从所述特征融合模块322得到的目标特征融合信息,利用所述预设深度神经网络预测模型中的卷积层和深度预测层得到所述待识别信号灯的所述光源深度;A first prediction module 323, configured to obtain the light source depth of the signal light to be identified by using the convolution layer and the depth prediction layer in the preset deep neural network prediction model based on the target feature fusion information obtained from the feature fusion module 322;
回归模块324,用于基于从所述特征融合模块322得到的所述目标特征融合信息,利用神经网络的全连接技术进行自动回归得到灯盘目标框位置;A regression module 324 is used to automatically regress the target frame position of the lamp panel using a full connection technology of a neural network based on the target feature fusion information obtained from the feature fusion module 322;
第二特征提取模块325,用于将从所述回归模块324得到的所述灯盘目标框位置和所述目标特征进行匹配,根据所述灯盘目标框位置提取ROI特征;A second feature extraction module 325 is used to match the lamp panel target frame position obtained from the regression module 324 with the target feature, and extract ROI features according to the lamp panel target frame position;
第二预测模块326,用于基于从所述第二特征提取模块325得到的所述ROI特征进行目标框回归和类别的分类,得到所述待识别信号灯的所述光源位置和所述光源类别,其中,所述光源类别是红灯、黄灯、绿灯、向左箭头、向右箭头、向前箭头、向后箭头和数字中一种或几种。The second prediction module 326 is used to perform target frame regression and category classification based on the ROI features obtained from the second feature extraction module 325 to obtain the light source position and the light source category of the signal light to be identified, wherein the light source category is one or more of a red light, a yellow light, a green light, a left arrow, a right arrow, a forward arrow, a backward arrow and a number.
进一步的,所述形成单元33,包括:Furthermore, the forming unit 33 includes:
第一获取模块331,用于获取所述待识别信号灯对应的尺度信息;A first acquisition module 331 is used to acquire the scale information corresponding to the signal light to be identified;
第二获取模块332,用于将所述待识别信号灯对应的所述光源深度和从所述第一获取模块331得到的所述尺度信息进行融合得到所述待识别信号灯与所述车辆的距离信息;A second acquisition module 332 is used to fuse the light source depth corresponding to the signal light to be identified and the scale information obtained from the first acquisition module 331 to obtain the distance information between the signal light to be identified and the vehicle;
形成模块333,用于基于所述待识别信号灯对应的所述灯盘位置、所述光源位置、光源类别和从所述第二获取模块332得到的所述待识别信号灯与所述车辆的距离信息形成所述待识别信号灯对应的信号灯队列;A forming module 333, configured to form a signal light queue corresponding to the signal light to be identified based on the lamp panel position, the light source position, the light source type and the distance information between the signal light to be identified and the vehicle obtained from the second acquiring module 332;
加入模块334,用于按照信号灯队列形成的时间先后顺序将从所述形成模块333得到的所述待识别信号灯对应的信号灯队列加入到预设信号灯队列集合中。The adding module 334 is used to add the signal light queue corresponding to the signal light to be identified obtained from the forming module 333 to the preset signal light queue set according to the time sequence of the formation of the signal light queue.
进一步的,所述调整单元35,包括:Furthermore, the adjustment unit 35 includes:
第一判断模块351,用于判断所述目标信号灯与所述车辆的距离信息是否小于预设阈值; A first judgment module 351 is used to judge whether the distance information between the target signal light and the vehicle is less than a preset threshold;
第一调整模块352,用于若从所述第一判断模块351得到的所述目标信号灯与所述车辆的距离信息不小于预设阈值,则将所述车辆的车速调整为第一预设车速;A first adjustment module 352, configured to adjust the speed of the vehicle to a first preset speed if the distance information between the target signal light and the vehicle obtained from the first judgment module 351 is not less than a preset threshold;
第二判断模块353,用于若从所述第一判断模块351得到的所述目标信号灯与所述车辆的距离信息小于预设阈值,则判断所述光源类别是否为绿灯;A second judgment module 353 is used to judge whether the light source type is a green light if the distance information between the target signal light and the vehicle obtained from the first judgment module 351 is less than a preset threshold;
第二调整模块354,用于若从所述第二判断模块353得到的所述光源类别为绿灯,则将所述车辆的车速调整为第一预设车速;A second adjustment module 354 is configured to adjust the speed of the vehicle to a first preset speed if the light source type obtained from the second judgment module 353 is a green light;
第三调整模块355,用于若从所述第二判断模块353得到的所述光源类别不为绿灯,则将所述车辆的车速调整为第二预设车速,且监测所述车辆是否符合预设停车条件;其中,所述预设停车条件为所述车辆到达停止线前或前方静止车辆前的预设安全距离,所述第二预设阈值小于所述第一预设阈值;a third adjustment module 355, configured to adjust the speed of the vehicle to a second preset speed if the light source type obtained from the second judgment module 353 is not a green light, and monitor whether the vehicle meets a preset parking condition; wherein the preset parking condition is a preset safety distance before the vehicle reaches a stop line or a stationary vehicle ahead, and the second preset threshold is less than the first preset threshold;
当所述车辆符合预设停车条件时,则将所述车辆的车速调整为0。When the vehicle meets the preset parking condition, the speed of the vehicle is adjusted to 0.
进一步的,本发明实施例还提供一种处理器,所述处理器用于运行程序,其中,所述程序运行时执行上述图1-2中所述的基于深度学习的信号灯识别方法。Furthermore, an embodiment of the present invention also provides a processor, which is used to run a program, wherein the program, when running, executes the deep learning-based traffic light recognition method described in Figures 1-2 above.
进一步的,本发明实施例还提供一种存储介质,所述存储介质用于存储计算机程序,其中,所述计算机程序运行时控制所述存储介质所在设备执行上述图1-2中所述的基于深度学习的信号灯识别方法。Furthermore, an embodiment of the present invention also provides a storage medium, which is used to store a computer program, wherein when the computer program is running, it controls the device where the storage medium is located to execute the deep learning-based traffic light recognition method described in Figures 1-2 above.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above embodiments, the description of each embodiment has its own emphasis. For parts that are not described in detail in a certain embodiment, reference can be made to the relevant descriptions of other embodiments.
可以理解的是,上述方法及装置中的相关特征可以相互参考。另外,上述实施例中的“第一”、“第二”等是用于区分各实施例,而并不代表各实施例的优劣。It is understandable that the related features in the above methods and devices can be referenced to each other. In addition, the "first", "second" and the like in the above embodiments are used to distinguish the embodiments, and do not represent the advantages and disadvantages of the embodiments.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working processes of the systems, devices and units described above can refer to the corresponding processes in the aforementioned method embodiments and will not be repeated here.
在此提供的算法和显示不与任何特定计算机、虚拟系统或者其它设备固有相关。各种通用系统也可以与基于在此的示教一起使用。根据上面的 描述,构造这类系统所要求的结构是显而易见的。此外,本发明也不针对任何特定编程语言。应当明白,可以利用各种编程语言实现在此描述的本发明的内容,并且上面对特定语言所做的描述是为了披露本发明的最佳实施方式。The algorithms and displays provided herein are not inherently related to any particular computer, virtual system, or other device. Various general purpose systems may also be used with the teachings based herein. Description, it is obvious to construct the structure required for this type of system. In addition, the present invention is not directed to any specific programming language. It should be understood that various programming languages can be utilized to implement the content of the present invention described herein, and the above description of specific languages is for the purpose of disclosing the best mode of the present invention.
此外,存储器可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM),存储器包括至少一个存储芯片。In addition, the memory may include non-permanent memory in a computer-readable medium, random access memory (RAM) and/or non-volatile memory in the form of read-only memory (ROM) or flash RAM, and the memory includes at least one memory chip.
本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art will appreciate that the embodiments of the present application may be provided as methods, systems, or computer program products. Therefore, the present application may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment in combination with software and hardware. Moreover, the present application may adopt the form of a computer program product implemented in one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) that contain computer-usable program code.
本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to the flowchart and/or block diagram of the method, device (system) and computer program product according to the embodiment of the present application. It should be understood that each process and/or box in the flowchart and/or block diagram, and the combination of the process and/or box in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor or other programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce a device for realizing the function specified in one process or multiple processes in the flowchart and/or one box or multiple boxes in the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to work in a specific manner, so that the instructions stored in the computer-readable memory produce a manufactured product including an instruction device that implements the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。 These computer program instructions may also be loaded onto a computer or other programmable data processing device so that a series of operational steps are executed on the computer or other programmable device to produce a computer-implemented process, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.
在一个典型的配置中,计算设备包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration, a computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
存储器可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM)。存储器是计算机可读介质的示例。Memory may include non-permanent storage in a computer-readable medium, random access memory (RAM) and/or non-volatile memory in the form of read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。Computer readable media include permanent and non-permanent, removable and non-removable media that can be implemented by any method or technology to store information. Information can be computer readable instructions, data structures, program modules or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disk read-only memory (CD-ROM), digital versatile disk (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media that can be used to store information that can be accessed by a computing device. As defined in this article, computer readable media does not include temporary computer readable media (transitory media), such as modulated data signals and carrier waves.
还需要说明的是,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、商品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、商品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括要素的过程、方法、商品或者设备中还存在另外的相同要素。It should also be noted that the terms "include", "comprises" or any other variations thereof are intended to cover non-exclusive inclusion, so that a process, method, commodity or device including a series of elements includes not only those elements, but also other elements not explicitly listed, or also includes elements inherent to such process, method, commodity or device. In the absence of more restrictions, the elements defined by the sentence "comprises a ..." do not exclude the existence of other identical elements in the process, method, commodity or device including the elements.
本领域技术人员应明白,本申请的实施例可提供为方法、系统或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art will appreciate that the embodiments of the present application may be provided as methods, systems or computer program products. Therefore, the present application may adopt the form of a complete hardware embodiment, a complete software embodiment or an embodiment in combination with software and hardware. Moreover, the present application may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) that contain computer-usable program code.
以上仅为本申请的实施例而已,并不用于限制本申请。对于本领域技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原理之 内所作的任何修改、等同替换、改进等,均应包含在本申请的权利要求范围之内。 The above are only embodiments of the present application and are not intended to limit the present application. For those skilled in the art, the present application may have various modifications and variations. Any modifications, equivalent substitutions, improvements, etc. made within the scope of the present application should be included in the scope of the claims of this application.

Claims (10)

  1. 一种基于深度学习的信号灯识别方法,其特征在于,所述方法包括:A signal light recognition method based on deep learning, characterized in that the method comprises:
    按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯;Acquire a target image at a preset time interval, wherein the target image contains at least one signal light to be identified;
    基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;Based on the target image, a preset deep neural network prediction model is used to obtain the lamp panel position, light source position, light source type and light source depth of the signal light to be identified;
    基于所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;Based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified, a signal light queue corresponding to the signal light to be identified is formed using preset rules and added to a preset signal light queue set; wherein the signal light queue stores the lamp panel position, light source position, light source type and distance information between the signal light to be identified and the vehicle within a preset time range corresponding to the signal light to be identified;
    基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;Based on the destination information and current location information of the vehicle, determining a signal light queue corresponding to a target signal light from the preset signal light queue set, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass;
    基于所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口。Based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light, the speed of the vehicle is adjusted using a preset speed change rule so as to pass through the intersection that the vehicle is preparing to pass.
  2. 根据权利要求1所述的方法,其特征在于,所述基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,包括:The method according to claim 1 is characterized in that the step of obtaining the lamp panel position, light source position, light source type and light source depth of the signal light to be identified based on the target image using a preset deep neural network prediction model comprises:
    基于所述目标图像,利用所述预设深度神经网络预测模型中的resnet-18网络作为主干网络进行特征提取,得到目标特征;Based on the target image, the resnet-18 network in the preset deep neural network prediction model is used as the backbone network to perform feature extraction to obtain target features;
    基于所述目标特征,利用所述预设深度神经网络预测模型中的PAN-FPN结构进行特征融合,得到目标特征融合信息;Based on the target features, the PAN-FPN structure in the preset deep neural network prediction model is used to perform feature fusion to obtain target feature fusion information;
    基于目标特征融合信息,利用所述预设深度神经网络预测模型中的卷积层和深度预测层得到所述待识别信号灯的所述光源深度。 Based on the target feature fusion information, the convolution layer and the depth prediction layer in the preset deep neural network prediction model are used to obtain the light source depth of the signal light to be identified.
  3. 根据权利要求2所述的方法,其特征在于,所述基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,包括:The method according to claim 2 is characterized in that the step of obtaining the lamp panel position, light source position, light source type and light source depth of the signal light to be identified based on the target image using a preset deep neural network prediction model comprises:
    基于所述目标特征融合信息,利用神经网络的全连接技术进行自动回归得到灯盘目标框位置;Based on the target feature fusion information, the fully connected technology of the neural network is used to automatically regress and obtain the target frame position of the lamp panel;
    将所述灯盘目标框位置和所述目标特征进行匹配,根据所述灯盘目标框位置提取ROI特征;Matching the lamp panel target frame position with the target feature, and extracting ROI features according to the lamp panel target frame position;
    基于所述ROI特征进行目标框回归和类别的分类,得到所述待识别信号灯的所述光源位置和所述光源类别,其中,所述光源类别是红灯、黄灯、绿灯、向左箭头、向右箭头、向前箭头、向后箭头和数字中一种或几种。Target frame regression and category classification are performed based on the ROI features to obtain the light source position and the light source category of the signal light to be identified, wherein the light source category is one or more of a red light, a yellow light, a green light, a left arrow, a right arrow, a forward arrow, a backward arrow and a number.
  4. 根据权利要求1所述的方法,其特征在于,所述基于所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中,包括:The method according to claim 1 is characterized in that, based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified, a signal light queue corresponding to the signal light to be identified is formed using a preset rule and added to a preset signal light queue set, comprising:
    获取所述待识别信号灯对应的尺度信息;Obtaining scale information corresponding to the signal light to be identified;
    将所述待识别信号灯对应的所述光源深度和所述尺度信息进行融合得到所述待识别信号灯与所述车辆的距离信息;The light source depth and the scale information corresponding to the signal light to be identified are integrated to obtain the distance information between the signal light to be identified and the vehicle;
    基于所述待识别信号灯对应的所述灯盘位置、所述光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息形成所述待识别信号灯对应的信号灯队列;Forming a signal light queue corresponding to the signal light to be identified based on the light panel position, the light source position, the light source type and the distance information between the signal light to be identified and the vehicle;
    按照信号灯队列形成的时间先后顺序将所述待识别信号灯对应的信号灯队列加入到预设信号灯队列集合中。The signal light queues corresponding to the signal lights to be identified are added to the preset signal light queue set according to the time sequence of the formation of the signal light queues.
  5. 根据权利要求1所述的方法,其特征在于,所述基于所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口,包括:The method according to claim 1 is characterized in that the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light are used to adjust the speed of the vehicle by using a preset speed change rule so as to pass through the intersection that the vehicle is to pass, comprising:
    判断所述目标信号灯与所述车辆的距离信息是否小于预设阈值;Determining whether the distance information between the target signal light and the vehicle is less than a preset threshold;
    若所述目标信号灯与所述车辆的距离信息不小于预设阈值,则将所述车辆的车速调整为第一预设车速; If the distance information between the target signal light and the vehicle is not less than a preset threshold, adjusting the speed of the vehicle to a first preset speed;
    若所述目标信号灯与所述车辆的距离信息小于预设阈值,则判断所述光源类别是否为绿灯;If the distance information between the target signal light and the vehicle is less than a preset threshold, determining whether the light source type is a green light;
    若所述光源类别为绿灯,则将所述车辆的车速调整为第一预设车速;If the light source type is a green light, adjusting the speed of the vehicle to a first preset speed;
    若所述光源类别不为绿灯,则将所述车辆的车速调整为第二预设车速,且监测所述车辆是否符合预设停车条件;其中,所述预设停车条件为所述车辆到达停止线前或前方静止车辆前的预设安全距离,所述第二预设阈值小于所述第一预设阈值;If the light source type is not a green light, the speed of the vehicle is adjusted to a second preset speed, and the vehicle is monitored to see whether it meets a preset parking condition; wherein the preset parking condition is a preset safety distance before the vehicle reaches a stop line or a stationary vehicle ahead, and the second preset threshold is less than the first preset threshold;
    当所述车辆符合预设停车条件时,则将所述车辆的车速调整为0。When the vehicle meets the preset parking condition, the speed of the vehicle is adjusted to 0.
  6. 根据权利要求1-5中任一项所述的方法,其特征在于,在所基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列之前,所述方法还包括:The method according to any one of claims 1 to 5, characterized in that before determining the signal light queue corresponding to the target signal light from the preset signal light queue set based on the destination information and current position information of the vehicle, the method further comprises:
    获取所述车辆所在车道的车道线信息和路面箭头信息;Obtaining lane line information and road arrow information of the lane where the vehicle is located;
    基于所述车道线信息和所述路面箭头信息确定所述车辆的当前位置信息;Determining the current position information of the vehicle based on the lane line information and the road arrow information;
  7. 根据权利要求6所述的方法,其特征在于,在所述基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列之后,所述方法还包括:The method according to claim 6, characterized in that, after determining the signal light queue corresponding to the target signal light from the preset signal light queue set based on the destination information and current location information of the vehicle, the method further comprises:
    当监测到所述车辆所在车道的前方存在其他车辆时,则保持车距,跟随前方车辆通过所述目标信号灯所在路口;When it is detected that there are other vehicles ahead of the lane where the vehicle is located, the vehicle maintains the distance between the vehicles and follows the vehicle ahead through the intersection where the target signal light is located;
    当所述车辆通过所述目标信号灯所在路口之后,将所述预设信号灯队列集合进行清空。After the vehicle passes the intersection where the target signal light is located, the preset signal light queue set is cleared.
  8. 一种基于深度学习的信号灯识别装置,其特征在于,包括:A signal light recognition device based on deep learning, characterized by comprising:
    第一获取单元,用于按照预设时间间隔获取目标图像,所述目标图像中至少包含一个待识别信号灯;A first acquisition unit, configured to acquire a target image at a preset time interval, wherein the target image includes at least one signal light to be identified;
    预测单元,用于基于所述目标图像,利用预设深度神经网络预测模型得到所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度;A prediction unit, configured to obtain, based on the target image, a lamp panel position, a light source position, a light source type, and a light source depth of the signal light to be identified by using a preset deep neural network prediction model;
    形成单元,用于基于所述待识别信号灯的灯盘位置、光源位置、光源类别和光源深度,利用预设规则形成所述待识别信号灯对应的信号灯队列,且加入到预设信号灯队列集合中;其中,所述信号灯队列保存预设时 间范围内所述待识别信号灯对应的灯盘位置、光源位置、光源类别和所述待识别信号灯与所述车辆的距离信息;A forming unit is used to form a signal light queue corresponding to the signal light to be identified based on the lamp panel position, light source position, light source type and light source depth of the signal light to be identified using a preset rule, and add it to the preset signal light queue set; wherein the signal light queue saves the preset time The information of the lamp panel position, light source position, light source type and the distance between the signal light to be identified and the vehicle corresponding to the signal light to be identified within the time range;
    第一确定单元,用于基于所述车辆的目的地信息和当前位置信息,从所述预设信号灯队列集合中确定目标信号灯对应的信号灯队列,所述目标信号灯是所述车辆预备通过的路口处设置的信号灯;A first determining unit is used to determine a signal light queue corresponding to a target signal light from the preset signal light queue set based on the destination information and current position information of the vehicle, wherein the target signal light is a signal light set at the intersection through which the vehicle is to pass;
    调整单元,用于基于所述目标信号灯对应的信号灯队列保存的所述目标信号灯与所述车辆的距离信息和所述光源类别,利用预设变速规则调整所述车辆的车速,用于通过所述车辆预备通过的路口。An adjustment unit is used to adjust the speed of the vehicle using a preset speed change rule based on the distance information between the target signal light and the vehicle and the light source category stored in the signal light queue corresponding to the target signal light, so as to pass the intersection that the vehicle is preparing to pass.
  9. 一种存储介质,所述存储介质包括存储的程序,其特征在于,在所述程序运行时控制所述存储介质所在设备执行权利要求1至权利要求7中任一项所述基于深度学习的信号灯识别方法。A storage medium, comprising a stored program, characterized in that when the program is running, the device where the storage medium is located is controlled to execute the traffic light recognition method based on deep learning as described in any one of claims 1 to claim 7.
  10. 一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述程序时实现如权利要求1至权利要求7中任一项所述基于深度学习的信号灯识别方法。 An electronic device comprises a memory, a processor and a computer program stored in the memory and executable on the processor, wherein when the processor executes the program, the signal light recognition method based on deep learning as described in any one of claims 1 to 7 is implemented.
PCT/CN2024/072682 2023-03-23 2024-01-17 Deep learning-based signal light recognition method and apparatus WO2024193208A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202310319112.2 2023-03-23
CN202310319112.2A CN116503832A (en) 2023-03-23 2023-03-23 Signal lamp identification method and device based on deep learning

Publications (1)

Publication Number Publication Date
WO2024193208A1 true WO2024193208A1 (en) 2024-09-26

Family

ID=87317489

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/072682 WO2024193208A1 (en) 2023-03-23 2024-01-17 Deep learning-based signal light recognition method and apparatus

Country Status (2)

Country Link
CN (1) CN116503832A (en)
WO (1) WO2024193208A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116503832A (en) * 2023-03-23 2023-07-28 合众新能源汽车股份有限公司 Signal lamp identification method and device based on deep learning

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018115511A1 (en) * 2016-12-22 2018-06-28 Luxembourg Institute Of Science And Technology (List) Method and system for enhanced traffic light signaling and for computing a target speed of an automotive vehicle
CN108804983A (en) * 2017-05-03 2018-11-13 腾讯科技(深圳)有限公司 Traffic signal light condition recognition methods, device, vehicle-mounted control terminal and motor vehicle
CN110533940A (en) * 2019-08-15 2019-12-03 北京百度网讯科技有限公司 Method, apparatus, equipment and the computer storage medium of abnormal traffic signal lamp identification
CN113689705A (en) * 2020-05-19 2021-11-23 深圳市丰驰顺行信息技术有限公司 Method and device for detecting red light running of vehicle, computer equipment and storage medium
CN116503832A (en) * 2023-03-23 2023-07-28 合众新能源汽车股份有限公司 Signal lamp identification method and device based on deep learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018115511A1 (en) * 2016-12-22 2018-06-28 Luxembourg Institute Of Science And Technology (List) Method and system for enhanced traffic light signaling and for computing a target speed of an automotive vehicle
CN108804983A (en) * 2017-05-03 2018-11-13 腾讯科技(深圳)有限公司 Traffic signal light condition recognition methods, device, vehicle-mounted control terminal and motor vehicle
CN110533940A (en) * 2019-08-15 2019-12-03 北京百度网讯科技有限公司 Method, apparatus, equipment and the computer storage medium of abnormal traffic signal lamp identification
CN113689705A (en) * 2020-05-19 2021-11-23 深圳市丰驰顺行信息技术有限公司 Method and device for detecting red light running of vehicle, computer equipment and storage medium
CN116503832A (en) * 2023-03-23 2023-07-28 合众新能源汽车股份有限公司 Signal lamp identification method and device based on deep learning

Also Published As

Publication number Publication date
CN116503832A (en) 2023-07-28

Similar Documents

Publication Publication Date Title
CN109785667B (en) Lane departure recognition method, apparatus, device, and storage medium
US11498577B2 (en) Behavior prediction device
US11091161B2 (en) Apparatus for controlling lane change of autonomous vehicle and method thereof
CN107848534B (en) Vehicle control device, vehicle control method, and medium storing vehicle control program
CN108133484B (en) Automatic driving processing method and device based on scene segmentation and computing equipment
US10847034B2 (en) Apparatus and method for controlling lane change for vehicle
US20210107486A1 (en) Apparatus for determining lane change strategy of autonomous vehicle and method thereof
CN113212454A (en) Method and device for adjusting vehicle running state, computer equipment and storage medium
WO2023197408A1 (en) Method and apparatus for determining vehicle speed control model training sample
JP2018036796A (en) Environment information processing device
CN113009539A (en) Automatic lane changing processing method for vehicle, vehicle and equipment
TW201428701A (en) Lane departure warning system and lane identification apparatus and related method thereof
CN113895456A (en) Intersection driving method and device for automatic driving vehicle, vehicle and medium
WO2024193208A1 (en) Deep learning-based signal light recognition method and apparatus
CN112001235A (en) Vehicle traffic information generation method and device and computer equipment
CN115273512A (en) Anti-collision auxiliary method, device, equipment and medium for automatically driving vehicle
CN109946688A (en) Lane-change contextual data extracting method, device and server
CN116266380A (en) Environment data reconstruction method, device, system and storage medium
CN108154119B (en) Automatic driving processing method and device based on self-adaptive tracking frame segmentation
CN113538909A (en) Traffic incident prediction method and device for automatic driving vehicle
CN113156923B (en) Road data generation, navigation and running control method, device, equipment and medium
US20210004016A1 (en) U-turn control system for autonomous vehicle and method therefor
CN107458308B (en) Driving assisting method and system
CN114782924A (en) Traffic light detection method and device for automatic driving and electronic equipment
CN113771846B (en) Automatic control method, device, system and medium for vehicle

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24773777

Country of ref document: EP

Kind code of ref document: A1