[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2023116641A1 - Parking space detection model training method and apparatus, and parking space detection method and apparatus - Google Patents

Parking space detection model training method and apparatus, and parking space detection method and apparatus Download PDF

Info

Publication number
WO2023116641A1
WO2023116641A1 PCT/CN2022/140134 CN2022140134W WO2023116641A1 WO 2023116641 A1 WO2023116641 A1 WO 2023116641A1 CN 2022140134 W CN2022140134 W CN 2022140134W WO 2023116641 A1 WO2023116641 A1 WO 2023116641A1
Authority
WO
WIPO (PCT)
Prior art keywords
parking space
regression
target
category
corner
Prior art date
Application number
PCT/CN2022/140134
Other languages
French (fr)
Chinese (zh)
Inventor
王东伟
吉方成
Original Assignee
北京罗克维尔斯科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京罗克维尔斯科技有限公司 filed Critical 北京罗克维尔斯科技有限公司
Publication of WO2023116641A1 publication Critical patent/WO2023116641A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present disclosure relates to the technical field of automatic driving, in particular to a parking space detection model training method, a parking space detection method and a device thereof.
  • each parking space only outputs a set of corner points of parking spaces, which has strong randomness, and the corner positions of different frames are easy to jump.
  • the embodiment of the first aspect of the present disclosure proposes a parking space detection model training method, by obtaining a sample image of a sample parking space, and obtaining the center position of the sample parking space from the sample image; according to the center position, determine the The parking space recognition redundant area and the parking space corner regression redundant area in the sample image; according to the parking space identification redundant area, the sample image is marked to generate the category true value heat map of the sample image, and according to The parking space corner returns to a redundant area, and the sample image is marked to generate a regression true value thermodynamic map of the parking space corner of the sample image; based on the sample image, the category true value thermodynamic map and the Regression to the true value heat map, training the parking space detection model, and generating the target parking space detection model.
  • the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, and can detect the center of the parking space more accurately. location, bay category, and corner location.
  • the embodiment of the second aspect of the present disclosure proposes a parking space detection method, including: acquiring a target image to be predicted, wherein the target image includes at least one parking space to be tested; inputting the target image into a target parking space detection model, To obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes a redundant area for identifying the category of the parking space, and the regression target heat map includes The redundant area used for the regression of the corner point of the parking space, the target parking space detection model is generated by training the parking space detection model training method described in the embodiment of the first aspect above; The redundant area used for the regression of the parking space corners is described, and the target parking space category and the target parking space corner points of the parking spaces to be tested are determined.
  • the embodiment of the third aspect of the present disclosure proposes a parking space detection model training device, including: an acquisition module, configured to acquire a sample image of a sample parking space, and obtain the center position of the sample parking space from the sample image; a determination module, It is used to determine the parking space recognition redundant area and the parking space corner regression redundant area in the sample image according to the central position; the generating module is used to mark the sample image according to the parking space identification redundant area , generating a class true value heat map of the sample image, and marking the sample image according to the regression redundant area of the parking corner point, generating a regression true value heat map of the parking space corner point of the sample image; training A module, configured to train a parking space detection model based on the sample image, the category true value thermodynamic map and the regression true value thermodynamic map, to generate a target parking space detection model.
  • the embodiment of the fourth aspect of the present disclosure proposes a parking space detection device, including: a first acquisition module, configured to acquire a target image to be predicted, wherein the target image includes at least one parking space to be tested; a second acquisition module, It is used to input the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes the parking space category
  • the identified redundant area, the regression target heat map includes a redundant area for the regression of parking space corners, and the target parking space detection model is generated by training the parking space detection model training device described in the embodiment of the third aspect above;
  • the determination module is configured to determine the target parking space category and the target parking space corner of the parking space to be tested based on the redundant area used for identifying the parking space category and the redundant area used for the regression of the parking space corner point.
  • the embodiment of the fifth aspect of the present disclosure provides an automatic driving vehicle, which includes the above-mentioned parking space detection model training device or parking space detection device.
  • the embodiment of the sixth aspect of the present disclosure provides an electronic device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores information that can be used by the Instructions executed by at least one processor, the instructions are executed by the at least one processor to implement the parking space detection model training method described in the embodiment of the first aspect of the present disclosure and the parking space detection method described in the embodiment of the second aspect .
  • the embodiment of the seventh aspect of the present disclosure provides a non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to realize the parking space described in the embodiment of the first aspect of the present disclosure
  • the detection model training method and the parking space detection method described in the embodiment of the second aspect are used to realize the parking space described in the embodiment of the first aspect of the present disclosure.
  • the embodiment of the eighth aspect of the present disclosure proposes a computer program product, including a computer program, when the computer program is executed by a processor, it implements the parking space detection model training method described in the embodiment of the first aspect of the present disclosure And the parking space detection method described in the embodiment of the second aspect.
  • the embodiment of the ninth aspect of the present disclosure provides a computer program, the computer program includes computer program code, and when the computer program code is run on the computer, the computer executes the computer program according to the first aspect of the present disclosure.
  • Fig. 1 is a schematic diagram of a method for training a parking space detection model according to an embodiment of the present disclosure.
  • Fig. 2 is a schematic diagram of obtaining the central position of a sample parking space from a sample image according to an embodiment of the present disclosure.
  • Fig. 3 is a schematic diagram of a process of generating a category truth heat map of a sample image according to an embodiment of the present disclosure.
  • Fig. 4 is a schematic diagram of a process of generating a regression truth heat map according to an embodiment of the present disclosure.
  • Fig. 5 is a schematic diagram of training a parking space detection model to generate a target parking space detection model according to an embodiment of the present disclosure.
  • Fig. 6 is a schematic diagram of an algorithm of a parking space detection model according to an embodiment of the present disclosure.
  • Fig. 7 is a schematic diagram of a parking space detection model training method according to an embodiment of the present disclosure.
  • Fig. 8 is a schematic diagram of a parking space detection method according to an embodiment of the present disclosure.
  • Fig. 9(a) is a schematic diagram of a thermal map of a target center position according to an embodiment of the present disclosure.
  • Fig. 9(b) is a schematic diagram of a category target heat map shown in an embodiment of the present disclosure.
  • Fig. 9(c) is a schematic diagram of a regression target heat map shown in an embodiment of the present disclosure.
  • Fig. 10 is a schematic diagram of determining a target parking space category and a target parking space corner point of a parking space to be tested according to an embodiment of the present disclosure.
  • Fig. 11 is a schematic diagram of a parking space detection method according to an embodiment of the present disclosure.
  • Fig. 12 is a schematic diagram of a parking space detection model training device according to an embodiment of the present disclosure.
  • Fig. 13 is a schematic diagram of a parking space detection device according to an embodiment of the present disclosure.
  • Fig. 14 is a schematic diagram of an electronic device according to an embodiment of the present disclosure.
  • FIG. 1 is an implementation of a parking space detection model training method proposed in the present disclosure. As shown in FIG. 1 , the parking space detection model training method includes the following steps: S101-S104.
  • a sample image corresponding to the sample parking space collected by the image acquisition device is acquired.
  • the sample images may be images collected by a large number of vehicles during parking, or images of vehicles during parking may be collected by cameras preset near the parking spaces.
  • 4 corner points of the sample parking space are identified from the sample image, and the center position of the sample parking space is obtained according to the coordinates of the 4 corner points.
  • the center position of the sample parking space when the center position of the sample parking space is obtained according to the coordinates of the four corner points, it can be directly marked manually, or it can be input into an algorithm to identify and directly output the center position of the sample parking space.
  • the center of the sample parking space can be taken as the center, and the area within a certain range of the center can be selected as the parking space recognition redundancy area.
  • the center position of the sample parking space can be taken as the center, and the area within a certain range of the center can be selected as the parking space angle Point regression redundant area.
  • S103 mark the sample image according to the redundant area of the parking space recognition, generate the category truth heat map of the sample image, and return the redundant area according to the corner point of the parking space, mark the sample image, and generate the regression of the parking space corner point of the sample image True heat map.
  • the parking space recognition redundant area based on the category of the sample parking space, the parking space recognition redundant area and the non-parking space recognition redundant area on the sample image are respectively marked, and the class true value heat map is generated according to the marked image.
  • the pixel points in the parking space recognition redundant area of the sample parking space in the sampling sample image are assigned as the class value of the sample parking space, and other positions are assigned as 0.
  • the category of the sample parking spaces may include horizontal parking spaces, longitudinal parking spaces, oblique parking spaces, etc., different category values may be assigned to different parking spaces, for example, the category value of the horizontal parking spaces may be assigned a value of 1, and Assign a value of 2 to the category value of the vertical parking space.
  • each sample parking space corresponds to four corner points, and each corner point has two dimensions of x and y, so the sample parking space corner regression has a total of 8 dimensions, according to the parking space corner regression redundant area, based on the sample parking space corner point
  • the multiple dimensions of the sample image are marked, and the regression true value heat map of the corner points of the parking space of the sample image is generated.
  • the training end condition may be: when the parking space detection model is trained for a preset number of times, the training ends. For example, if the preset number of training times is 1000 times, the training will end after the parking space detection model has been trained 1000 times.
  • the training end condition may be: when the error of the trained parking space detection model converges to a preset error value, the training ends.
  • the training end condition may be: when the loss function of the trained parking space detection model decreases to a preset loss value, the training ends.
  • the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, so that the target parking space detection model can be used for the parking space.
  • Accurate recognition so that the center position, parking space category and corner position of the parking space can be detected more accurately.
  • FIG. 2 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in Figure 2, obtaining the center position of the sample parking space from the sample image includes the following steps: S201 -S203.
  • the sample vehicle may collect multiple images during parking, or a preset multiple camera may collect multiple images during parking.
  • the multi-channel fisheye image data during the entire parking process can be collected by the sample vehicle, and the image stitching algorithm is used to stitch the multi-channel fisheye image data into a surround-view mosaic, and the surround-view mosaic is used as a sample image .
  • the corner points of the sample parking spaces can be marked manually, or can be marked by using a target detection algorithm to detect the parking spaces.
  • the sample image obtained above is down-sampled to obtain a down-sampled sample image.
  • the sample image obtained above may be down-sampled by 4 times to obtain a down-sampled sample image, and the size of the sample image is (w/4, h/4).
  • the sample image is down-sampled, the receptive field is increased, and the calculation amount for obtaining the true value of the center position in the sample parking space is reduced.
  • FIG. 3 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. 3 , the generation process of the category true value heat map of the sample image includes the following steps: S301-S303.
  • the obtained sample image is down-sampled to obtain a down-sampled sample image.
  • the sample image obtained above may be down-sampled by 4 times to obtain a down-sampled sample image, and the size of the sample image is (w/4, h/4).
  • a first redundant offset for parking space identification is preset.
  • the first redundancy offset includes ⁇ X1 and ⁇ Y1.
  • ⁇ X1 and ⁇ Y1 are within the range of [0, min(Wspot, Hspot)/2].
  • the values of the pixel points in the redundant parking space recognition area and the first remaining area except the redundant parking space identification area are respectively marked.
  • the pixel points in the parking space identification redundant area of the sample parking space are assigned as the class value of the sample parking space, and other positions are assigned as 0.
  • the category of the sample parking spaces may include horizontal parking spaces, longitudinal parking spaces, oblique parking spaces, etc., different category values may be assigned to different parking spaces, for example, the category value of the horizontal parking spaces may be assigned a value of 1, and Assign a value of 2 to the category value of the vertical parking space. According to the labeled image, generate a heat map of the true value of the category.
  • the embodiment of the present disclosure sets a first redundant offset for parking space identification, sets a redundant area for parking space identification based on the first redundant offset, and generates a class true value heat map to carry out the parking space detection model Training prevents the generated target parking space detection model from being inaccurate in recognizing the parking space category due to corner jumps, and increases the accuracy and robustness of the subsequent prediction of the parking space category.
  • FIG. 4 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. 4 , the generation process of the regression true value heat map includes the following steps: S401-S403.
  • the obtained sample image is down-sampled to obtain a down-sampled sample image.
  • the sample image obtained above may be down-sampled by 4 times to obtain a down-sampled sample image, and the size of the sample image is (w/4, h/4).
  • a second redundant offset for the regression of the corner point of the parking space is preset.
  • the second redundancy offset includes ⁇ X2 and ⁇ Y2.
  • ⁇ X2 and ⁇ Y2 are within the range of [0, min(Wspot, Hspot)/2].
  • each sample parking space corresponds to four corner points, and each corner point has two dimensions of x and y, so the sample parking space corner regression has a total of 8 dimensions.
  • the parking space corner point The regression redundant area is marked as the regression label map of the parking space corner point, and the regression label map of the parking space corner point is processed to obtain the regression true value heat map of the parking space corner point.
  • the embodiment of the present disclosure sets a second redundant offset for the regression of the corner of the parking space, obtains the redundant area of the corner of the parking space based on the second redundant offset, and generates the regression truth of the corner of the parking space
  • the value heat map trains the parking space detection model, which prevents the generated target parking space detection model from inaccurate recognition of parking space corners due to corner jumps, and increases the accuracy and robustness of the subsequent prediction of parking space corners.
  • the corner point regression values from the corner point to the parking space corner point regression redundant area in multiple dimensions are obtained.
  • the corner point regression value can be understood as the distance value from the corner point coordinates in the two dimensions corresponding to each corner point to each pixel point in the corner point regression redundancy area of the parking space.
  • FIG. 5 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. 5 , based on the sample image, the category true value heat map and the regression true value heat map, The training of the parking space detection model to generate the target parking space detection model includes the following steps: S501-S504.
  • Figure 6 is a schematic diagram of the algorithm of the parking space detection model. As shown in Figure 6, in order to minimize the calculation amount of the model under the premise of ensuring the detection accuracy, the sample image is input into the parking space detection model for multi-scale feature extraction to obtain multi-scale feature maps.
  • the feature extraction algorithm may use a convolutional neural network (Convolutional Neural Networks, CNN).
  • the feature maps of the target scale in the multi-scale feature map are fused to obtain the fused feature map.
  • the fused feature map is processed by the center position detection head to complete the center position prediction and obtain the predicted center position.
  • the fused feature map is processed by the category detection head to complete the category prediction, and the category prediction heat map is obtained.
  • the fused feature map is processed by the corner regression detection head to complete the regression prediction, and the regression prediction heat map is obtained.
  • the center position detection head, the category detection head and the corner point regression detection head may be a network composed of convolutional layers.
  • CNN is used for feature extraction on the sample image, and the extracted feature map is subjected to 2 times downsampling, and CNN feature extraction is performed on the 2 times downsampled feature map, and so on. Then perform 4 times downsampling, 8 times downsampling and 16 times downsampling on the feature map in turn, and perform CNN feature extraction on the feature map after each sampling, and obtain 4 times downsampling feature map and 8 times downsampling feature map respectively and 16x downsampled feature maps.
  • the 16-fold downsampled feature map is processed, and the processed image is 16-fold up-sampled, and the 16-fold up-sampled image is fused with the 16-fold downsampled feature map before CNN Feature extraction to obtain the first fused image.
  • the first fused image is upsampled by 8 times, fused with the 8 times downsampled feature map, and then CNN feature extraction is performed to obtain the final fused feature map.
  • the loss function of the parking space detection model is determined based on the predicted center position and the center position, the class prediction heat map and the class true value heat map, and the regression prediction heat map and the regression true value heat map. Among them, the loss function of the parking space detection model is expressed as:
  • Loss Loss_pos+Loss_kps-reg+Loss_cls
  • Loss_pos is the position error between the predicted center position and the center position
  • Loss_kps-reg is the regression error between the regression prediction heat map and the regression true value heat map
  • Loss_cls is the difference between the class prediction heat map and the class true value heat map classification error.
  • the calculation of the position error between the predicted center position and the center position can use the cross entropy loss function (Cross entropy loss); the calculation of the regression error of the regression prediction heat map and the regression true value heat map can use the regression loss function (Smooth L1 Loss) ; To calculate the classification error between the category prediction heatmap and the category true value heatmap, the cross entropy loss function (Cross entropy loss) can be used.
  • the parking space detection model is adjusted and the training continues until the loss function reaches the preset standard, then the training ends and the target parking space detection model is generated.
  • the target parking space detection model obtained by setting redundant area training in the embodiment of the present disclosure reduces the randomness when outputting corner points, and obtains rich semantic information of the sample image through multi-scale feature extraction of the sample image, which improves the accuracy of output.
  • Fig. 7 is an implementation manner of a parking space detection model training method proposed in the present disclosure. As shown in Fig. 7, the parking space detection model training method includes the following steps: S701-S715.
  • S710 Based on the multiple dimensions of the sample parking corner points, mark the regression redundant area of the parking space corner points as a parking space corner point regression labeling map, and process the parking space corner point regression labeling map to obtain the regression true value of the parking space corner points heat map.
  • the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, and can detect the center of the parking space more accurately. location, bay category, and corner location.
  • Fig. 8 is an embodiment of a parking space detection method proposed in the present disclosure.
  • the target parking space detection model of the parking space detection method can be obtained by training the parking space detection model training method of any embodiment of the above-mentioned Fig. 1-Fig. 7, as shown in Fig. 8
  • the parking space detection method includes the following steps: S801-S803.
  • An image to be predicted is acquired, and the image to be predicted is used as a target image, wherein the target image includes at least one parking space to be tested.
  • the target image may be an image collected by the vehicle under test during parking, or may be an image collected by a preset camera during parking.
  • the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes a redundant area for identifying the category of the parking space,
  • the regression target heatmap includes redundant regions for the parking corner regression.
  • Fig. 9(a) is a schematic diagram of the heat map of the target center position of the embodiment of the present disclosure
  • Fig. 9(b) is a schematic diagram of the heat map of the category target of the embodiment of the present disclosure
  • Fig. 9(c) is a schematic diagram of the heat map of the embodiment of the present disclosure Schematic representation of the regression target heatmap.
  • the category target heat map includes a redundant area for the recognition of the parking space category
  • the regression target heat map includes a redundant area for the parking space corner regression .
  • CNN is used for feature extraction on the target image, and the extracted feature map is subjected to 2 times downsampling, and CNN feature extraction is performed on the 2 times downsampled feature map, and so on. , and then sequentially perform 4 times downsampling, 8 times downsampling and 16 times downsampling on the feature map, and perform CNN feature extraction on the feature map after each sampling, and obtain 4 times downsampling feature map and 8 times downsampling feature respectively Figure and 16x downsampling feature map.
  • the 16-fold downsampled feature map is processed, and the processed image is 16-fold up-sampled, and the 16-fold up-sampled image is fused with the 16-fold downsampled feature map before CNN Feature extraction to obtain the first fused image.
  • the first fused image is upsampled by 8 times, fused with the 8 times downsampled feature map, and then CNN feature extraction is performed to obtain the final fused feature map.
  • the target parking space corner point of the parking space to be tested is determined.
  • the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, preventing the randomness of outputting a single result. It can more accurately detect the center position of the parking space, the category of the parking space and the corner position.
  • a non-maximum suppression (Non-Maximum Suppression, NMS) algorithm is performed on the center position target heat map of the parking space to be tested output by the target parking space detection model, and the processed center position target heat map is greater than The pixel point of the preset threshold is used as the target center position of the parking space to be tested.
  • NMS Non-Maximum Suppression
  • FIG. 10 is an implementation of a parking space detection method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. The redundant area is to determine the target parking space category and the target parking space corner of the parking space to be tested, including the following steps: S1001-S1002.
  • the coordinates of multiple pixel points in the redundant area used for identifying the parking space category are averaged to obtain a target pixel point.
  • the target parking space category of the parking space to be tested is determined. For example, if the parking space category value of the target pixel is 1, and the parking space category corresponding to 1 is a horizontal parking space, then the target parking space category of the parking space to be tested is a horizontal parking space.
  • a plurality of parking space categories corresponding to a plurality of pixels in the redundant area for identification of the parking space category is determined, voting is performed on the multiple parking space categories, and a target parking space category of the parking space to be tested is determined. For example, if there are 10 pixels in the redundant area for recognition of the parking space category, among them, the parking space category value of 9 pixels is 1, and the corresponding parking space category of 1 is a horizontal parking space; the parking space category of 1 pixel point The value is 2, and the parking space category corresponding to 2 is a longitudinal parking space, then vote for these 10 parking space categories. Since 90% of the parking spaces indicate that the category of the parking spaces is a horizontal parking space, it is determined that the target parking space category of the parking spaces to be tested is a horizontal parking space.
  • the true value heat map obtains multiple corner regression values from the corner point to the parking space corner regression redundant area in multiple dimensions.
  • Multiple pixel points in the redundant area used for parking corner regression are calculated based on multiple corner regression values corresponding to the redundant area used for parking corner regression, and four corner points of the parking space to be tested are obtained.
  • the multiple corner regression values corresponding to the redundant area used for the parking space corner regression are 21, 22, and 23 pixels respectively, then for the parking space corner
  • Each pixel in the redundant area of point regression is offset by 21, 22, and 23 pixels in both X and Y dimensions, and multiple predicted corners corresponding to the first corner of the parking space to be tested are respectively obtained coordinates, and average the coordinates of multiple predicted corner points to determine the corner point of the target parking space corresponding to any corner point of the parking space to be tested.
  • the target parking space detection model of the present disclosure will consider the redundant area when identifying the parking space, based on the average pixel point or the way of voting on the parking space category corresponding to multiple pixel points, which reduces the randomness of a single result, reduces the error, and can More accurately detect the center position of the parking space, the category of the parking space and the corner position.
  • Fig. 11 is an embodiment of a parking space detection method proposed in the present disclosure.
  • the target parking space detection model of the parking space detection method can be obtained by training the parking space detection model training method of any one of the above-mentioned Fig. 1-Fig. 7 embodiments, as shown in Fig. 11
  • the parking space detection method includes the following steps: S1101-S1107.
  • the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes a redundant area for identifying the category of the parking space,
  • the regression target heatmap includes redundant regions for the parking corner regression.
  • S1106. Determine a plurality of predicted corner coordinates corresponding to any corner of the parking space to be tested based on the regression values of the plurality of corners.
  • the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, preventing the randomness of outputting a single result. It can more accurately detect the center position of the parking space, the category of the parking space and the corner position.
  • Figure 12 is a parking space detection model training device proposed in the present disclosure.
  • the parking space detection model training device 1200 includes an acquisition module 1201, a determination module 1202, a generation module 1203 and a training module 1204, wherein:
  • the obtaining module 1201 is configured to obtain a sample image of a sample parking space, and obtain a center position of the sample parking space from the sample image.
  • the determination module 1202 is configured to determine the redundant area of parking space recognition and the redundant area of parking space corner regression in the sample image according to the center position.
  • the generation module 1203 is used to identify the redundant area according to the parking space, mark the sample image, generate the category truth heat map of the sample image, and return the redundant area according to the corner point of the parking space, mark the sample image, and generate the parking space of the sample image Regression ground-truth heatmap of corner points.
  • the training module 1204 is configured to train the parking space detection model based on the sample image, the category true value heat map and the regression true value heat map to generate a target parking space detection model.
  • the acquisition module 1201 is also used to: identify the corner points of the sample parking space from the sample image and mark them; down-sample the sample image to obtain a sample sample image; determine the sample in the sample sample image based on the coordinates of the corner points The central location of the parking lot.
  • the determination module 1202 is also used to: determine the first redundant offset for parking space recognition; expand the sample image with the center position as the center and the first redundant offset as the pixel offset radius, Generate the parking space recognition redundant area in the sample image; based on the category of the sample parking space, mark the values of the pixels in the parking space recognition redundant area and the first remaining area except the parking space recognition redundant area to generate the category True value heat map.
  • the determination module 1202 is also used to: determine the second redundant offset of the parking space corner regression; take the center position as the center, and use the second redundant offset as the pixel offset radius to perform the calculation on the sample image Expand and generate the parking space corner regression redundant area in the sample image; based on the multiple dimensions of the sample parking space corner point, mark the parking space corner regression redundant area as a parking space corner regression labeling map; mark the parking space corner regression The map is processed to obtain the regression true value heat map of the corner point of the parking space.
  • the generation module 1203 is also used for: based on the heat map of the true regression value of the corner of the parking space, obtain the regression value of the corner point in multiple dimensions from the corner point to the regression redundant area of the corner point of the parking space.
  • the training module 1204 is also used to: input the sample image into the parking space detection model for multi-scale feature extraction; perform center position prediction, category prediction and regression prediction based on the extracted multi-scale feature map, so as to obtain the prediction center position, category Prediction heat map, regression prediction heat map; based on the prediction center position and center position, category prediction heat map and category true value heat map, and regression prediction heat map and regression true value heat map, determine the loss function of the parking space detection model; based on loss The function adjusts the parking space detection model and continues training until the training ends to generate the target parking space detection model.
  • the training module 1204 is also used to: fuse the feature maps of the target scale in the multi-scale feature map to obtain the fused feature map; predict the center position based on the fused feature map to obtain the predicted center position;
  • the category prediction is performed on the map, and the category prediction heat map is obtained; the regression prediction is performed based on the fused feature map, and the regression prediction heat map is obtained.
  • FIG. 13 is a parking space detection device proposed in the present disclosure, which operates on the basis of the above-mentioned parking space detection model training device 1200.
  • the parking space detection device 1300 includes a first acquisition module 1301 and a second acquisition module 1302 and determination module 1303, wherein:
  • the first acquiring module 1301 is configured to acquire a target image to be predicted, wherein the target image includes at least one parking space to be tested.
  • the second acquisition module 1302 is used to input the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes the parking space category The identified redundant area, the regression target heat map includes the redundant area used for the parking corner regression.
  • the determining module 1303 is configured to determine a target parking space category and a target parking space corner point of the parking space to be tested based on the redundant area used for identifying the parking space category and the redundant area used for the parking space corner regression.
  • the second acquisition module 1302 is also used to: process the central position target thermal map output by the target parking space detection model; use the processed pixel points in the central position target thermal map greater than the preset threshold as the parking spaces to be tested Target center position.
  • the determining module 1303 is also used to: average a plurality of pixels in the redundant area used for identifying the parking space category to obtain the target pixel, and based on the target pixel, determine the target parking space category of the parking space to be tested or determine the parking space category Multiple parking space categories corresponding to multiple pixels in the identified redundant area; vote for multiple parking space categories to determine the target parking space category of the parking space to be tested.
  • the determination module 1303 is also used to: determine a plurality of corner regression values corresponding to the redundant area used for the corner regression of the parking space; coordinates of a predicted corner point; the coordinates of multiple predicted corner points are averaged to determine the corner point of the target parking space corresponding to any corner point of the parking space to be measured.
  • an automatic driving vehicle which includes the above-mentioned parking space detection model training device 1200 or the parking space detection device 1300 .
  • an embodiment of the present disclosure also proposes an electronic device 1400.
  • the instructions executed by the processor are executed by at least one processor 1401 to implement the parking space detection model training method and the parking space detection method as shown in the above embodiments.
  • the embodiments of the present disclosure also propose a non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to enable the computer to implement the parking space detection model training method and the parking space as shown in the above-mentioned embodiments. Detection method.
  • the embodiments of the present disclosure further provide a computer program product, including a computer program, which implements the parking space detection model training method and the parking space detection method as shown in the above embodiments when the computer program is executed by a processor.
  • the embodiment of the present disclosure also proposes a computer program, wherein the computer program includes computer program code, when the computer program code is run on the computer, it makes the computer execute the parking space detection model shown in the above-mentioned embodiment Training method and parking space detection method.
  • first and second are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, a feature defined as “first” and “second” may explicitly or implicitly include one or more of these features.
  • “plurality” means two or more, unless otherwise specifically defined.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)
  • Traffic Control Systems (AREA)

Abstract

Provided are a parking space detection model training method and apparatus, and a parking space detection method and apparatus. The parking space detection model training method comprises: obtaining a sample image of a sample parking space, and obtaining a center position of the sample parking space from the sample image; determining a parking space recognition redundant area and a parking space corner point regression redundant area in the sample image according to the center position; marking the sample image according to the parking space recognition redundant area to generate a category true value heat map, and marking the sample image according to the parking space corner point regression redundant area to generate a regression true value heat map of parking space corner points; and training a parking space detection model on the basis of the sample image, the category true value heat map, and the regression true value heat map to generate a target parking space detection model.

Description

车位检测模型训练方法、车位检测方法及其装置Parking space detection model training method, parking space detection method and device thereof
相关申请的交叉引用Cross References to Related Applications
本申请基于申请号为202111571230.X、申请日为2021年12月21日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。This application is based on a Chinese patent application with application number 202111571230.X and a filing date of December 21, 2021, and claims the priority of this Chinese patent application. The entire content of this Chinese patent application is hereby incorporated by reference into this application.
技术领域technical field
本公开涉及自动驾驶技术领域,具体涉及一种车位检测模型训练方法、车位检测方法及其装置。The present disclosure relates to the technical field of automatic driving, in particular to a parking space detection model training method, a parking space detection method and a device thereof.
背景技术Background technique
相关技术中,用户对辅助泊车和自动泊车有着迫切需求,除了在道路上行驶以外,车辆还需驶入相应的停车位,这就需要对车辆附近的车位有精确地识别,才能规划好泊车路线,安全准确地泊车。但现有的计算车位角点的方式,每个车位只输出一组车位角点,有较强的随机性,且不同帧角点位置易跳动。In related technologies, users have an urgent need for assisted parking and automatic parking. In addition to driving on the road, vehicles also need to drive into corresponding parking spaces. This requires accurate identification of parking spaces near the vehicle in order to plan well. Parking route, park safely and accurately. However, in the existing way of calculating corner points of parking spaces, each parking space only outputs a set of corner points of parking spaces, which has strong randomness, and the corner positions of different frames are easy to jump.
发明内容Contents of the invention
本公开的第一方面实施例提出了一种车位检测模型训练方法,通过获取样本车位的样本图像,并从所述样本图像中获取所述样本车位的中心位置;根据所述中心位置,确定所述样本图像中的车位识别冗余区和车位角点回归冗余区;根据所述车位识别冗余区,对所述样本图像进行标记,生成所述样本图像的类别真值热力图,以及根据所述车位角点回归冗余区,对所述样本图像进行标记,生成所述样本图像的车位角点的回归真值热力图;基于所述样本图像、所述类别真值热力图和所述回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型。The embodiment of the first aspect of the present disclosure proposes a parking space detection model training method, by obtaining a sample image of a sample parking space, and obtaining the center position of the sample parking space from the sample image; according to the center position, determine the The parking space recognition redundant area and the parking space corner regression redundant area in the sample image; according to the parking space identification redundant area, the sample image is marked to generate the category true value heat map of the sample image, and according to The parking space corner returns to a redundant area, and the sample image is marked to generate a regression true value thermodynamic map of the parking space corner of the sample image; based on the sample image, the category true value thermodynamic map and the Regression to the true value heat map, training the parking space detection model, and generating the target parking space detection model.
本公开在训练过程中,考虑到了车位类别识别和车位角点回归计算时的冗余,使得训练好的目标车位检测模型在车位识别时会考虑冗余区,能够更准确地检测出车位的中心位置、车位类别和角点位置。In the training process of the present disclosure, the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, and can detect the center of the parking space more accurately. location, bay category, and corner location.
本公开第二方面实施例提出了一种车位检测方法,包括:获取待预测的目标图像,其中,所述目标图像中至少包括一个待测车位;将所述目标图像输入目标车位检测模型中,以获取所述待测车位的目标中心位置、类别目标热力图和回归目标热力图,其中,所述类别目标热力图中包括用于车位类别识别的冗余区,所述回归目标热力图中包括用于车位角点回归的冗余区,所述目标车位检测模型由上述第一方面实施例中所述的车位检测模型训练方法训练生成;基于所述用于车位类别识别的冗余区和所述用于车位角点回归的冗余区,确定所述待测车位的目标车位类别和目标车位角点。The embodiment of the second aspect of the present disclosure proposes a parking space detection method, including: acquiring a target image to be predicted, wherein the target image includes at least one parking space to be tested; inputting the target image into a target parking space detection model, To obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes a redundant area for identifying the category of the parking space, and the regression target heat map includes The redundant area used for the regression of the corner point of the parking space, the target parking space detection model is generated by training the parking space detection model training method described in the embodiment of the first aspect above; The redundant area used for the regression of the parking space corners is described, and the target parking space category and the target parking space corner points of the parking spaces to be tested are determined.
本公开第三方面实施例提出了一种车位检测模型训练装置,包括:获取模块,用于获 取样本车位的样本图像,并从所述样本图像中获取所述样本车位的中心位置;确定模块,用于根据所述中心位置,确定所述样本图像中的车位识别冗余区和车位角点回归冗余区;生成模块,用于根据所述车位识别冗余区,对所述样本图像进行标记,生成所述样本图像的类别真值热力图,以及根据所述车位角点回归冗余区,对所述样本图像进行标记,生成所述样本图像的车位角点的回归真值热力图;训练模块,用于基于所述样本图像、所述类别真值热力图和所述回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型。The embodiment of the third aspect of the present disclosure proposes a parking space detection model training device, including: an acquisition module, configured to acquire a sample image of a sample parking space, and obtain the center position of the sample parking space from the sample image; a determination module, It is used to determine the parking space recognition redundant area and the parking space corner regression redundant area in the sample image according to the central position; the generating module is used to mark the sample image according to the parking space identification redundant area , generating a class true value heat map of the sample image, and marking the sample image according to the regression redundant area of the parking corner point, generating a regression true value heat map of the parking space corner point of the sample image; training A module, configured to train a parking space detection model based on the sample image, the category true value thermodynamic map and the regression true value thermodynamic map, to generate a target parking space detection model.
本公开第四方面实施例提出了一种车位检测装置,包括:第一获取模块,用于获取待预测的目标图像,其中,所述目标图像中至少包括一个待测车位;第二获取模块,用于将所述目标图像输入目标车位检测模型中,以获取所述待测车位的目标中心位置、类别目标热力图和回归目标热力图,其中,所述类别目标热力图中包括用于车位类别识别的冗余区,所述回归目标热力图中包括用于车位角点回归的冗余区,所述目标车位检测模型由上述第三方面实施例中所述的车位检测模型训练装置训练生成;确定模块,用于基于所述用于车位类别识别的冗余区和所述用于车位角点回归的冗余区,确定所述待测车位的目标车位类别和目标车位角点。The embodiment of the fourth aspect of the present disclosure proposes a parking space detection device, including: a first acquisition module, configured to acquire a target image to be predicted, wherein the target image includes at least one parking space to be tested; a second acquisition module, It is used to input the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes the parking space category The identified redundant area, the regression target heat map includes a redundant area for the regression of parking space corners, and the target parking space detection model is generated by training the parking space detection model training device described in the embodiment of the third aspect above; The determination module is configured to determine the target parking space category and the target parking space corner of the parking space to be tested based on the redundant area used for identifying the parking space category and the redundant area used for the regression of the parking space corner point.
本公开第五方面实施例提出了一种自动驾驶车辆,该自动驾驶车辆包括上述车位检测模型训练装置或车位检测装置。The embodiment of the fifth aspect of the present disclosure provides an automatic driving vehicle, which includes the above-mentioned parking space detection model training device or parking space detection device.
为达上述目的,本公开第六方面实施例提出了一种电子设备,包括:至少一个处理器;以及与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以实现如本公开第一方面实施例所述的车位检测模型训练方法和第二方面实施例所述的车位检测方法。To achieve the above purpose, the embodiment of the sixth aspect of the present disclosure provides an electronic device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores information that can be used by the Instructions executed by at least one processor, the instructions are executed by the at least one processor to implement the parking space detection model training method described in the embodiment of the first aspect of the present disclosure and the parking space detection method described in the embodiment of the second aspect .
为达上述目的,本公开第七方面实施例提出了一种存储有计算机指令的非瞬时计算机可读存储介质,其中,所述计算机指令用于实现如本公开第一方面实施例所述的车位检测模型训练方法和第二方面实施例所述的车位检测方法。To achieve the above purpose, the embodiment of the seventh aspect of the present disclosure provides a non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to realize the parking space described in the embodiment of the first aspect of the present disclosure The detection model training method and the parking space detection method described in the embodiment of the second aspect.
为达上述目的,本公开第八方面实施例提出了一种计算机程序产品,包括计算机程序,所述计算机程序在被处理器执行时实现本公开第一方面实施例所述的车位检测模型训练方法和第二方面实施例所述的车位检测方法。To achieve the above purpose, the embodiment of the eighth aspect of the present disclosure proposes a computer program product, including a computer program, when the computer program is executed by a processor, it implements the parking space detection model training method described in the embodiment of the first aspect of the present disclosure And the parking space detection method described in the embodiment of the second aspect.
为达上述目的,本公开第九方面实施例提供了一种计算机程序,所述计算机程序包括计算机程序代码,当所述计算机程序代码在计算机上运行时,使得计算机执行如本公开第一方面实施例所述的车位检测模型训练方法和第二方面实施例所述的车位检测方法。To achieve the above purpose, the embodiment of the ninth aspect of the present disclosure provides a computer program, the computer program includes computer program code, and when the computer program code is run on the computer, the computer executes the computer program according to the first aspect of the present disclosure. The parking space detection model training method described in the example and the parking space detection method described in the embodiment of the second aspect.
附图说明Description of drawings
图1是本公开一个实施例示出的一种车位检测模型训练方法的示意图。Fig. 1 is a schematic diagram of a method for training a parking space detection model according to an embodiment of the present disclosure.
图2是本公开一个实施例示出的从样本图像中获取样本车位的中心位置的示意图。Fig. 2 is a schematic diagram of obtaining the central position of a sample parking space from a sample image according to an embodiment of the present disclosure.
图3是本公开一个实施例示出的样本图像的类别真值热力图的生成过程的示意图。Fig. 3 is a schematic diagram of a process of generating a category truth heat map of a sample image according to an embodiment of the present disclosure.
图4是本公开一个实施例示出的回归真值热力图的生成过程的示意图。Fig. 4 is a schematic diagram of a process of generating a regression truth heat map according to an embodiment of the present disclosure.
图5是本公开一个实施例示出的对车位检测模型进行训练,生成目标车位检测模型的 示意图。Fig. 5 is a schematic diagram of training a parking space detection model to generate a target parking space detection model according to an embodiment of the present disclosure.
图6是本公开一个实施例示出的车位检测模型的算法示意图。Fig. 6 is a schematic diagram of an algorithm of a parking space detection model according to an embodiment of the present disclosure.
图7是本公开一个实施例示出的一种车位检测模型训练方法的示意图。Fig. 7 is a schematic diagram of a parking space detection model training method according to an embodiment of the present disclosure.
图8是本公开一个实施例示出的一种车位检测方法的示意图。Fig. 8 is a schematic diagram of a parking space detection method according to an embodiment of the present disclosure.
图9(a)是本公开一个实施例示出的目标中心位置热力图的示意图。Fig. 9(a) is a schematic diagram of a thermal map of a target center position according to an embodiment of the present disclosure.
图9(b)是本公开一个实施例示出的类别目标热力图的示意图。Fig. 9(b) is a schematic diagram of a category target heat map shown in an embodiment of the present disclosure.
图9(c)是本公开一个实施例示出的回归目标热力图的示意图。Fig. 9(c) is a schematic diagram of a regression target heat map shown in an embodiment of the present disclosure.
图10是本公开一个实施例示出的确定待测车位的目标车位类别和目标车位角点的示意图。Fig. 10 is a schematic diagram of determining a target parking space category and a target parking space corner point of a parking space to be tested according to an embodiment of the present disclosure.
图11是本公开一个实施例示出的一种车位检测方法的示意图。Fig. 11 is a schematic diagram of a parking space detection method according to an embodiment of the present disclosure.
图12是本公开一个实施例示出的一种车位检测模型训练装置的示意图。Fig. 12 is a schematic diagram of a parking space detection model training device according to an embodiment of the present disclosure.
图13是本公开一个实施例示出的一种车位检测装置的示意图。Fig. 13 is a schematic diagram of a parking space detection device according to an embodiment of the present disclosure.
图14是本公开一个实施例示出的一种电子设备的示意图。Fig. 14 is a schematic diagram of an electronic device according to an embodiment of the present disclosure.
具体实施方式Detailed ways
下面详细描述本公开的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本公开,而不能理解为对本公开的限制。Embodiments of the present disclosure are described in detail below, examples of which are illustrated in the drawings, in which the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present disclosure and should not be construed as limiting the present disclosure.
图1是本公开提出的一种车位检测模型训练方法的一种实施方式,如图1所示,该车位检测模型训练方法,包括以下步骤:S101-S104。FIG. 1 is an implementation of a parking space detection model training method proposed in the present disclosure. As shown in FIG. 1 , the parking space detection model training method includes the following steps: S101-S104.
S101,获取样本车位的样本图像,并从样本图像中获取样本车位的中心位置。S101. Obtain a sample image of a sample parking space, and obtain a center position of the sample parking space from the sample image.
获取由图像采集装置采集的样本车位对应的样本图像。在一些实施例中,样本图像可为大量车辆在泊车过程中采集的图像,也可由预设在车位附近的摄像机采集车辆在泊车过程中的图像。A sample image corresponding to the sample parking space collected by the image acquisition device is acquired. In some embodiments, the sample images may be images collected by a large number of vehicles during parking, or images of vehicles during parking may be collected by cameras preset near the parking spaces.
在一些实施例中,从样本图像中识别样本车位的4个角点,根据4个角点的坐标得到样本车位的中心位置。In some embodiments, 4 corner points of the sample parking space are identified from the sample image, and the center position of the sample parking space is obtained according to the coordinates of the 4 corner points.
在一些实施例中,根据4个角点的坐标得到样本车位的中心位置时,可直接由人工进行标记,也可输入算法进行识别并直接输出样本车位的中心位置。In some embodiments, when the center position of the sample parking space is obtained according to the coordinates of the four corner points, it can be directly marked manually, or it can be input into an algorithm to identify and directly output the center position of the sample parking space.
S102,根据中心位置,确定样本图像中的车位识别冗余区和车位角点回归冗余区。S102. Determine a parking space recognition redundant area and a parking space corner regression redundant area in the sample image according to the center position.
为了在车位检测模型对车位的类别检测略有误差的情况下仍可以对该车位进行准确分类,可以以样本车位的中心位置为中心,选取该中心一定范围内的区域为车位识别冗余区。In order to accurately classify the parking space even if the parking space detection model has a slight error in the category detection of the parking space, the center of the sample parking space can be taken as the center, and the area within a certain range of the center can be selected as the parking space recognition redundancy area.
为了在车位检测模型对车位的角点检测略有误差的情况下仍可以对该车位的角点进行准确确定,可以以样本车位的中心位置为中心,选取该中心一定范围内的区域为车位角点回归冗余区。In order to accurately determine the corner point of the parking space even when the parking space detection model has a slight error in the corner detection of the parking space, the center position of the sample parking space can be taken as the center, and the area within a certain range of the center can be selected as the parking space angle Point regression redundant area.
S103,根据车位识别冗余区,对样本图像进行标记,生成样本图像的类别真值热力图,以及根据车位角点回归冗余区,对样本图像进行标记,生成样本图像的车位角点的回归真 值热力图。S103, mark the sample image according to the redundant area of the parking space recognition, generate the category truth heat map of the sample image, and return the redundant area according to the corner point of the parking space, mark the sample image, and generate the regression of the parking space corner point of the sample image True heat map.
根据车位识别冗余区,基于样本车位的类别,对样本图像上的车位识别冗余区和非车位识别冗余区分别进行标注,并根据标注后的图像,生成类别真值热力图。在一些实施例中,将采样样本图像中样本车位的车位识别冗余区的像素点赋值为样本车位的类别值,其他位置赋值为0。在一些实施例中,样本车位的类别可包括横向车位、纵向车位、斜向车位等,可对不同的车位类别赋予不同的类别值,比如说,可将横向车位的类别值赋值为1,可对纵向车位的类别值赋值为2。According to the parking space recognition redundant area, based on the category of the sample parking space, the parking space recognition redundant area and the non-parking space recognition redundant area on the sample image are respectively marked, and the class true value heat map is generated according to the marked image. In some embodiments, the pixel points in the parking space recognition redundant area of the sample parking space in the sampling sample image are assigned as the class value of the sample parking space, and other positions are assigned as 0. In some embodiments, the category of the sample parking spaces may include horizontal parking spaces, longitudinal parking spaces, oblique parking spaces, etc., different category values may be assigned to different parking spaces, for example, the category value of the horizontal parking spaces may be assigned a value of 1, and Assign a value of 2 to the category value of the vertical parking space.
由于每个样本车位对应四个角点,每个角点有x、y两个维度表示,所以样本车位角点回归一共有8个维度,根据车位角点回归冗余区,基于样本车位角点的多个维度,对样本图像进行标记,生成样本图像的车位角点的回归真值热力图。Since each sample parking space corresponds to four corner points, and each corner point has two dimensions of x and y, so the sample parking space corner regression has a total of 8 dimensions, according to the parking space corner regression redundant area, based on the sample parking space corner point The multiple dimensions of the sample image are marked, and the regression true value heat map of the corner points of the parking space of the sample image is generated.
S104,基于样本图像、类别真值热力图和回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型。S104, based on the sample image, the class true value heat map and the regression true value heat map, train the parking space detection model to generate a target parking space detection model.
获取车位检测模型输出的样本车位的中心位置热力图、类别热力图和回归真值热力图,根据上述获得的样本图像、类别真值热力图和回归真值热力图,获取车位检测模型输出的热力图与其各自对应的真值热力图之间的差值,并确定各自的差值生成损失函数,根据损失函数对车位检测模型进行训练,直至满足训练结束条件,得到最终的目标车位检测模型。Obtain the central location heat map, category heat map, and regression true value heat map of the sample parking spaces output by the parking space detection model, and obtain the heat output of the parking space detection model based on the sample image, category true value heat map, and regression true value heat map obtained above The difference between the map and its corresponding true value heat map, and determine the respective difference to generate a loss function, and train the parking space detection model according to the loss function until the training end condition is met, and the final target parking space detection model is obtained.
在一些实施例中,训练结束条件可以为:当对车位检测模型训练到预设训练次数后,结束训练。举例说明,若预设训练次数为1000次,则当对车位检测模型训练到1000次后,结束训练。In some embodiments, the training end condition may be: when the parking space detection model is trained for a preset number of times, the training ends. For example, if the preset number of training times is 1000 times, the training will end after the parking space detection model has been trained 1000 times.
在一些实施例中,训练结束条件可以为:当训练到车位检测模型的误差收敛到预设误差值时,结束训练。In some embodiments, the training end condition may be: when the error of the trained parking space detection model converges to a preset error value, the training ends.
在一些实施例中,训练结束条件可以为:当训练到车位检测模型的损失函数降低到预设损失值时,结束训练。In some embodiments, the training end condition may be: when the loss function of the trained parking space detection model decreases to a preset loss value, the training ends.
本公开在训练过程中,考虑到了车位类别识别和车位角点回归计算时的冗余,使得训练好的目标车位检测模型在车位识别时会考虑冗余区,使得目标车位检测模型能对车位进行准确识别,从而能够更准确地检测出车位的中心位置、车位类别和角点位置。In the training process of the present disclosure, the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, so that the target parking space detection model can be used for the parking space. Accurate recognition, so that the center position, parking space category and corner position of the parking space can be detected more accurately.
图2是本公开提出的一种车位检测模型训练方法的一种实施方式,基于上述实施例的基础上,如图2所示,从样本图像中获取样本车位的中心位置,包括以下步骤:S201-S203。Figure 2 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in Figure 2, obtaining the center position of the sample parking space from the sample image includes the following steps: S201 -S203.
S201,从样本图像中识别样本车位的角点,并进行标注。S201. Identify corner points of the sample parking space from the sample image, and mark them.
获取在泊车过程中采集的多路图像,并对多路图像进行拼接,生成拼接后的样本图像,将该样本图像的宽记为w,高记为h,即该该本图像的尺寸可记为(w,h)。在一些实施例中,可由样本车辆在泊车过程中采集多路图像,也可由预设的多路摄像机采集在泊车过程中的多路图像。Obtain the multi-channel images collected during the parking process, and stitch the multi-channel images to generate a stitched sample image, record the width of the sample image as w, and record the height as h, that is, the size of the original image can be Denote as (w, h). In some embodiments, the sample vehicle may collect multiple images during parking, or a preset multiple camera may collect multiple images during parking.
在一些实施例中,可由样本车辆采集在整个泊车过程中的多路鱼眼图像数据,并运用图像拼接算法将多路鱼眼图像数据拼接成环视拼接图,将该环视拼接图作为样本图像。In some embodiments, the multi-channel fisheye image data during the entire parking process can be collected by the sample vehicle, and the image stitching algorithm is used to stitch the multi-channel fisheye image data into a surround-view mosaic, and the surround-view mosaic is used as a sample image .
从样本图像中获取样本车位的4个角点,根据4个角点得到样本车位的中心位置,将 样本车位的中心位置记为(x,y)。在一些实施例中,样本车位的角点可由人工进行标注,也可运用目标检测算法进行车位检测进行标注。Get the 4 corner points of the sample parking space from the sample image, get the center position of the sample parking space according to the 4 corner points, and record the center position of the sample parking space as (x, y). In some embodiments, the corner points of the sample parking spaces can be marked manually, or can be marked by using a target detection algorithm to detect the parking spaces.
S202,对样本图像进行下采样,得到采样样本图像。S202. Down-sampling the sample image to obtain a sampled sample image.
对上述获得的样本图像进行下采样,获取下采样后的采样样本图像。在一些实施例中,可对上述获得的样本图像进行4倍下采样,获取下采样后的采样样本图像,该采样样本图像的尺寸为(w/4,h/4)。The sample image obtained above is down-sampled to obtain a down-sampled sample image. In some embodiments, the sample image obtained above may be down-sampled by 4 times to obtain a down-sampled sample image, and the size of the sample image is (w/4, h/4).
S203,基于角点的坐标,确定采样样本图像中样本车位的中心位置。S203. Based on the coordinates of the corner points, determine the center position of the sample parking space in the sample image.
根据上述获得的样本车位的4个角点在采样样本图像中的坐标进行计算,确定采样样本图像中样本车位的中心位置,并对采样样本图像中样本车位的中心位置进行四舍五入取整,即将样本车位的中心位置记为(round(x/4),round(y/4))。将样本车位的中心位置赋值为1,其他位置赋值为0。Calculate according to the coordinates of the four corner points of the sample parking spaces obtained above in the sampling sample image, determine the center position of the sample parking space in the sampling sample image, and round the center position of the sample parking space in the sampling sample image, that is, the sample The center position of the parking space is recorded as (round(x/4), round(y/4)). The center position of the sample parking space is assigned a value of 1, and other positions are assigned a value of 0.
本公开实施例对样本图像进行了下采样,增大了感受野,并减少了获取样本车位中的中心位置真值的计算量。In the embodiment of the present disclosure, the sample image is down-sampled, the receptive field is increased, and the calculation amount for obtaining the true value of the center position in the sample parking space is reduced.
图3是本公开提出的一种车位检测模型训练方法的一种实施方式,基于上述实施例的基础上,如图3所示,样本图像的类别真值热力图的生成过程,包括以下步骤:S301-S303。FIG. 3 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. 3 , the generation process of the category true value heat map of the sample image includes the following steps: S301-S303.
S301,确定车位识别的第一冗余偏移量。S301. Determine a first redundant offset for parking space identification.
与上述类似,对获得的样本图像进行下采样,获取下采样后的采样样本图像。在一些实施例中,可对上述获得的样本图像进行4倍下采样,获取下采样后的采样样本图像,该采样样本图像的尺寸为(w/4,h/4)。Similar to the above, the obtained sample image is down-sampled to obtain a down-sampled sample image. In some embodiments, the sample image obtained above may be down-sampled by 4 times to obtain a down-sampled sample image, and the size of the sample image is (w/4, h/4).
根据样本车位的4个角点在采样样本图像中的坐标进行计算,确定采样样本图像中样本车位的中心位置,并对采样样本图像中样本车位的中心位置进行四舍五入取整,即采样样本图像中样本车位的中心位置记为(round(x/4),round(y/4))。Calculate according to the coordinates of the four corner points of the sample parking space in the sampling sample image, determine the center position of the sample parking space in the sampling sample image, and round the center position of the sample parking space in the sampling sample image, that is, the center position of the sample parking space in the sampling sample image The center position of the sample parking space is recorded as (round(x/4), round(y/4)).
为了在车位检测模型对车位的类别检测略有误差的情况下仍可以对该车位进行准确分类,预先设定一个车位识别的第一冗余偏移量。其中,第一冗余偏移量包括ΔX1和ΔY1。其中,如果真实情况下样本车位的尺寸是[Wspot,Hspot],则ΔX1和ΔY1在[0,min(Wspot,Hspot)/2]范围内。In order to accurately classify the parking space even if the parking space detection model has a slight error in detecting the category of the parking space, a first redundant offset for parking space identification is preset. Wherein, the first redundancy offset includes ΔX1 and ΔY1. Wherein, if the size of the sample parking space is [Wspot, Hspot] in the real situation, then ΔX1 and ΔY1 are within the range of [0, min(Wspot, Hspot)/2].
S302,以中心位置为中心,以第一冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位识别冗余区。S302. Taking the central position as the center and using the first redundant offset as the pixel offset radius, expand the sample image to generate a parking space recognition redundant area in the sample image.
以中心位置为中心,以第一冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位识别冗余区,将车位识别冗余区记为(round(x/4)±ΔX1,round(y/4)±ΔY1)。Take the central position as the center, take the first redundant offset as the pixel offset radius, expand on the sample image, generate the parking space recognition redundant area in the sample image, and record the parking space identification redundant area as (round(x /4)±ΔX1, round(y/4)±ΔY1).
S303,基于样本车位的类别,分别对车位识别冗余区和除车位识别冗余区之外的第一剩余区域内的像素点的取值进行标注以生成类别真值热力图。S303. Based on the category of the sample parking space, respectively mark the values of the pixels in the parking space identification redundant area and the first remaining area except the parking space identification redundant area to generate a category true value heat map.
基于样本车位的类别,分别对车位识别冗余区和除车位识别冗余区之外的第一剩余区域内的像素点的取值进行标注。在一些实施例中,将样本车位的车位识别冗余区的像素点赋值为样本车位的类别值,其他位置赋值为0。在一些实施例中,样本车位的类别可包括 横向车位、纵向车位、斜向车位等,可对不同的车位类别赋予不同的类别值,比如说,可将横向车位的类别值赋值为1,可对纵向车位的类别值赋值为2。根据标注后的图像,生成类别真值热力图。Based on the category of the sample parking space, the values of the pixel points in the redundant parking space recognition area and the first remaining area except the redundant parking space identification area are respectively marked. In some embodiments, the pixel points in the parking space identification redundant area of the sample parking space are assigned as the class value of the sample parking space, and other positions are assigned as 0. In some embodiments, the category of the sample parking spaces may include horizontal parking spaces, longitudinal parking spaces, oblique parking spaces, etc., different category values may be assigned to different parking spaces, for example, the category value of the horizontal parking spaces may be assigned a value of 1, and Assign a value of 2 to the category value of the vertical parking space. According to the labeled image, generate a heat map of the true value of the category.
本公开实施例设定了一个用于进行车位识别的第一冗余偏移量,基于第一冗余偏移量设定车位识别冗余区,并生成类别真值热力图对车位检测模型进行训练,防止了生成的目标车位检测模型因角点跳动导致对车位类别识别不准确,增加了后续对车位类别预测的准确性和鲁棒性。The embodiment of the present disclosure sets a first redundant offset for parking space identification, sets a redundant area for parking space identification based on the first redundant offset, and generates a class true value heat map to carry out the parking space detection model Training prevents the generated target parking space detection model from being inaccurate in recognizing the parking space category due to corner jumps, and increases the accuracy and robustness of the subsequent prediction of the parking space category.
图4是本公开提出的一种车位检测模型训练方法的实施方式,基于上述实施例的基础上,如图4所示,回归真值热力图的生成过程,包括以下步骤:S401-S403。FIG. 4 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. 4 , the generation process of the regression true value heat map includes the following steps: S401-S403.
S401,确定车位角点回归的第二冗余偏移量。S401. Determine a second redundant offset for the parking corner regression.
与上述类似,对获得的样本图像进行下采样,获取下采样后的采样样本图像。在一些实施例中,可对上述获得的样本图像进行4倍下采样,获取下采样后的采样样本图像,该采样样本图像的尺寸为(w/4,h/4)。Similar to the above, the obtained sample image is down-sampled to obtain a down-sampled sample image. In some embodiments, the sample image obtained above may be down-sampled by 4 times to obtain a down-sampled sample image, and the size of the sample image is (w/4, h/4).
根据样本车位的4个角点在采样样本图像中的坐标进行计算,确定采样样本图像中样本车位的中心位置,并对采样样本图像中样本车位的中心位置进行四舍五入取整,即采样样本图像中样本车位的中心位置记为(round(x/4),round(y/4))。Calculate according to the coordinates of the four corner points of the sample parking space in the sampling sample image, determine the center position of the sample parking space in the sampling sample image, and round the center position of the sample parking space in the sampling sample image, that is, the center position of the sample parking space in the sampling sample image The center position of the sample parking space is recorded as (round(x/4), round(y/4)).
为了在车位检测模型对车位的角点检测略有误差的情况下仍可以对该车位的角点进行准确确定,预先设定一个车位角点回归的第二冗余偏移量。其中,第二冗余偏移量包括ΔX2和ΔY2。其中,如果真实情况下样本车位的尺寸是[Wspot,Hspot],则ΔX2和ΔY2在[0,min(Wspot,Hspot)/2]范围内。In order to accurately determine the corner point of the parking space even if the corner point detection of the parking space by the parking space detection model has a slight error, a second redundant offset for the regression of the corner point of the parking space is preset. Wherein, the second redundancy offset includes ΔX2 and ΔY2. Wherein, if the size of the sample parking space is [Wspot, Hspot] in the real situation, then ΔX2 and ΔY2 are within the range of [0, min(Wspot, Hspot)/2].
S402,以中心位置为中心,以第二冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位角点回归冗余区。S402. Taking the central position as the center and using the second redundant offset as the pixel offset radius, expand the sample image to generate a parking space corner regression redundant area in the sample image.
以中心位置为中心,以第二冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位角点回归冗余区,将车位角点回归冗余区记为(round(x/4)±ΔX2,round(y/4)±ΔY2)。Take the central position as the center, and take the second redundant offset as the pixel offset radius, expand on the sample image to generate the parking space corner regression redundant area in the sample image, and record the parking space corner regression redundant area as (round(x/4)±ΔX2, round(y/4)±ΔY2).
S403,基于样本车位角点的多个维度,对车位角点回归冗余区进行标注,作为车位角点回归标注图,并对车位角点回归标注图进行处理,获得车位角点的回归真值热力图。S403. Based on the multiple dimensions of the sample parking corner points, mark the regression redundant area of the parking space corner points as the parking space corner point regression labeling map, and process the parking space corner point regression labeling map to obtain the regression true value of the parking space corner points heat map.
由于每个样本车位对应四个角点,每个角点有x、y两个维度表示,所以样本车位角点回归一共有8个维度,基于样本车位角点的多个维度,对车位角点回归冗余区进行标注,作为车位角点回归标注图,并对车位角点回归标注图进行处理,获得车位角点的回归真值热力图。Since each sample parking space corresponds to four corner points, and each corner point has two dimensions of x and y, so the sample parking space corner regression has a total of 8 dimensions. Based on the multiple dimensions of the sample parking space corner points, the parking space corner point The regression redundant area is marked as the regression label map of the parking space corner point, and the regression label map of the parking space corner point is processed to obtain the regression true value heat map of the parking space corner point.
本公开实施例设定了一个用于进行车位角点回归的第二冗余偏移量,基于第二冗余偏移量获取了车位角点回归冗余区,并生成车位角点的回归真值热力图对车位检测模型进行训练,防止了生成的目标车位检测模型因角点跳动导致对车位角点识别不准确,增加了后续对车位角点预测的准确性和鲁棒性。进一步地,在获得车位角点回归真值热力图后,基于车位角点的回归真值热力图,获取多个维度下角点到车位角点回归冗余区的角点回归值。 其中。角点回归值可以理解为每个角点对应的两个维度下的角点坐标到车位角点回归冗余区内每个像素点的距离值。The embodiment of the present disclosure sets a second redundant offset for the regression of the corner of the parking space, obtains the redundant area of the corner of the parking space based on the second redundant offset, and generates the regression truth of the corner of the parking space The value heat map trains the parking space detection model, which prevents the generated target parking space detection model from inaccurate recognition of parking space corners due to corner jumps, and increases the accuracy and robustness of the subsequent prediction of parking space corners. Further, after obtaining the true value heat map of the parking corner point regression, based on the true value heat map of the parking space corner point regression, the corner point regression values from the corner point to the parking space corner point regression redundant area in multiple dimensions are obtained. in. The corner point regression value can be understood as the distance value from the corner point coordinates in the two dimensions corresponding to each corner point to each pixel point in the corner point regression redundancy area of the parking space.
图5是本公开提出的一种车位检测模型训练方法的一种实施方式,基于上述实施例的基础上,如图5所示,基于样本图像、类别真值热力图和回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型,包括以下步骤:S501-S504。FIG. 5 is an implementation of a parking space detection model training method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. 5 , based on the sample image, the category true value heat map and the regression true value heat map, The training of the parking space detection model to generate the target parking space detection model includes the following steps: S501-S504.
S501,将样本图像输入车位检测模型进行多尺度特征提取。S501. Input the sample image into the parking space detection model to perform multi-scale feature extraction.
图6是车位检测模型的算法示意图,如图6所示,为了在保证检测精度的前提下尽量减少模型的计算量,将样本图像输入车位检测模型进行多尺度特征提取,获取多尺度特征图。在一些实施例中,进行特征提取的算法可采用卷积神经网络(ConvolutionalNeuralNetworks,CNN)。Figure 6 is a schematic diagram of the algorithm of the parking space detection model. As shown in Figure 6, in order to minimize the calculation amount of the model under the premise of ensuring the detection accuracy, the sample image is input into the parking space detection model for multi-scale feature extraction to obtain multi-scale feature maps. In some embodiments, the feature extraction algorithm may use a convolutional neural network (Convolutional Neural Networks, CNN).
S502,基于提取的多尺度特征图分别进行中心位置预测、类别预测和回归预测,以获取预测中心位置、类别预测热力图、回归预测热力图。S502. Perform center position prediction, category prediction, and regression prediction based on the extracted multi-scale feature map, so as to obtain the prediction center location, category prediction heat map, and regression prediction heat map.
为了更好地提出样本车位的特征,对多尺度特征图中目标尺度的特征图进行融合,得到融合的特征图。将融合的特征图经中心位置检测头进行处理完成中心位置预测,得到预测中心位置。将融合的特征图经类别检测头进行处理完成类别预测,得到类别预测热力图。将融合的特征图经角点回归检测头进行处理完成回归预测,得到回归预测热力图。其中,中心位置检测头、类别检测头和角点回归检测头可以为由卷积层构成的网络。In order to better propose the characteristics of the sample parking spaces, the feature maps of the target scale in the multi-scale feature map are fused to obtain the fused feature map. The fused feature map is processed by the center position detection head to complete the center position prediction and obtain the predicted center position. The fused feature map is processed by the category detection head to complete the category prediction, and the category prediction heat map is obtained. The fused feature map is processed by the corner regression detection head to complete the regression prediction, and the regression prediction heat map is obtained. Wherein, the center position detection head, the category detection head and the corner point regression detection head may be a network composed of convolutional layers.
在一些实施例中,如图6所示,对样本图像采用CNN进行特征提取,并对提取的特征图进行2倍下采样,对2倍下采样特征图再进行CNN特征提取,以此类推,再依次对特征图进行4倍下采样、8倍下采样和16倍下采样,并在每次采样后对特征图进行CNN特征提取,分别获得4倍下采样特征图、8倍下采样特征图和16倍下采样特征图。In some embodiments, as shown in FIG. 6 , CNN is used for feature extraction on the sample image, and the extracted feature map is subjected to 2 times downsampling, and CNN feature extraction is performed on the 2 times downsampled feature map, and so on. Then perform 4 times downsampling, 8 times downsampling and 16 times downsampling on the feature map in turn, and perform CNN feature extraction on the feature map after each sampling, and obtain 4 times downsampling feature map and 8 times downsampling feature map respectively and 16x downsampled feature maps.
如图6所示,对16倍下采样特征图进行处理,并对处理后的图像进行16倍上采样,将进行16倍上采样后的图像与16倍下采样特征图进行融合后再进行CNN特征提取,获得第一融合图像。将第一融合图像进行8倍上采样,与8倍下采样特征图进行融合再进行CNN特征提取,得到最终的融合特征图。As shown in Figure 6, the 16-fold downsampled feature map is processed, and the processed image is 16-fold up-sampled, and the 16-fold up-sampled image is fused with the 16-fold downsampled feature map before CNN Feature extraction to obtain the first fused image. The first fused image is upsampled by 8 times, fused with the 8 times downsampled feature map, and then CNN feature extraction is performed to obtain the final fused feature map.
S503,基于预测中心位置和中心位置、类别预测热力图和类别真值热力图、以及回归预测热力图和回归真值热力图,确定车位检测模型的损失函数。S503. Determine the loss function of the parking space detection model based on the predicted center position and the center position, the class prediction heat map and the class true value heat map, and the regression prediction heat map and the regression true value heat map.
基于预测中心位置和中心位置、类别预测热力图和类别真值热力图、以及回归预测热力图和回归真值热力图,确定车位检测模型的损失函数。其中,车位检测模型的损失函数表示为:The loss function of the parking space detection model is determined based on the predicted center position and the center position, the class prediction heat map and the class true value heat map, and the regression prediction heat map and the regression true value heat map. Among them, the loss function of the parking space detection model is expressed as:
Loss=Loss_pos+Loss_kps-reg+Loss_clsLoss=Loss_pos+Loss_kps-reg+Loss_cls
上式中,Loss_pos是预测中心位置和中心位置之间的位置误差,Loss_kps-reg是回归预测热力图和回归真值热力图的回归误差,Loss_cls是类别预测热力图和类别真值热力图之间的分类误差。In the above formula, Loss_pos is the position error between the predicted center position and the center position, Loss_kps-reg is the regression error between the regression prediction heat map and the regression true value heat map, and Loss_cls is the difference between the class prediction heat map and the class true value heat map classification error.
其中,计算预测中心位置和中心位置之间的位置误差可采用交叉熵损失函数(Cross entropy loss);计算回归预测热力图和回归真值热力图的回归误差可采用回归损失函数 (Smooth L1 Loss);计算类别预测热力图和类别真值热力图之间的分类误差可采用交叉熵损失函数(Cross entropy loss)。Among them, the calculation of the position error between the predicted center position and the center position can use the cross entropy loss function (Cross entropy loss); the calculation of the regression error of the regression prediction heat map and the regression true value heat map can use the regression loss function (Smooth L1 Loss) ; To calculate the classification error between the category prediction heatmap and the category true value heatmap, the cross entropy loss function (Cross entropy loss) can be used.
S504,基于损失函数对车位检测模型进行调整并继续训练,直至训练结束,生成目标车位检测模型。S504. Adjust the parking space detection model based on the loss function and continue training until the training ends to generate a target parking space detection model.
基于每次将样本图像输入车位检测模型后生成的损失函数,对车位检测模型进行调整并继续训练,直至损失函数达到预设标准,则训练结束,生成目标车位检测模型。Based on the loss function generated after each sample image is input into the parking space detection model, the parking space detection model is adjusted and the training continues until the loss function reaches the preset standard, then the training ends and the target parking space detection model is generated.
本公开实施例经设置冗余区训练得到的目标车位检测模型,减少了输出角点时的随机性,且通过对样本图像的多尺度特征提取,获得样本图像丰富的语义信息,提高了在输出待测车位的中心位置、车位类别,角点位置时的准确度和稳定性。The target parking space detection model obtained by setting redundant area training in the embodiment of the present disclosure reduces the randomness when outputting corner points, and obtains rich semantic information of the sample image through multi-scale feature extraction of the sample image, which improves the accuracy of output. The accuracy and stability of the center position of the parking space to be tested, the type of parking space, and the corner position.
图7是本公开提出的一种车位检测模型训练方法的一种实施方式,如图7所示,该车位检测模型训练方法,包括以下步骤:S701-S715。Fig. 7 is an implementation manner of a parking space detection model training method proposed in the present disclosure. As shown in Fig. 7, the parking space detection model training method includes the following steps: S701-S715.
S701,获取在泊车过程中采集的多路图像,对多路图像进行拼接,生成拼接后的样本图像。S701. Acquire multiple images collected during parking, and splice the multiple images to generate a spliced sample image.
S702,从样本图像中识别样本车位的角点,并进行标注。S702. Identify corner points of the sample parking space from the sample image, and mark them.
S703,对样本图像进行下采样,得到采样样本图像。S703. Down-sampling the sample image to obtain a sampled sample image.
S704,基于角点的坐标,确定采样样本图像中样本车位的中心位置。S704. Based on the coordinates of the corner points, determine the center position of the sample parking space in the sample image.
关于步骤S701~S704的实现方式,可采用本公开中各实施例中的实现方式,在此不再进行赘述。Regarding the implementation manner of steps S701 to S704, the implementation manners in the embodiments of the present disclosure may be adopted, and details are not repeated here.
S705,确定车位识别的第一冗余偏移量。S705. Determine a first redundant offset for parking space identification.
S706,以中心位置为中心,以第一冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位识别冗余区。S706. Taking the central position as the center and using the first redundant offset as the pixel offset radius, expand the sample image to generate a parking space recognition redundant area in the sample image.
S707,基于样本车位的类别,分别对车位识别冗余区和除车位识别冗余区之外的第一剩余区域内的像素点的取值进行标注以生成类别真值热力图。S707. Based on the category of the sample parking space, respectively mark the values of the pixels in the parking space identification redundant area and the first remaining area except the parking space identification redundant area to generate a class true value heat map.
关于步骤S705~S707的实现方式,可采用本公开中各实施例中的实现方式,在此不再进行赘述。Regarding the implementation manner of steps S705-S707, the implementation manners in the embodiments of the present disclosure may be adopted, and details are not repeated here.
S708,确定车位角点回归的第二冗余偏移量。S708. Determine the second redundant offset for the regression of the corner point of the parking space.
S709,以中心位置为中心,以第二冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位角点回归冗余区。S709, expanding on the sample image with the center position as the center and the second redundant offset as the pixel offset radius, to generate a parking space corner regression redundant area in the sample image.
S710,基于样本车位角点的多个维度,对车位角点回归冗余区进行标注,作为车位角点回归标注图,并对车位角点回归标注图进行处理,获得车位角点的回归真值热力图。S710: Based on the multiple dimensions of the sample parking corner points, mark the regression redundant area of the parking space corner points as a parking space corner point regression labeling map, and process the parking space corner point regression labeling map to obtain the regression true value of the parking space corner points heat map.
S711,基于车位角点的回归真值热力图,获取多个维度下角点到车位角点回归冗余区的角点回归值。S711. Based on the heat map of the true regression value of the parking space corner, obtain the corner point regression value from the corner point to the parking space corner point regression redundant area in multiple dimensions.
关于步骤S708~S711的实现方式,可采用本公开中各实施例中的实现方式,在此不再进行赘述。Regarding the implementation manner of steps S708-S711, the implementation manners in the embodiments of the present disclosure may be adopted, and details are not repeated here.
S712,将样本图像输入车位检测模型进行多尺度特征提取。S712. Input the sample image into the parking space detection model to perform multi-scale feature extraction.
S713,基于提取的多尺度特征图分别进行中心位置预测、类别预测和回归预测,以获 取预测中心位置、类别预测热力图、回归预测热力图。S713. Based on the extracted multi-scale feature map, respectively perform center position prediction, category prediction and regression prediction, so as to obtain the prediction center position, category prediction heat map, and regression prediction heat map.
S714,基于预测中心位置和中心位置、类别预测热力图和类别真值热力图、以及回归预测热力图和回归真值热力图,确定车位检测模型的损失函数。S714. Determine a loss function of the parking space detection model based on the predicted center position and the center position, the class prediction heat map and the class true value heat map, and the regression prediction heat map and the regression true value heat map.
S715,基于损失函数对车位检测模型进行调整并继续训练,直至训练结束,生成目标车位检测模型。S715. Adjust the parking space detection model based on the loss function and continue training until the training ends to generate a target parking space detection model.
关于步骤S712~S715的实现方式,可采用本公开中各实施例中的实现方式,在此不再进行赘述。Regarding the implementation manner of steps S712-S715, the implementation manners in the embodiments of the present disclosure may be adopted, and details are not repeated here.
本公开在训练过程中,考虑到了车位类别识别和车位角点回归计算时的冗余,使得训练好的目标车位检测模型在车位识别时会考虑冗余区,能够更准确地检测出车位的中心位置、车位类别和角点位置。In the training process of the present disclosure, the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, and can detect the center of the parking space more accurately. location, bay category, and corner location.
图8是本公开提出的一种车位检测方法的一种实施方式,该车位检测方法的目标车位检测模型可由上述图1-图7任一实施例的车位检测模型训练方法训练得到,如图8所示,该车位检测方法,包括以下步骤:S801-S803。Fig. 8 is an embodiment of a parking space detection method proposed in the present disclosure. The target parking space detection model of the parking space detection method can be obtained by training the parking space detection model training method of any embodiment of the above-mentioned Fig. 1-Fig. 7, as shown in Fig. 8 As shown, the parking space detection method includes the following steps: S801-S803.
S801,获取待预测的目标图像,其中,目标图像中至少包括一个待测车位。S801. Acquire a target image to be predicted, wherein the target image includes at least one parking space to be tested.
获取待预测的图像,将待预测的图像作为目标图像,其中,目标图像中至少包括一个待测车位。在一些实施例中,目标图像可为待测车辆在泊车过程中采集的图像,也可由预设的摄像机采集在泊车过程中的图像。An image to be predicted is acquired, and the image to be predicted is used as a target image, wherein the target image includes at least one parking space to be tested. In some embodiments, the target image may be an image collected by the vehicle under test during parking, or may be an image collected by a preset camera during parking.
S802,将目标图像输入目标车位检测模型中,以获取待测车位的目标中心位置、类别目标热力图和回归目标热力图,其中,类别目标热力图中包括用于车位类别识别的冗余区,回归目标热力图中包括用于车位角点回归的冗余区。S802, input the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes a redundant area for identifying the category of the parking space, The regression target heatmap includes redundant regions for the parking corner regression.
将目标图像输入目标车位检测模型中,目标车位检测模型对目标图像,先进行特征提取和融合,得到融合的特征图。然后基于融合的特征图,分别通过中心位置检测头、类别检测头和角点回归检测头以分别获取目标车位检测模型输出的待测车位的目标中心位置热力图、类别目标热力图和回归目标热力图。其中,图9(a)为本公开实施例的目标中心位置热力图的示意图;图9(b)为本公开实施例的类别目标热力图的示意图;图9(c)为本公开实施例的回归目标热力图的示意图。如图9(b)所示,类别目标热力图中包括用于车位类别识别的冗余区;如图9(c)所示,回归目标热力图中包括用于车位角点回归的冗余区。Input the target image into the target parking space detection model, and the target parking space detection model first performs feature extraction and fusion on the target image to obtain a fused feature map. Then based on the fused feature map, the target center position heat map, category target heat map and regression target heat output of the target parking space detection model are respectively obtained through the center position detection head, category detection head and corner point regression detection head. picture. Among them, Fig. 9(a) is a schematic diagram of the heat map of the target center position of the embodiment of the present disclosure; Fig. 9(b) is a schematic diagram of the heat map of the category target of the embodiment of the present disclosure; Fig. 9(c) is a schematic diagram of the heat map of the embodiment of the present disclosure Schematic representation of the regression target heatmap. As shown in Figure 9(b), the category target heat map includes a redundant area for the recognition of the parking space category; as shown in Figure 9(c), the regression target heat map includes a redundant area for the parking space corner regression .
在一些实施例中,继续以图6为例,对目标图像采用CNN进行特征提取,并对提取的特征图进行2倍下采样,对2倍下采样特征图再进行CNN特征提取,以此类推,再依次对特征图进行4倍下采样、8倍下采样和16倍下采样,并在每次采样后对特征图进行CNN特征提取,分别获得4倍下采样特征图、8倍下采样特征图和16倍下采样特征图。如图6所示,对16倍下采样特征图进行处理,并对处理后的图像进行16倍上采样,将进行16倍上采样后的图像与16倍下采样特征图进行融合后再进行CNN特征提取,获得第一融合图像。将第一融合图像进行8倍上采样,与8倍下采样特征图进行融合再进行CNN特征提取,得到最终的融合特征图。In some embodiments, continuing to take Fig. 6 as an example, CNN is used for feature extraction on the target image, and the extracted feature map is subjected to 2 times downsampling, and CNN feature extraction is performed on the 2 times downsampled feature map, and so on. , and then sequentially perform 4 times downsampling, 8 times downsampling and 16 times downsampling on the feature map, and perform CNN feature extraction on the feature map after each sampling, and obtain 4 times downsampling feature map and 8 times downsampling feature respectively Figure and 16x downsampling feature map. As shown in Figure 6, the 16-fold downsampled feature map is processed, and the processed image is 16-fold up-sampled, and the 16-fold up-sampled image is fused with the 16-fold downsampled feature map before CNN Feature extraction to obtain the first fused image. The first fused image is upsampled by 8 times, fused with the 8 times downsampled feature map, and then CNN feature extraction is performed to obtain the final fused feature map.
S803,基于用于车位类别识别的冗余区和用于车位角点回归的冗余区,确定待测车位的目标车位类别和目标车位角点。S803. Based on the redundant area used for identifying the parking space category and the redundant area used for the parking space corner regression, determine the target parking space category and the target parking space corner point of the parking space to be tested.
基于类别目标热力图中用于车位类别识别的冗余区内的像素点的坐标或者用于车位类别识别的冗余区内的像素点对应的车位类别,确定待测车位的目标车位类别。Determine the target parking space category of the parking space to be tested based on the coordinates of the pixels in the redundant area for identifying the parking space category in the class target heat map or the corresponding parking space category for the pixels in the redundant area for identifying the parking space category.
基于回归目标热力图中用于车位角点回归的冗余区内的像素点的坐标,确定待测车位的目标车位角点。Based on the coordinates of the pixels in the redundant area used for the regression of the parking space corner in the regression target heat map, the target parking space corner point of the parking space to be tested is determined.
本公开在训练过程中,考虑到了车位类别识别和车位角点回归计算时的冗余,使得训练好的目标车位检测模型在车位识别时会考虑冗余区,防止了输出单一结果的随机性,能够更准确地检测出车位的中心位置、车位类别和角点位置。During the training process of the present disclosure, the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, preventing the randomness of outputting a single result. It can more accurately detect the center position of the parking space, the category of the parking space and the corner position.
在一些实施例中,对目标车位检测模型输出的待测车位的中心位置目标热力图进行非极大值抑制(Non-Maximum Suppression,NMS)算法处理,将处理后的中心位置目标热力图中大于预设阈值的像素点作为待测车位的目标中心位置。In some embodiments, a non-maximum suppression (Non-Maximum Suppression, NMS) algorithm is performed on the center position target heat map of the parking space to be tested output by the target parking space detection model, and the processed center position target heat map is greater than The pixel point of the preset threshold is used as the target center position of the parking space to be tested.
图10是本公开提出的一种车位检测方法的一种实施方式,基于上述实施例的基础上,如图10所示,基于用于车位类别识别的冗余区和用于车位角点回归的冗余区,确定待测车位的目标车位类别和目标车位角点,包括以下步骤:S1001-S1002。FIG. 10 is an implementation of a parking space detection method proposed in the present disclosure. Based on the above-mentioned embodiments, as shown in FIG. The redundant area is to determine the target parking space category and the target parking space corner of the parking space to be tested, including the following steps: S1001-S1002.
S1001,基于用于车位类别识别的冗余区,确定待测车位的目标车位类别。S1001. Determine the target parking space category of the parking space to be tested based on the redundant area used for identifying the category of the parking space.
在一些实施例中,对用于车位类别识别的冗余区内多个像素点的坐标进行平均,获取一个目标像素点。根据目标像素点的车位类别值对应的车位类别,确定待测车位的目标车位类别。举例说明,若目标像素点的车位类别值为1,而1对应的车位类别为横向车位,则待测车位的目标车位类别为横向车位。In some embodiments, the coordinates of multiple pixel points in the redundant area used for identifying the parking space category are averaged to obtain a target pixel point. According to the parking space category corresponding to the parking space category value of the target pixel point, the target parking space category of the parking space to be tested is determined. For example, if the parking space category value of the target pixel is 1, and the parking space category corresponding to 1 is a horizontal parking space, then the target parking space category of the parking space to be tested is a horizontal parking space.
在另一些实施例中,确定用于车位类别识别的冗余区内多个像素对应的多个车位类别,对多个车位类别进行投票,确定待测车位的目标车位类别。举例说明,若用于车位类别识别的冗余区内有10个像素点,其中有9个像素点的车位类别值为1,而1对应的车位类别为横向车位;1个像素点的车位类别值为2,而2对应的车位类别为纵向车位,则对这10个车位类别进行投票。由于表示车位类别为横向车位的占90%,则确定待测车位的目标车位类别为横向车位。In some other embodiments, a plurality of parking space categories corresponding to a plurality of pixels in the redundant area for identification of the parking space category is determined, voting is performed on the multiple parking space categories, and a target parking space category of the parking space to be tested is determined. For example, if there are 10 pixels in the redundant area for recognition of the parking space category, among them, the parking space category value of 9 pixels is 1, and the corresponding parking space category of 1 is a horizontal parking space; the parking space category of 1 pixel point The value is 2, and the parking space category corresponding to 2 is a longitudinal parking space, then vote for these 10 parking space categories. Since 90% of the parking spaces indicate that the category of the parking spaces is a horizontal parking space, it is determined that the target parking space category of the parking spaces to be tested is a horizontal parking space.
S1002,基于用于车位角点回归的冗余区,确定待测车位的目标车位角点。S1002. Determine a target corner point of the parking space to be tested based on the redundant area used for the regression of the corner point of the parking space.
确定用于车位角点回归的冗余区内对应的多个角点回归值,其中,多个角点回归值为上述在获得车位角点的回归真值热力图后,基于车位角点的回归真值热力图,获取的多个维度下角点到车位角点回归冗余区的多个角点回归值。Determine the corresponding multiple corner regression values in the redundant area used for parking corner regression, wherein the multiple corner regression values are based on the regression based on the parking corner after obtaining the regression true value heat map of the parking corner The true value heat map obtains multiple corner regression values from the corner point to the parking space corner regression redundant area in multiple dimensions.
对用于车位角点回归的冗余区内的多个像素点,都基于用于车位角点回归的冗余区对应的多个角点回归值进行计算,获取待测车位的4个角点分别对应的多个预测角点坐标,并对多个预测角点坐标进行平均,确定待测车位的任一角点对应的目标车位角点。以待测车位的第1个角点为例,假如说用于车位角点回归的冗余区对应的多个角点回归值分别为21,22,23个像素点,则对用于车位角点回归的冗余区内的每个像素点,以X,Y两个维度都偏移21,22,23个像素点,分别得到待测车位的第1个角点对应的多个预测角点坐标, 并对多个预测角点坐标进行平均,确定待测车位的任一角点对应的目标车位角点。Multiple pixel points in the redundant area used for parking corner regression are calculated based on multiple corner regression values corresponding to the redundant area used for parking corner regression, and four corner points of the parking space to be tested are obtained. Corresponding coordinates of a plurality of predicted corner points respectively, and averaging the coordinates of the plurality of predicted corner points to determine a corner point of a target parking space corresponding to any corner point of the parking space to be measured. Taking the first corner point of the parking space to be tested as an example, if the multiple corner regression values corresponding to the redundant area used for the parking space corner regression are 21, 22, and 23 pixels respectively, then for the parking space corner Each pixel in the redundant area of point regression is offset by 21, 22, and 23 pixels in both X and Y dimensions, and multiple predicted corners corresponding to the first corner of the parking space to be tested are respectively obtained coordinates, and average the coordinates of multiple predicted corner points to determine the corner point of the target parking space corresponding to any corner point of the parking space to be tested.
本公开的目标车位检测模型在车位识别时会考虑冗余区,基于平均像素点或者对多个像素点所对应的车位类别进行投票的方式,减少了单一结果的随机性,减少了误差,能够更准确地检测出车位的中心位置、车位类别和角点位置。The target parking space detection model of the present disclosure will consider the redundant area when identifying the parking space, based on the average pixel point or the way of voting on the parking space category corresponding to multiple pixel points, which reduces the randomness of a single result, reduces the error, and can More accurately detect the center position of the parking space, the category of the parking space and the corner position.
图11是本公开提出的一种车位检测方法的一种实施方式,该车位检测方法的目标车位检测模型可由上述图1-图7任一实施例的车位检测模型训练方法训练得到,如图11所示,该车位检测方法,包括以下步骤:S1101-S1107。Fig. 11 is an embodiment of a parking space detection method proposed in the present disclosure. The target parking space detection model of the parking space detection method can be obtained by training the parking space detection model training method of any one of the above-mentioned Fig. 1-Fig. 7 embodiments, as shown in Fig. 11 As shown, the parking space detection method includes the following steps: S1101-S1107.
S1101,获取待预测的目标图像,其中,目标图像中至少包括一个待测车位。S1101. Acquire a target image to be predicted, wherein the target image includes at least one parking space to be tested.
S1102,将目标图像输入目标车位检测模型中,以获取待测车位的目标中心位置、类别目标热力图和回归目标热力图,其中,类别目标热力图中包括用于车位类别识别的冗余区,回归目标热力图中包括用于车位角点回归的冗余区。S1102, input the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes a redundant area for identifying the category of the parking space, The regression target heatmap includes redundant regions for the parking corner regression.
S1103,对用于车位类别识别的冗余区内多个像素进行平均,获取目标像素。S1103, averaging multiple pixels in the redundant area used for identifying the category of the parking space to obtain a target pixel.
S1104,基于目标像素,确定待测车位的目标车位类别。S1104. Based on the target pixel, determine the target parking space category of the parking space to be tested.
关于步骤S1103~S1104的实现方式,可采用本公开中各实施例中的实现方式,在此不再进行赘述。Regarding the implementation manner of steps S1103-S1104, the implementation manners in the embodiments of the present disclosure may be adopted, and details are not repeated here.
S1105,确定用于车位角点回归的冗余区内对应的多个角点回归值。S1105. Determine a plurality of corresponding corner regression values in the redundant area used for parking corner regression.
S1106,基于多个角点回归值确定待测车位的任一角点对应的多个预测角点坐标。S1106. Determine a plurality of predicted corner coordinates corresponding to any corner of the parking space to be tested based on the regression values of the plurality of corners.
S1107,对多个预测角点坐标进行平均,确定待测车位的任一角点对应的目标车位角点。S1107. Average the coordinates of multiple predicted corner points, and determine the corner point of the target parking space corresponding to any corner point of the parking space to be measured.
关于步骤S1105~S1107的实现方式,可采用本公开中各实施例中的实现方式,在此不再进行赘述。Regarding the implementation manner of steps S1105-S1107, the implementation manners in the embodiments of the present disclosure may be adopted, and details are not repeated here.
本公开在训练过程中,考虑到了车位类别识别和车位角点回归计算时的冗余,使得训练好的目标车位检测模型在车位识别时会考虑冗余区,防止了输出单一结果的随机性,能够更准确地检测出车位的中心位置、车位类别和角点位置。During the training process of the present disclosure, the redundancy in the recognition of the parking space category and the regression calculation of the corner point of the parking space is considered, so that the trained target parking space detection model will consider the redundant area when recognizing the parking space, preventing the randomness of outputting a single result. It can more accurately detect the center position of the parking space, the category of the parking space and the corner position.
图12是本公开提出的一种车位检测模型训练装置,如图12所示,该车位检测模型训练装置1200,包括获取模块1201、确定模块1202、生成模块1203和训练模块1204,其中:Figure 12 is a parking space detection model training device proposed in the present disclosure. As shown in Figure 12, the parking space detection model training device 1200 includes an acquisition module 1201, a determination module 1202, a generation module 1203 and a training module 1204, wherein:
获取模块1201,用于获取样本车位的样本图像,并从样本图像中获取样本车位的中心位置。The obtaining module 1201 is configured to obtain a sample image of a sample parking space, and obtain a center position of the sample parking space from the sample image.
确定模块1202,用于根据中心位置,确定样本图像中的车位识别冗余区和车位角点回归冗余区。The determination module 1202 is configured to determine the redundant area of parking space recognition and the redundant area of parking space corner regression in the sample image according to the center position.
生成模块1203,用于根据车位识别冗余区,对样本图像进行标记,生成样本图像的类别真值热力图,以及根据车位角点回归冗余区,对样本图像进行标记,生成样本图像的车位角点的回归真值热力图。The generation module 1203 is used to identify the redundant area according to the parking space, mark the sample image, generate the category truth heat map of the sample image, and return the redundant area according to the corner point of the parking space, mark the sample image, and generate the parking space of the sample image Regression ground-truth heatmap of corner points.
训练模块1204,用于基于样本图像、类别真值热力图和回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型。The training module 1204 is configured to train the parking space detection model based on the sample image, the category true value heat map and the regression true value heat map to generate a target parking space detection model.
进一步地,获取模块1201,还用于:从样本图像中识别样本车位的角点,并进行标注;对样本图像进行下采样,得到采样样本图像;基于角点的坐标,确定采样样本图像中样本 车位的中心位置。Further, the acquisition module 1201 is also used to: identify the corner points of the sample parking space from the sample image and mark them; down-sample the sample image to obtain a sample sample image; determine the sample in the sample sample image based on the coordinates of the corner points The central location of the parking lot.
进一步地,确定模块1202,还用于:确定车位识别的第一冗余偏移量;以中心位置为中心,以第一冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位识别冗余区;基于样本车位的类别,分别对车位识别冗余区和除车位识别冗余区之外的第一剩余区域内的像素点的取值进行标注以生成类别真值热力图。Further, the determination module 1202 is also used to: determine the first redundant offset for parking space recognition; expand the sample image with the center position as the center and the first redundant offset as the pixel offset radius, Generate the parking space recognition redundant area in the sample image; based on the category of the sample parking space, mark the values of the pixels in the parking space recognition redundant area and the first remaining area except the parking space recognition redundant area to generate the category True value heat map.
进一步地,确定模块1202,还用于:确定车位角点回归的第二冗余偏移量;以中心位置为中心,以第二冗余偏移量为像素偏移半径,在样本图像上进行扩张,生成样本图像中的车位角点回归冗余区;基于样本车位角点的多个维度,对车位角点回归冗余区进行标注,作为车位角点回归标注图;对车位角点回归标注图进行处理,获得车位角点的回归真值热力图。Further, the determination module 1202 is also used to: determine the second redundant offset of the parking space corner regression; take the center position as the center, and use the second redundant offset as the pixel offset radius to perform the calculation on the sample image Expand and generate the parking space corner regression redundant area in the sample image; based on the multiple dimensions of the sample parking space corner point, mark the parking space corner regression redundant area as a parking space corner regression labeling map; mark the parking space corner regression The map is processed to obtain the regression true value heat map of the corner point of the parking space.
进一步地,生成模块1203,还用于:基于车位角点的回归真值热力图,获取多个维度下角点到车位角点回归冗余区的角点回归值。Further, the generation module 1203 is also used for: based on the heat map of the true regression value of the corner of the parking space, obtain the regression value of the corner point in multiple dimensions from the corner point to the regression redundant area of the corner point of the parking space.
进一步地,训练模块1204,还用于:将样本图像输入车位检测模型进行多尺度特征提取;基于提取的多尺度特征图分别进行中心位置预测、类别预测和回归预测,以获取预测中心位置、类别预测热力图、回归预测热力图;基于预测中心位置和中心位置、类别预测热力图和类别真值热力图、以及回归预测热力图和回归真值热力图,确定车位检测模型的损失函数;基于损失函数对车位检测模型进行调整并继续训练,直至训练结束,生成目标车位检测模型。Further, the training module 1204 is also used to: input the sample image into the parking space detection model for multi-scale feature extraction; perform center position prediction, category prediction and regression prediction based on the extracted multi-scale feature map, so as to obtain the prediction center position, category Prediction heat map, regression prediction heat map; based on the prediction center position and center position, category prediction heat map and category true value heat map, and regression prediction heat map and regression true value heat map, determine the loss function of the parking space detection model; based on loss The function adjusts the parking space detection model and continues training until the training ends to generate the target parking space detection model.
进一步地,训练模块1204,还用于:对多尺度特征图中目标尺度的特征图进行融合,得到融合的特征图;基于融合的特征图进行中心位置预测,得到预测中心位置;基于融合的特征图进行类别预测,得到类别预测热力图;基于融合的特征图进行回归预测,得到回归预测热力图。Further, the training module 1204 is also used to: fuse the feature maps of the target scale in the multi-scale feature map to obtain the fused feature map; predict the center position based on the fused feature map to obtain the predicted center position; The category prediction is performed on the map, and the category prediction heat map is obtained; the regression prediction is performed based on the fused feature map, and the regression prediction heat map is obtained.
图13是本公开提出的一种车位检测装置,在上述车位检测模型训练装置1200的基础上运行,如图13所示,该车位检测装置1300,包括第一获取模块1301、第二获取模块1302和确定模块1303,其中:FIG. 13 is a parking space detection device proposed in the present disclosure, which operates on the basis of the above-mentioned parking space detection model training device 1200. As shown in FIG. 13 , the parking space detection device 1300 includes a first acquisition module 1301 and a second acquisition module 1302 and determination module 1303, wherein:
第一获取模块1301,用于获取待预测的目标图像,其中,目标图像中至少包括一个待测车位。The first acquiring module 1301 is configured to acquire a target image to be predicted, wherein the target image includes at least one parking space to be tested.
第二获取模块1302,用于将目标图像输入目标车位检测模型中,以获取待测车位的目标中心位置、类别目标热力图和回归目标热力图,其中,类别目标热力图中包括用于车位类别识别的冗余区,回归目标热力图中包括用于车位角点回归的冗余区。The second acquisition module 1302 is used to input the target image into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes the parking space category The identified redundant area, the regression target heat map includes the redundant area used for the parking corner regression.
确定模块1303,用于基于用于车位类别识别的冗余区和用于车位角点回归的冗余区,确定待测车位的目标车位类别和目标车位角点。The determining module 1303 is configured to determine a target parking space category and a target parking space corner point of the parking space to be tested based on the redundant area used for identifying the parking space category and the redundant area used for the parking space corner regression.
进一步地,第二获取模块1302,还用于:对目标车位检测模型输出的中心位置目标热力图进行处理;将处理后的中心位置目标热力图中大于预设阈值的像素点作为待测车位的目标中心位置。Further, the second acquisition module 1302 is also used to: process the central position target thermal map output by the target parking space detection model; use the processed pixel points in the central position target thermal map greater than the preset threshold as the parking spaces to be tested Target center position.
进一步地,确定模块1303,还用于:对用于车位类别识别的冗余区内多个像素进行平 均,获取目标像素,基于目标像素,确定待测车位的目标车位类别或者确定用于车位类别识别的冗余区内多个像素对应的多个车位类别;对多个车位类别进行投票,确定待测车位的目标车位类别。Further, the determining module 1303 is also used to: average a plurality of pixels in the redundant area used for identifying the parking space category to obtain the target pixel, and based on the target pixel, determine the target parking space category of the parking space to be tested or determine the parking space category Multiple parking space categories corresponding to multiple pixels in the identified redundant area; vote for multiple parking space categories to determine the target parking space category of the parking space to be tested.
进一步地,确定模块1303,还用于:确定用于车位角点回归的冗余区内对应的多个角点回归值;基于多个角点回归值确定待测车位的任一角点对应的多个预测角点坐标;对多个预测角点坐标进行平均,确定待测车位的任一角点对应的目标车位角点。Further, the determination module 1303 is also used to: determine a plurality of corner regression values corresponding to the redundant area used for the corner regression of the parking space; coordinates of a predicted corner point; the coordinates of multiple predicted corner points are averaged to determine the corner point of the target parking space corresponding to any corner point of the parking space to be measured.
进一步地,本公开还提出了一种自动驾驶车辆,该自动驾驶车辆包括上述车位检测模型训练装置1200或车位检测装置1300。Further, the present disclosure also proposes an automatic driving vehicle, which includes the above-mentioned parking space detection model training device 1200 or the parking space detection device 1300 .
为了实现上述实施例,本公开实施例还提出一种电子设备1400,如图14所示,该电子设备1400包括:处理器1401和处理器通信连接的存储器1402,存储器1402存储有可被至少一个处理器执行的指令,指令被至少一个处理器1401执行,以实现如上述实施例所示的车位检测模型训练方法和车位检测方法。In order to realize the above-mentioned embodiments, an embodiment of the present disclosure also proposes an electronic device 1400. As shown in FIG. The instructions executed by the processor are executed by at least one processor 1401 to implement the parking space detection model training method and the parking space detection method as shown in the above embodiments.
为了实现上述实施例,本公开实施例还提出一种存储有计算机指令的非瞬时计算机可读存储介质,其中,计算机指令用于使计算机实现如上述实施例所示的车位检测模型训练方法和车位检测方法。In order to realize the above-mentioned embodiments, the embodiments of the present disclosure also propose a non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to enable the computer to implement the parking space detection model training method and the parking space as shown in the above-mentioned embodiments. Detection method.
为了实现上述实施例,本公开实施例还提供了一种计算机程序产品,包括计算机程序,该计算机程序在被处理器执行时实现如上述实施例所示的车位检测模型训练方法和车位检测方法。In order to implement the above embodiments, the embodiments of the present disclosure further provide a computer program product, including a computer program, which implements the parking space detection model training method and the parking space detection method as shown in the above embodiments when the computer program is executed by a processor.
为了实现上述实施例,本公开实施例还提出一种计算机程序,其中该计算机程序包括计算机程序代码,当该计算机程序代码在计算机上运行时,使得计算机执行如上述实施例所示的车位检测模型训练方法和车位检测方法。In order to realize the above-mentioned embodiment, the embodiment of the present disclosure also proposes a computer program, wherein the computer program includes computer program code, when the computer program code is run on the computer, it makes the computer execute the parking space detection model shown in the above-mentioned embodiment Training method and parking space detection method.
需要说明的是,前述对数据传输方法实施例的解释说明也适用于上述实施例中的装置、车辆、电子设备、非瞬时计算机可读存储介质、计算机程序产品和计算机程序,此处不再赘述。It should be noted that the foregoing explanations of the embodiments of the data transmission method are also applicable to the devices, vehicles, electronic equipment, non-transitory computer-readable storage media, computer program products, and computer programs in the above embodiments, and will not be repeated here. .
在本公开的描述中,需要理解的是,术语“中心”、“纵向”、“横向”、“长度”、“宽度”、“厚度”、“上”、“下”、“前”、“后”、“左”、“右”、“竖直”、“水平”、“顶”、“底”“内”、“外”、“顺时针”、“逆时针”、“轴向”、“径向”、“周向”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本公开和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本公开的限制。In describing the present disclosure, it is to be understood that the terms "center", "longitudinal", "transverse", "length", "width", "thickness", "upper", "lower", "front", " Back", "Left", "Right", "Vertical", "Horizontal", "Top", "Bottom", "Inner", "Outer", "Clockwise", "Counterclockwise", "Axial", The orientations or positional relationships indicated by "radial", "circumferential", etc. are based on the orientations or positional relationships shown in the drawings, and are only for the convenience of describing the present disclosure and simplifying the description, rather than indicating or implying the referred devices or elements Must be in a particular orientation, constructed, and operate in a particular orientation, and thus should not be construed as limiting on the present disclosure.
此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。在本公开的描述中,“多个”的含义是两个或两个以上,除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only, and cannot be interpreted as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, a feature defined as "first" and "second" may explicitly or implicitly include one or more of these features. In the description of the present disclosure, "plurality" means two or more, unless otherwise specifically defined.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本公开的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须 针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, descriptions referring to the terms "one embodiment", "some embodiments", "example", "specific examples", or "some examples" mean that specific features described in connection with the embodiment or example , structure, material or characteristic is included in at least one embodiment or example of the present disclosure. In this specification, the schematic representations of the above terms are not necessarily referring to the same embodiment or example. Furthermore, the described specific features, structures, materials or characteristics may be combined in any suitable manner in any one or more embodiments or examples. In addition, those skilled in the art can combine and combine different embodiments or examples and features of different embodiments or examples described in this specification without conflicting with each other.
尽管上面已经示出和描述了本公开的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本公开的限制,本领域的普通技术人员在本公开的范围内可以对上述实施例进行变化、修改、替换和变型。Although the embodiments of the present disclosure have been shown and described above, it can be understood that the above embodiments are exemplary and should not be construed as limitations on the present disclosure, and those skilled in the art can understand the above-mentioned embodiments within the scope of the present disclosure. The embodiments are subject to changes, modifications, substitutions and variations.

Claims (18)

  1. 一种车位检测模型训练方法,包括:A parking space detection model training method, comprising:
    获取样本车位的样本图像,并从所述样本图像中获取所述样本车位的中心位置;Obtain a sample image of the sample parking space, and obtain the center position of the sample parking space from the sample image;
    根据所述中心位置,确定所述样本图像中的车位识别冗余区和车位角点回归冗余区;According to the center position, determine the parking space recognition redundant area and the parking space corner regression redundant area in the sample image;
    根据所述车位识别冗余区,对所述样本图像进行标记,生成所述样本图像的类别真值热力图,以及根据所述车位角点回归冗余区,对所述样本图像进行标记,生成所述样本图像的车位角点的回归真值热力图;According to the parking space identification redundant area, the sample image is marked to generate a category truth heat map of the sample image, and according to the parking space corner regression redundant area, the sample image is marked to generate The regression true value thermodynamic map of the parking space corner of the sample image;
    基于所述样本图像、所述类别真值热力图和所述回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型。Based on the sample image, the category true value heat map and the regression true value heat map, the parking space detection model is trained to generate a target parking space detection model.
  2. 根据权利要求1所述的方法,其中,所述从所述样本图像中获取所述样本车位的中心位置,包括:The method according to claim 1, wherein said obtaining the center position of the sample parking space from the sample image comprises:
    从所述样本图像中识别所述样本车位的角点,并进行标注;identifying the corner points of the sample parking spaces from the sample images, and marking them;
    对所述样本图像进行下采样,得到采样样本图像;Downsampling the sample image to obtain a sample image;
    基于所述角点的坐标,确定所述采样样本图像中所述样本车位的中心位置。Based on the coordinates of the corner points, determine the center position of the sample parking space in the sample image.
  3. 根据权利要求1或2所述的方法,其中,所述根据所述车位识别冗余区,对所述样本图像进行标记,生成所述样本图像的类别真值热力图,包括:The method according to claim 1 or 2, wherein the step of marking the sample image according to the redundant area of the parking space identification, and generating the class truth heat map of the sample image includes:
    确定车位识别的第一冗余偏移量;Determining the first redundant offset for parking space identification;
    以所述中心位置为中心,以所述第一冗余偏移量为像素偏移半径,在所述样本图像上进行扩张,生成所述样本图像中的车位识别冗余区;Taking the center position as the center and taking the first redundant offset as the pixel offset radius, expanding on the sample image to generate a parking space recognition redundant area in the sample image;
    基于所述样本车位的类别,分别对所述车位识别冗余区和除所述车位识别冗余区之外的第一剩余区域内的像素点的取值进行标注以生成所述类别真值热力图。Based on the category of the sample parking spaces, the values of the pixels in the redundant area for parking space identification and the first remaining area except for the redundant area for parking space identification are respectively marked to generate the true value of the category. picture.
  4. 根据权利要求1至3中任一项所述的方法,其中,所述根据所述车位角点回归冗余区,对所述样本图像进行标记,生成所述样本图像的车位角点的回归真值热力图,包括:The method according to any one of claims 1 to 3, wherein, according to the regression redundant area of the corner point of the parking space, the sample image is marked, and the regression true of the corner point of the parking space of the sample image is generated. Value heat map, including:
    确定车位角点回归的第二冗余偏移量;Determining the second redundant offset for the regression of the corner point of the parking space;
    以所述中心位置为中心,以所述第二冗余偏移量为像素偏移半径,在所述样本图像上进行扩张,生成所述样本图像中的车位角点回归冗余区;Taking the center position as the center and taking the second redundant offset as the pixel offset radius, expanding on the sample image to generate a parking space corner regression redundant area in the sample image;
    基于所述样本车位所述角点的多个维度,对所述车位角点回归冗余区进行标注,作为车位角点回归标注图;Based on the plurality of dimensions of the corner points of the sample parking spaces, the regression redundant area of the corner points of the parking spaces is marked as a regression labeling map of the corner points of the parking spaces;
    对所述车位角点回归标注图进行处理,获得所述车位角点的回归真值热力图。The regression annotation map of the parking space corner is processed to obtain the regression true value heat map of the parking space corner.
  5. 根据权利要求1至4中任一项所述的方法,其中所述方法还包括:The method according to any one of claims 1 to 4, wherein said method further comprises:
    基于所述车位角点的回归真值热力图,获取多个维度下所述角点到所述车位角点回归 冗余区的角点回归值。Based on the regression true value thermodynamic map of the corner of the parking space, obtain the corner regression value from the corner to the redundant area of the corner of the parking space in multiple dimensions.
  6. 根据权利要求1至5中任一项所述的方法,其中,所述基于所述样本图像、所述类别真值热力图和所述回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型,包括:The method according to any one of claims 1 to 5, wherein, based on the sample image, the category true value heat map and the regression true value heat map, the parking space detection model is trained to generate a target Parking space detection model, including:
    将所述样本图像输入所述车位检测模型进行多尺度特征提取;Inputting the sample image into the parking space detection model for multi-scale feature extraction;
    基于提取的多尺度特征图分别进行中心位置预测、类别预测和回归预测,以获取预测中心位置、类别预测热力图、回归预测热力图;Based on the extracted multi-scale feature map, the center position prediction, category prediction and regression prediction are respectively performed to obtain the prediction center position, category prediction heat map, and regression prediction heat map;
    基于所述预测中心位置和所述中心位置、所述类别预测热力图和所述类别真值热力图、以及所述回归预测热力图和所述回归真值热力图,确定所述车位检测模型的损失函数;Based on the predicted center position and the center position, the category prediction heat map and the category true value heat map, and the regression prediction heat map and the regression true value heat map, determine the parking space detection model loss function;
    基于所述损失函数对所述车位检测模型进行调整并继续训练,直至训练结束,生成所述目标车位检测模型。Adjust the parking space detection model based on the loss function and continue training until the training ends to generate the target parking space detection model.
  7. 根据权利要求6所述的方法,其中,所述基于提取的多尺度特征图分别进行中心位置预测、类别预测和回归预测,以获取预测中心位置、类别预测热力图、回归预测热力图,包括:The method according to claim 6, wherein the center position prediction, category prediction and regression prediction are respectively performed based on the extracted multi-scale feature map, so as to obtain a prediction center position, a category prediction heat map, and a regression prediction heat map, including:
    对所述多尺度特征图中目标尺度的特征图进行融合,得到融合的特征图;Fusing the feature maps of the target scale in the multi-scale feature map to obtain a fused feature map;
    基于所述融合的特征图进行中心位置预测,得到所述预测中心位置;Predicting the central position based on the fused feature map to obtain the predicted central position;
    基于所述融合的特征图进行类别预测,得到所述类别预测热力图;performing category prediction based on the fused feature map to obtain the category prediction heat map;
    基于所述融合的特征图进行回归预测,得到所述回归预测热力图。Regression prediction is performed based on the fused feature map to obtain the regression prediction heat map.
  8. 一种车位检测方法,包括:A parking space detection method, comprising:
    获取待预测的目标图像,其中,所述目标图像中至少包括一个待测车位;Acquiring a target image to be predicted, wherein the target image includes at least one parking space to be tested;
    将所述目标图像输入目标车位检测模型中,以获取所述待测车位的目标中心位置、类别目标热力图和回归目标热力图,其中,所述类别目标热力图中包括用于车位类别识别的冗余区,所述回归目标热力图中包括用于车位角点回归的冗余区,所述目标车位检测模型由权利要求1-7中任一项所述的车位检测模型训练方法训练生成;The target image is input into the target parking space detection model to obtain the target center position of the parking space to be tested, the category target heat map and the regression target heat map, wherein the category target heat map includes A redundant area, the regression target heat map includes a redundant area for the regression of parking space corners, and the target parking space detection model is generated by training the parking space detection model training method described in any one of claims 1-7;
    基于所述用于车位类别识别的冗余区和所述用于车位角点回归的冗余区,确定所述待测车位的目标车位类别和目标车位角点。The target parking space category and the target parking space corner point of the parking space to be tested are determined based on the redundant area used for identifying the category of the parking space and the redundant area used for the regression of the corner point of the parking space.
  9. 根据权利要求8所述的方法,其中,所述获取所述待测车位的目标中心位置,包括:The method according to claim 8, wherein said obtaining the target center position of the parking space to be tested comprises:
    对所述目标车位检测模型输出的中心位置目标热力图进行处理;Processing the center position target heat map output by the target parking space detection model;
    将处理后的所述中心位置目标热力图中大于预设阈值的像素点作为所述待测车位的所述目标中心位置。A pixel point in the processed center position target heat map greater than a preset threshold is used as the target center position of the parking space to be tested.
  10. 根据权利要求8或9所述的方法,其中,所述确定所述待测车位的目标车位类别, 包括:The method according to claim 8 or 9, wherein said determining the target parking space category of said parking space to be tested comprises:
    对所述用于车位类别识别的冗余区内多个像素进行平均,获取目标像素,并基于所述目标像素,确定所述待测车位的所述目标车位类别;或者Averaging a plurality of pixels in the redundant area for identifying the parking space category to obtain a target pixel, and based on the target pixel, determining the target parking space category of the parking space to be tested; or
    确定所述用于车位类别识别的冗余区内多个像素对应的多个车位类别,并对所述多个车位类别进行投票,确定所述待测车位的所述目标车位类别。Determining a plurality of parking space categories corresponding to a plurality of pixels in the redundant area for identifying the parking space category, and voting on the plurality of parking space categories to determine the target parking space category of the parking space to be tested.
  11. 根据权利要求8至10中任一项所述的方法,其中,确定所述待测车位的目标车位角点,包括:The method according to any one of claims 8 to 10, wherein determining the target corner point of the parking space to be tested comprises:
    确定所述用于车位角点回归的冗余区内对应的多个角点回归值;Determining a plurality of corresponding corner regression values in the redundancy zone used for parking corner regression;
    基于所述多个角点回归值确定所述待测车位的任一所述角点对应的多个预测角点坐标;determining a plurality of predicted corner coordinates corresponding to any one of the corner points of the parking space to be tested based on the plurality of corner point regression values;
    对所述多个预测角点坐标进行平均,确定所述待测车位的任一所述角点对应的目标车位角点。The coordinates of the plurality of predicted corner points are averaged to determine the corner point of the target parking space corresponding to any one of the corner points of the parking space to be tested.
  12. 一种车位检测模型训练装置,包括:A parking space detection model training device, comprising:
    获取模块,用于获取样本车位的样本图像,并从所述样本图像中获取所述样本车位的中心位置;An acquisition module, configured to acquire a sample image of a sample parking space, and obtain the center position of the sample parking space from the sample image;
    确定模块,用于根据所述中心位置,确定所述样本图像中的车位识别冗余区和车位角点回归冗余区;A determining module, configured to determine a parking space recognition redundant area and a parking space corner regression redundant area in the sample image according to the center position;
    生成模块,用于根据所述车位识别冗余区,对所述样本图像进行标记,生成所述样本图像的类别真值热力图,以及根据所述车位角点回归冗余区,对所述样本图像进行标记,生成所述样本图像的车位角点的回归真值热力图;The generation module is used to mark the sample image according to the parking space identification redundant area, generate the category true value heat map of the sample image, and return the redundant area according to the parking space corner point to the sample image The image is marked, and the regression true value heat map of the parking space corner of the sample image is generated;
    训练模块,用于基于所述样本图像、所述类别真值热力图和所述回归真值热力图,对车位检测模型进行训练,生成目标车位检测模型。The training module is configured to train the parking space detection model based on the sample image, the category true value thermodynamic map and the regression true value thermodynamic map to generate a target parking space detection model.
  13. 一种车位检测装置,包括:A parking space detection device, comprising:
    第一获取模块,用于获取待预测的目标图像,其中,所述目标图像中至少包括一个待测车位;A first acquisition module, configured to acquire a target image to be predicted, wherein the target image includes at least one parking space to be tested;
    第二获取模块,用于将所述目标图像输入目标车位检测模型中,以获取所述待测车位的目标中心位置、类别目标热力图和回归目标热力图,其中,所述类别目标热力图中包括用于车位类别识别的冗余区,所述回归目标热力图中包括用于车位角点回归的冗余区,所述目标车位检测模型由权利要求12所述的车位检测模型训练装置训练生成;The second acquisition module is used to input the target image into the target parking space detection model, so as to obtain the target center position, category target heat map and regression target heat map of the parking space to be tested, wherein the category target heat map Including a redundant area for class recognition of parking spaces, the regression target heat map includes a redundant area for corner regression of parking spaces, and the target parking space detection model is generated by training the parking space detection model training device described in claim 12 ;
    确定模块,用于基于所述用于车位类别识别的冗余区和所述用于车位角点回归的冗余区,确定所述待测车位的目标车位类别和目标车位角点。The determination module is configured to determine the target parking space category and the target parking space corner of the parking space to be tested based on the redundant area used for identifying the parking space category and the redundant area used for the regression of the parking space corner point.
  14. 一种自动驾驶车辆,包括权利要求12或13所述的装置。A self-driving vehicle comprising the device according to claim 12 or 13.
  15. 一种电子设备,包括:An electronic device comprising:
    至少一个处理器;和at least one processor; and
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行权利要求1至7中任一项所述的车位检测模型训练方法或权利要求8至11中任一项所述的车位检测方法。The memory stores instructions executable by the at least one processor, the instructions are executed by the at least one processor, so that the at least one processor can perform any one of claims 1 to 7. The parking space detection model training method or the parking space detection method described in any one of claims 8 to 11.
  16. 一种存储有计算机指令的非瞬时计算机可读存储介质,其中,所述计算机指令用于使所述计算机执行根据权利要求1至7中任一项所述的车位检测模型训练方法或权利要求8至11中任一项所述的车位检测方法。A non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to enable the computer to execute the parking space detection model training method according to any one of claims 1 to 7 or claim 8 to the parking space detection method described in any one of 11.
  17. 一种计算机程序产品,包括计算机程序,其中所述计算机程序在被处理器执行时实现如权利要求1至7中任一项所述的车位检测模型训练方法或权利要求8至11中任一项所述的车位检测方法。A computer program product, comprising a computer program, wherein said computer program realizes the parking space detection model training method according to any one of claims 1 to 7 or any one of claims 8 to 11 when executed by a processor The parking space detection method.
  18. 一种计算机程序,所述计算机程序包括计算机程序代码,其中当所述计算机程序代码在计算机上运行时,使得计算机执行如权利要求1至7中任一项所述的车位检测模型训练方法或权利要求8至11中任一项所述的车位检测方法。A computer program, the computer program comprising computer program code, wherein when the computer program code is run on the computer, the computer is made to execute the parking space detection model training method or the right according to any one of claims 1 to 7 The parking space detection method described in any one of requirements 8 to 11.
PCT/CN2022/140134 2021-12-21 2022-12-19 Parking space detection model training method and apparatus, and parking space detection method and apparatus WO2023116641A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111571230.XA CN114445794A (en) 2021-12-21 2021-12-21 Parking space detection model training method, parking space detection method and device
CN202111571230.X 2021-12-21

Publications (1)

Publication Number Publication Date
WO2023116641A1 true WO2023116641A1 (en) 2023-06-29

Family

ID=81363845

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/140134 WO2023116641A1 (en) 2021-12-21 2022-12-19 Parking space detection model training method and apparatus, and parking space detection method and apparatus

Country Status (2)

Country Link
CN (1) CN114445794A (en)
WO (1) WO2023116641A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117437647A (en) * 2023-12-20 2024-01-23 吉林大学 Oracle character detection method based on deep learning and computer vision

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114445794A (en) * 2021-12-21 2022-05-06 北京罗克维尔斯科技有限公司 Parking space detection model training method, parking space detection method and device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180068564A1 (en) * 2016-09-05 2018-03-08 Panasonic Intellectual Property Corporation Of America Parking position identification method, parking position learning method, parking position identification system, parking position learning device, and non-transitory recording medium for recording program
CN110348297A (en) * 2019-05-31 2019-10-18 纵目科技(上海)股份有限公司 A kind of detection method, system, terminal and the storage medium of parking systems for identification
CN111191730A (en) * 2020-01-02 2020-05-22 中国航空工业集团公司西安航空计算技术研究所 Method and system for detecting oversized image target facing embedded deep learning
CN112101361A (en) * 2020-11-20 2020-12-18 深圳佑驾创新科技有限公司 Target detection method, device and equipment for fisheye image and storage medium
CN112329601A (en) * 2020-11-02 2021-02-05 东软睿驰汽车技术(沈阳)有限公司 Parking space detection method and device based on multi-task network
CN113095228A (en) * 2021-04-13 2021-07-09 地平线(上海)人工智能技术有限公司 Method and device for detecting target in image and computer readable storage medium
CN114445794A (en) * 2021-12-21 2022-05-06 北京罗克维尔斯科技有限公司 Parking space detection model training method, parking space detection method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180068564A1 (en) * 2016-09-05 2018-03-08 Panasonic Intellectual Property Corporation Of America Parking position identification method, parking position learning method, parking position identification system, parking position learning device, and non-transitory recording medium for recording program
CN110348297A (en) * 2019-05-31 2019-10-18 纵目科技(上海)股份有限公司 A kind of detection method, system, terminal and the storage medium of parking systems for identification
CN111191730A (en) * 2020-01-02 2020-05-22 中国航空工业集团公司西安航空计算技术研究所 Method and system for detecting oversized image target facing embedded deep learning
CN112329601A (en) * 2020-11-02 2021-02-05 东软睿驰汽车技术(沈阳)有限公司 Parking space detection method and device based on multi-task network
CN112101361A (en) * 2020-11-20 2020-12-18 深圳佑驾创新科技有限公司 Target detection method, device and equipment for fisheye image and storage medium
CN113095228A (en) * 2021-04-13 2021-07-09 地平线(上海)人工智能技术有限公司 Method and device for detecting target in image and computer readable storage medium
CN114445794A (en) * 2021-12-21 2022-05-06 北京罗克维尔斯科技有限公司 Parking space detection model training method, parking space detection method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117437647A (en) * 2023-12-20 2024-01-23 吉林大学 Oracle character detection method based on deep learning and computer vision
CN117437647B (en) * 2023-12-20 2024-03-26 吉林大学 Oracle character detection method based on deep learning and computer vision

Also Published As

Publication number Publication date
CN114445794A (en) 2022-05-06

Similar Documents

Publication Publication Date Title
CN110163930B (en) Lane line generation method, device, equipment, system and readable storage medium
WO2023116641A1 (en) Parking space detection model training method and apparatus, and parking space detection method and apparatus
US20220245952A1 (en) Parking spot detection method and parking spot detection system
WO2022110049A1 (en) Navigation method, apparatus, and system
CN110598512B (en) Parking space detection method and device
JP4985166B2 (en) Self-position estimation device
WO2022237272A1 (en) Road image marking method and device for lane line recognition
CN109426800B (en) Lane line detection method and device
US20130034305A1 (en) Image-based crack quantification
JP7456536B2 (en) Mobile object tracking system, mobile object tracking method and program
KR20160123668A (en) Device and method for recognition of obstacles and parking slots for unmanned autonomous parking
CN103770704A (en) System and method for recognizing parking space line markings for vehicle
EP2405393A1 (en) Device for creating information for positional estimation of matter, method for creating information for positional estimation of matter, and program
US9471982B2 (en) Information processing apparatus and information processing method for associating an image with related information
CN111169468A (en) Automatic parking system and method
CN114898314B (en) Method, device, equipment and storage medium for detecting target of driving scene
CN111967396A (en) Processing method, device and equipment for obstacle detection and storage medium
JP2009229226A (en) Object detection device and method for detecting object
CN113515981A (en) Identification method, device, equipment and storage medium
EP2738740A1 (en) Image processing device and image processing method
CN115366885A (en) Method for assisting a driving maneuver of a motor vehicle, assistance device and motor vehicle
CN113420704A (en) Object identification method and device based on visual sensor and robot
CN113450575B (en) Management method and device for roadside parking
KR102260556B1 (en) Deep learning-based parking slot detection method and apparatus integrating global and local information
CN113591720A (en) Lane departure detection method, apparatus and computer storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22909953

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE