[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN113822278A - License plate recognition method for unlimited scene - Google Patents

License plate recognition method for unlimited scene Download PDF

Info

Publication number
CN113822278A
CN113822278A CN202111384274.1A CN202111384274A CN113822278A CN 113822278 A CN113822278 A CN 113822278A CN 202111384274 A CN202111384274 A CN 202111384274A CN 113822278 A CN113822278 A CN 113822278A
Authority
CN
China
Prior art keywords
license plate
convolution
network
feature
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111384274.1A
Other languages
Chinese (zh)
Other versions
CN113822278B (en
Inventor
刘寒松
王永
王国强
刘瑞
曲妍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sonli Holdings Group Co Ltd
Original Assignee
Sonli Holdings Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sonli Holdings Group Co Ltd filed Critical Sonli Holdings Group Co Ltd
Priority to CN202111384274.1A priority Critical patent/CN113822278B/en
Publication of CN113822278A publication Critical patent/CN113822278A/en
Application granted granted Critical
Publication of CN113822278B publication Critical patent/CN113822278B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention belongs to the technical field of license plate recognition, and relates to a license plate recognition method in an unlimited scene.

Description

License plate recognition method for unlimited scene
Technical Field
The invention belongs to the technical field of license plate recognition, relates to a method for recognizing a license plate in an unlimited scene, and particularly relates to a method for recognizing a license plate in an unlimited scene based on depth feature alignment of improved deformation convolution.
Background
With the rapid development of technologies such as artificial intelligence, internet of things and 5G, intelligent traffic plays an important role in intelligent cities, and license plate detection and recognition technology plays an important role in an intelligent traffic system. The traditional Chinese license plate detection method has the defects that the detection precision is greatly influenced by the environment, stronger robustness is difficult to be shown when complex scenes such as license plate distortion, rotation and the like are faced, the phenomenon of low detection precision often occurs, and the application requirements can not be met far away.
Most of the early license plate recognition algorithms are researched based on a machine learning algorithm, and the license plate is positioned and recognized by using manually selected features. In recent years, with the arrival of a big data era and the improvement of computer computing power, deep learning makes a major breakthrough in the direction of license plate recognition, and the positioning and recognition of license plates are newly developed due to the proposal of deep learning algorithms such as Faster R-CNN, YOLO and the like. The existing license plate recognition technology is mainly applied to specific environments such as toll parking lot entrances and exits, highway ETC channels and the like, under the condition that the front-view detection visual angle and the detection area are fixed, the accuracy rate of the license plate recognition technology can reach a very high level, but the recognition effect is poor under a complex scene.
In the situation that the license plate detection in a complex scene may cause the rotation or distortion of the license plate due to an oblique view, the conventional method adopts a convolutional neural network based on deep learning to extract features, and the method is mainly divided into two types: (1) a license plate detection method based on a horizontal rectangular frame introduces a large amount of background information when detecting a tilted or malformed license plate, so that subsequent license plate identification and positioning are inaccurate. (2) The method based on affine transformation divides the license plate detection into two steps of detection and license plate correction, firstly, the horizontal frame of the license plate is detected, then the license plate image cut by the horizontal frame is subjected to affine parameter learning, and finally correction is carried out to correct the image. Therefore, aiming at the unconstrained scene, the technical problem of low detection precision exists in the existing license plate detection and identification technology, and a more effective method for carrying out feature alignment modeling to realize accurate and effective license plate identification is urgently needed.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a depth feature alignment unlimited scene license plate detection method based on improved deformation convolution, which is used for solving the problem of depth feature misalignment of an inclined license plate and a distorted license plate, can be used for a license plate detection and identification task of an unlimited scene, and can efficiently realize license plate detection and correction.
In order to achieve the above object, the convolution feature extracted through a backbone is an axis-aligned feature, then classification and regression position information of an anchor frame are respectively calculated by using two parallel branches, so as to obtain a candidate frame, the improved deformed convolution is used for aligning the convolution feature with a license plate feature according to the candidate frame self-adaptation, namely, a sampling point is deviated and is concentrated on the feature of a license plate area, and finally the aligned feature is used for positioning, wherein in order to align the convolution feature by the deformed convolution, an offset is calculated by using the position information of the candidate frame and the corresponding convolution feature, and then the offset and the axis-aligned feature are sent to the aligned convolution together, so as to extract the aligned feature, the method specifically comprises the following steps:
(1) and (3) data set construction: collecting images containing conventional, inclined and distorted license plates of scenes such as traffic monitoring, side parking lots and the like, constructing a data set not lower than 20000 license plates, marking the positions of four vertexes of the license plates, calculating the coordinates of a horizontal rectangular frame corresponding to the license plates according to the positions of the four vertexes, and dividing the data set into a training set (60%), a verification set (20%) and a test set (20%);
(2) deep convolution feature extraction: firstly, initializing the size and the numerical range of an image in the training set in the step (1), wherein the size of the image is 512 x 512, the numerical range is 0-1, inputting the processed image into a backbone network of a deep convolution network for convolution feature extraction, wherein the backbone network uses VGG16 as a feature extraction network, and a feature pyramid network is added after VGG16, and the feature pyramid network is used for strengthening and utilizing multi-scale features formed in VGG16 to obtain a multi-scale license plate convolution feature map set with stronger expressive force;
(3) high quality candidate box generation: the multi-scale license plate convolution feature map set obtained from the main network of the deep convolution network in the step (2) respectively uses two full-connection layer sub-networks with the same structure but without sharing parameters to learn classification and position information, so as to complete the tasks of classification and position regression of a target frame, wherein the classification is whether the license plate is the target or not, the position information is four vertex coordinates of the license plate, when the network is trained and tested in the subsequent steps, each feature point in the multi-scale license plate convolution feature map is only provided with one anchor frame for learning the position of the target, different thresholds are respectively set for classification scores in the training and testing processes to obtain 100 high-quality candidate frames, the classification score threshold is set to be 0.01 in the training process to achieve a better training effect, and the classification score threshold is set to be 0.1 in the testing process to achieve a faster reasoning speed;
(4) depth feature alignment: for each position on the feature map in the multi-scale license plate convolution feature map set output in the step (2)
Figure 122645DEST_PATH_IMAGE001
The operation of the deformable convolution is on a regular grid of conventional convolutionsR
Figure 379314DEST_PATH_IMAGE002
By adding an offset
Figure 484673DEST_PATH_IMAGE003
Is expanded, thus in position
Figure 168596DEST_PATH_IMAGE001
The calculation formula of (a) is as follows:
Figure 726616DEST_PATH_IMAGE004
wherein
Figure 551090DEST_PATH_IMAGE005
Is toRAn enumeration of the positions listed in (a),
Figure 448639DEST_PATH_IMAGE006
in order to be the weights of the convolution,
Figure 365780DEST_PATH_IMAGE007
as input features, here
Figure 614358DEST_PATH_IMAGE008
Is the offset obtained by convolutional layer operation;
the coordinates of the high-quality candidate frame obtained in the step (3) are recorded as Poly1
Figure 275147DEST_PATH_IMAGE009
WhereinPoly1Representing a location
Figure 27202DEST_PATH_IMAGE001
High quality candidate box of (1), utilizingPoly1The maximum value and the minimum value of the abscissa and the maximum value and the minimum value of the ordinate are used for obtaining a minimum external horizontal rectangular framePoly2[
Figure 380823DEST_PATH_IMAGE010
The high-quality candidate box represents the coordinates of the feature region of the license plate, and the minimum circumscribed horizontal rectangular box represents the feature region to be aligned; knowing the coordinates of the two regions by
Figure 851119DEST_PATH_IMAGE011
Their affine transformation matrices can be calculatedMThus for each feature point of the feature map
Figure 315598DEST_PATH_IMAGE012
Sample position based on anchor frame
Figure 922160DEST_PATH_IMAGE013
Expressed as:
Figure 446682DEST_PATH_IMAGE014
whereinkWhich represents the size of the convolution kernel,Srepresenting the step size of the feature map, with the modified deformable convolution at the location
Figure 404274DEST_PATH_IMAGE001
Amount of deviation of
Figure 406865DEST_PATH_IMAGE003
Expressed as:
Figure 133512DEST_PATH_IMAGE015
then the obtained offset
Figure 828936DEST_PATH_IMAGE003
Inputting the convolution characteristic graph obtained in the step (2) and the convolution characteristic graph into convolution, extracting alignment characteristics and forming improved deformable convolution; for each high quality candidate box, the sampling point is 9 points, an offset value of 18 dimensions is obtained, and the position is given by the method
Figure 506780DEST_PATH_IMAGE016
The axis-aligned convolution feature of (a) is converted into a convolution feature based on any direction and attitude of the corresponding candidate frame;
(5) and (3) fine license plate recognition of feature alignment: refining and positioning the license plate position again through the alignment features obtained in the step (4), inputting the alignment features into a convolution layer of 3 ✖ 3, and then accessing two branches for classification and regression, wherein the classification is to judge whether the vehicle is a vehicle or notThe card regression is the coordinates of four vertexes of the license plate
Figure 313062DEST_PATH_IMAGE017
Thus obtaining the accurate coordinate position of the license plate;
(6) and (3) correcting the position of the license plate: carrying out affine transformation on the license plate coordinate position obtained in the step (5) and the license plate coordinate with a preset size, and calculating an affine matrix through the affine transformation of the coordinate positions of the license plate coordinate position and the license plate coordinate with the preset size
Figure 159795DEST_PATH_IMAGE018
Then will be
Figure 760541DEST_PATH_IMAGE018
Acting on the license plate cut by coordinates from the original image to obtain a license plate image after recognition and correction;
(7) training a network structure to obtain trained model parameters: using images of the training set in the dataset, picture size 512
Figure 692725DEST_PATH_IMAGE019
512
Figure 37118DEST_PATH_IMAGE019
3, according to the batch size (B), inputting the batch size (B) into the deep convolution network in sequence, and inputting the whole network
Figure 738358DEST_PATH_IMAGE020
And using the IOU threshold value as a measurement standard of a sample distribution strategy to output the classification confidence of the license plate
Figure 510005DEST_PATH_IMAGE021
And the location of the regressive coordinate
Figure 929485DEST_PATH_IMAGE022
Wherein Class is 2, namely whether the license plate is detected, N is the number of output predicted license plate targets, and 8 is the horizontal and vertical coordinates of four vertexes of the license plate; predicting the category and the real category by Focal loss calculation to obtain an error, and obtaining the loss by using Smooth L1Calculating the error between the predicted license plate position and the real license plate position, updating parameters through back propagation, saving model parameters with the best result on a training set after training iteration of a complete training set for set times (50 times), and taking the model parameters as final model trained parameters to obtain trained license plate recognition network parameters for testing of a testing set;
(8) testing a network and correcting a license plate: testing the license plate recognition network parameters in the test set, scaling (resize) the long edge of the image to 512 under the condition of keeping the proportion of the long edge and the short edge of the image unchanged, and filling the short edge of the image to ensure that the image size is 512
Figure 77569DEST_PATH_IMAGE019
And 512, sequentially inputting the license plate classification confidence coefficient and the coordinate position of the license plate into a deep convolutional network, setting a threshold value to filter the license plate with low confidence coefficient, finally deleting redundant frames output by the network by using non-maximum suppression (NMS), and finally correcting the license plate by using the step (6).
The technology which is not disclosed in the invention adopts the prior art.
Compared with the prior art, the invention provides a depth feature alignment unlimited scene license plate recognition method based on improved deformation convolution, which uses a convolution neural network to generate a high-quality candidate frame, uses the improved deformation convolution to self-adaptively align the convolution features with the license plate features according to the candidate frame, is used for solving the problem that the depth features of an inclined license plate and a distorted license plate are not aligned, further obtains a corrected image by directly carrying out affine transformation on four vertex coordinates of the detected license plate, does not need to learn affine parameters, reduces the calculation consumption of feature repeated extraction, greatly improves the performance of the detection method based on the convolution neural network on a rotation and distortion target on the basis of increasing few operations, and is different from the common deformation convolution in that the offset of the improved deformation convolution is directly deduced from the candidate frame, the improved deformable convolution is added into the existing method, in CCPD license plate detection rotation (Rotate) data concentration, the recognition accuracy is improved from 94.7% to 98.2%, and meanwhile, a small amount of calculation is increased.
Drawings
FIG. 1 is a diagram of an improved deformable convolution module according to the present invention.
Fig. 2 is a diagram illustrating the overall network structure according to the present invention.
Fig. 3 is a flow chart of the license plate detection method provided by the invention.
FIG. 4 is a comparison of the license plate detection results provided by the present invention with other methods.
FIG. 5 is a comparison of another license plate detection result provided by the present invention with other methods.
Detailed Description
The invention will be further described by way of examples, without in any way limiting the scope of the invention, with reference to the accompanying drawings.
Example (b):
in this embodiment, a high-quality candidate frame is generated by using a convolutional neural network, and the convolution feature is adaptively aligned with the license plate feature according to the candidate frame by using an improved modified convolutional, so as to solve the problem of misalignment of the depth feature of an inclined or distorted license plate, as shown in fig. 1 to 3, the specific implementation includes the following steps:
(1) and (3) data set construction: collecting images containing conventional, inclined and distorted license plates of scenes such as traffic monitoring, side parking lots and the like, constructing a license plate data set containing 20000 images, labeling the positions of four vertexes of a license plate, calculating the coordinates of a horizontal rectangular frame corresponding to the license plate according to the positions of the four vertexes, and dividing the data set into a training set (60%), a verification set (20%) and a test set (20%);
(2) deep convolution feature extraction: firstly, initializing the size and the numerical range of an image in the training set in the step (1), wherein the size of the image is 512 x 512, the numerical range is 0-1, inputting the processed image into a backbone network of a deep convolution network for convolution feature extraction, wherein the backbone network uses VGG16 as a feature extraction network, and a feature pyramid network is added after VGG16, and the feature pyramid network is used for strengthening and utilizing multi-scale features formed in VGG16 to obtain a multi-scale license plate convolution feature map set with stronger expressive force;
(3) high quality candidate box generation: the multi-scale license plate convolution feature map set obtained from the main network of the deep convolution network in the step (2) respectively uses two full-connection layer sub-networks with the same structure but without sharing parameters to learn classification and position information, so as to complete the tasks of classification and position regression of a target frame, wherein the classification is whether the license plate is the target or not, the position information is four vertex coordinates of the license plate, each feature point in the feature map is only provided with one anchor frame for learning the position of the target in the training (step (7)) and the testing (step (8)), 100 high-quality candidate frames are obtained by respectively setting different threshold values for classification scores in the training and the testing, the classification score threshold value is set to be 0.01 in the training process to achieve a better training effect, and the classification score setting threshold value is set to be 0.1 in the testing process to achieve a faster reasoning speed;
(4) depth feature alignment: for each position on the output feature map (the feature map in the multi-scale license plate convolution feature map set output in the step (2))
Figure 633316DEST_PATH_IMAGE001
The operation of the deformable convolution is on a regular grid of conventional convolutionsR
Figure 841443DEST_PATH_IMAGE002
By adding an offset
Figure 482640DEST_PATH_IMAGE003
Is expanded, thus in position
Figure 168836DEST_PATH_IMAGE001
The calculation formula of (a) is as follows:
Figure 844668DEST_PATH_IMAGE004
wherein
Figure 958118DEST_PATH_IMAGE005
Is toRAn enumeration of the positions listed in (a),
Figure 608584DEST_PATH_IMAGE006
in order to be the weights of the convolution,
Figure 98471DEST_PATH_IMAGE007
as input features, here
Figure 628809DEST_PATH_IMAGE008
Is the offset obtained by the convolutional layer operation.
And (4) recording the coordinates of the high-quality candidate frame obtained in the step (3) asPoly1
Figure 178739DEST_PATH_IMAGE009
WhereinPoly1Representing a location
Figure 794528DEST_PATH_IMAGE001
High quality candidate box of (1), utilizingPoly1The maximum value and the minimum value of the abscissa and the maximum value and the minimum value of the ordinate are used for obtaining a minimum external horizontal rectangular framePoly2[
Figure 822527DEST_PATH_IMAGE010
The high-quality candidate box represents the coordinates of the feature region of the license plate, and the minimum circumscribed horizontal rectangular box represents the feature region to be aligned; knowing the coordinates of the two regions by
Figure 207372DEST_PATH_IMAGE011
Their affine transformation matrices can be calculatedMThus for each feature of the feature mapDot
Figure 928203DEST_PATH_IMAGE012
Sample position based on anchor frame
Figure 31289DEST_PATH_IMAGE013
Expressed as:
Figure 862978DEST_PATH_IMAGE014
whereinkWhich represents the size of the convolution kernel,Srepresenting the step size of the feature map, with the modified deformable convolution at the location
Figure 102330DEST_PATH_IMAGE001
Amount of deviation of
Figure 994063DEST_PATH_IMAGE003
Expressed as:
Figure 584444DEST_PATH_IMAGE015
then the obtained offset
Figure 954245DEST_PATH_IMAGE003
Inputting the convolution characteristic graph obtained in the step (2) and the convolution characteristic graph into convolution, extracting alignment characteristics and forming improved deformable convolution; for each high quality candidate box, the sampling point is 9 points, an offset value of 18 dimensions is obtained, and the position is given by the method
Figure 313682DEST_PATH_IMAGE001
The axis-aligned convolution feature of (a) is converted into a convolution feature based on any direction and attitude of the corresponding candidate frame;
(5) and (3) fine license plate recognition of feature alignment: refining and positioning the license plate position again through the alignment features obtained in the step (4), inputting the alignment features into a convolution layer of 3 ✖ 3, and then accessing two branches for classification and regression, wherein the branchesThe class is to judge whether the license plate is the license plate or not, and the regression is the coordinates of four vertexes of the license plate
Figure 376316DEST_PATH_IMAGE017
Thus obtaining the accurate coordinate position of the license plate;
(6) and (3) correcting the position of the license plate: carrying out affine transformation on the license plate coordinate position obtained in the step (5) and the license plate coordinate with a preset size, and calculating an affine matrix through the affine transformation of the coordinate positions of the license plate coordinate position and the license plate coordinate with the preset size
Figure 686950DEST_PATH_IMAGE018
Then will be
Figure 126021DEST_PATH_IMAGE018
Acting on the license plate cut by coordinates from the original image to obtain a license plate image after recognition and correction;
(7) training a network structure to obtain trained model parameters: using images of the training set in the dataset, picture size 512
Figure 339965DEST_PATH_IMAGE019
512
Figure 573500DEST_PATH_IMAGE019
3, according to the batch size (B), sequentially inputting the batch size (B) into the deep convolutional network (step (2) -step (5)), and inputting the whole network
Figure 872895DEST_PATH_IMAGE020
And using the IOU threshold value as a measurement standard of a sample distribution strategy to output the classification confidence of the license plate
Figure 584499DEST_PATH_IMAGE021
And the location of the regressive coordinate
Figure 918528DEST_PATH_IMAGE022
Wherein Class is 2, namely whether the license plate is detected, N is the number of output predicted license plate targets, and 8 is the horizontal and vertical coordinates of four vertexes of the license plate; using the Focal loss calculationMeasuring the category and the real category to obtain an error, calculating the error between the predicted license plate position and the real license plate position by adopting Smooth L1 loss, updating parameters through back propagation, saving model parameters with the best result on a training set after training iteration of a complete training set for set times (50 times), and taking the model parameters as final model trained parameters to obtain trained license plate recognition network parameters for testing the testing set;
(8) testing a network and correcting a license plate: testing the license plate recognition network parameters in the test set, scaling (resize) the long edge of the image to 512 under the condition of keeping the proportion of the long edge and the short edge of the image unchanged, and filling the short edge of the image to ensure that the image size is 512
Figure 322965DEST_PATH_IMAGE019
And 512, sequentially inputting the license plate classification confidence coefficient and the coordinate position of the license plate into a deep convolutional network, setting a threshold value to filter the license plate with low confidence coefficient, finally deleting redundant frames output by the network by using non-maximum suppression (NMS), and finally correcting the license plate by using the step (6).
In the embodiment, the license plate recognition result is compared with the license plate recognition result based on the horizontal frame detection method in the prior art, different license plate recognition results are shown in fig. 4 and fig. 5, wherein the left side image is the recognition result in the prior art, the right side image is the recognition result of the method in the embodiment, and finding out the recognition result can find out the result, the method for detecting the license plate in the unlimited scene based on the depth feature alignment of the improved deformed convolution generates a high-quality candidate frame by using the convolutional neural network, aligns the convolution feature with the license plate feature in a self-adaptive manner according to the candidate frame by using the improved deformed convolution, solves the problem that the depth feature of the inclined or distorted license plate is not aligned, and can efficiently realize the license plate detection and correction.
The technologies not disclosed in this embodiment are all the prior art.
It is noted that the disclosed embodiments are intended to aid in further understanding of the invention, but those skilled in the art will appreciate that: various substitutions and modifications are possible without departing from the spirit and scope of the invention and appended claims. Therefore, the invention should not be limited to the embodiments disclosed, but the scope of the invention is defined by the appended claims.

Claims (2)

1. A license plate recognition method without a limited scene is characterized by comprising the following steps:
1) and (3) data set construction: collecting images containing conventional, inclined and distorted license plates of a parking lot with traffic monitoring and side positions, constructing a data set, marking the positions of four vertexes of the license plate, calculating the coordinates of a horizontal rectangular frame corresponding to the license plate according to the positions of the four vertexes, and dividing the data set into a training set, a verification set and a test set;
2) deep convolution feature extraction: firstly, initializing the size and numerical range of an image in a training set in the step 1), wherein the size of the image is 512 x 512, the numerical range is 0-1, inputting the processed image into a backbone network of a deep convolution network for convolution feature extraction, wherein the backbone network uses VGG16 as a feature extraction network, and a feature pyramid network is added after VGG16, and the feature pyramid network is used for strengthening and utilizing multi-scale features formed in VGG16 to obtain a multi-scale license plate convolution feature map set with stronger expressive force;
3) high quality candidate box generation: respectively learning classification and position information by using two full-connection layer sub-networks with the same structure but without shared parameters from a multi-scale license plate convolution feature map set obtained from the backbone network of the deep convolution network in the step 2), thereby completing tasks of classification and position regression of a target frame, wherein the classification is whether the license plate is present, and the position information is four vertex coordinates of the license plate;
4) depth feature alignment: for each position on the characteristic diagram in the multi-scale license plate convolution characteristic diagram set obtained in the step 2)
Figure DEST_PATH_IMAGE001
The operation of the deformable convolution is in the regular net of the conventional convolutionGrid (C)R
Figure 966174DEST_PATH_IMAGE002
By adding an offset
Figure DEST_PATH_IMAGE003
Is expanded, thus in position
Figure 282886DEST_PATH_IMAGE001
The calculation formula of (a) is as follows:
Figure DEST_PATH_IMAGE005
wherein
Figure 801592DEST_PATH_IMAGE006
Is toRAn enumeration of the positions listed in (a),
Figure DEST_PATH_IMAGE007
in order to be the weights of the convolution,
Figure 229162DEST_PATH_IMAGE008
as input features, here
Figure DEST_PATH_IMAGE009
Is the offset obtained by convolutional layer operation;
the coordinates of the high-quality candidate box obtained in the step 3) are recorded as Poly1
Figure 149714DEST_PATH_IMAGE010
WhereinPoly1Representing a location
Figure 168485DEST_PATH_IMAGE001
High quality candidate box of (1), utilizingPoly1The maximum value and the minimum value of the abscissa and the maximum value and the minimum value of the ordinate are used for obtaining a minimum external horizontal rectangular framePoly2[
Figure DEST_PATH_IMAGE011
The high-quality candidate box represents the coordinates of the feature region of the license plate, and the minimum circumscribed horizontal rectangular box represents the feature region to be aligned; knowing the coordinates of the two regions by
Figure 678882DEST_PATH_IMAGE012
Their affine transformation matrices can be calculatedMThus for each feature point of the feature map
Figure DEST_PATH_IMAGE013
Sample position based on anchor frame
Figure 910143DEST_PATH_IMAGE014
Expressed as:
Figure 826147DEST_PATH_IMAGE016
whereinkWhich represents the size of the convolution kernel,Srepresenting the step size of the feature map, with the modified deformable convolution at the location
Figure 140453DEST_PATH_IMAGE001
Amount of deviation of
Figure 774697DEST_PATH_IMAGE003
Expressed as:
Figure 75228DEST_PATH_IMAGE018
then the obtained offset
Figure 845738DEST_PATH_IMAGE003
Inputting the feature map and the convolution feature map obtained in the step 2) into convolution, extracting alignment features,forming an improved deformable convolution; for each high quality candidate box, the sampling point is 9 points, an offset value of 18 dimensions is obtained, and the position is given by the method
Figure DEST_PATH_IMAGE019
The axis-aligned convolution feature of (a) is converted into a convolution feature based on any direction and attitude of the corresponding candidate frame;
5) and (3) fine license plate recognition of feature alignment: refining and positioning the license plate position again through the alignment features obtained in the step 4), inputting the alignment features into a convolution layer of 3 ✖ 3, and then accessing two branches for classification and regression, wherein the classification is to judge whether the license plate is the license plate, and the regression is the coordinate of four vertexes of the license plate
Figure 799788DEST_PATH_IMAGE020
Thus obtaining the accurate coordinate position of the license plate;
6) and (3) correcting the position of the license plate: carrying out affine transformation on the license plate coordinate position obtained in the step 5) and the license plate coordinate with the preset size, and calculating an affine matrix through the affine transformation of the coordinate positions of the license plate coordinate position and the license plate coordinate with the preset size
Figure DEST_PATH_IMAGE021
Then will be
Figure 780382DEST_PATH_IMAGE021
Acting on the license plate cut by coordinates from the original image to obtain a license plate image after recognition and correction;
7) training a network structure to obtain trained model parameters: using images of the training set in the dataset, picture size 512
Figure 619025DEST_PATH_IMAGE022
512
Figure 509621DEST_PATH_IMAGE022
3, according to the batch size (B), inputting the batch size (B) into the deep convolution network in sequence, and outputting the whole networkInto
Figure DEST_PATH_IMAGE023
And using the IOU threshold value as a measurement standard of a sample distribution strategy to output the classification confidence of the license plate
Figure 136036DEST_PATH_IMAGE024
And the location of the regressive coordinate
Figure DEST_PATH_IMAGE025
Wherein Class is 2, namely whether the license plate is detected, N is the number of output predicted license plate targets, and 8 is the horizontal and vertical coordinates of four vertexes of the license plate; calculating a prediction type and a real type by adopting Focal loss to obtain errors, calculating the errors of the predicted license plate position and the real license plate position by adopting Smooth L1 loss, updating parameters through back propagation, saving model parameters with the best results on a training set after setting 50 times of training iterations of the complete training set, and taking the model parameters as final model trained parameters to obtain trained license plate recognition network parameters for testing the testing set;
8) testing a network and correcting a license plate: testing the trained license plate recognition network parameters in a test set, scaling the long edge of the image to 512 under the condition of keeping the proportion of the long edge and the short edge of the image unchanged, and filling the short edge of the image to ensure that the image has the size of 512
Figure 213714DEST_PATH_IMAGE022
And 512, sequentially inputting the license plate classification confidence coefficient and the coordinate position of the license plate into a deep convolutional network, setting a threshold value to filter the license plate with low confidence coefficient, finally using a frame which is not greatly inhibited and deletes the redundancy output by the network, and finally using the step 6) to correct the license plate.
2. The unlimited scene license plate recognition method of claim 1, wherein the data set in step 1) is not less than 20000 license plate images, wherein the training set accounts for 60%, the verification set accounts for 20%, and the test set accounts for 20%.
CN202111384274.1A 2021-11-22 2021-11-22 License plate recognition method for unlimited scene Active CN113822278B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111384274.1A CN113822278B (en) 2021-11-22 2021-11-22 License plate recognition method for unlimited scene

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111384274.1A CN113822278B (en) 2021-11-22 2021-11-22 License plate recognition method for unlimited scene

Publications (2)

Publication Number Publication Date
CN113822278A true CN113822278A (en) 2021-12-21
CN113822278B CN113822278B (en) 2022-02-11

Family

ID=78918005

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111384274.1A Active CN113822278B (en) 2021-11-22 2021-11-22 License plate recognition method for unlimited scene

Country Status (1)

Country Link
CN (1) CN113822278B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114677502A (en) * 2022-05-30 2022-06-28 松立控股集团股份有限公司 License plate detection method with any inclination angle
CN114677501A (en) * 2022-05-30 2022-06-28 松立控股集团股份有限公司 License plate detection method based on two-dimensional Gaussian bounding box overlapping degree measurement
CN114882490A (en) * 2022-07-08 2022-08-09 松立控股集团股份有限公司 Unlimited scene license plate detection and classification method based on point-guided positioning
CN115050028A (en) * 2022-06-15 2022-09-13 松立控股集团股份有限公司 Sample vehicle license plate detection method in severe weather
CN115063786A (en) * 2022-08-18 2022-09-16 松立控股集团股份有限公司 High-order distant view fuzzy license plate detection method
CN115346206A (en) * 2022-10-20 2022-11-15 松立控股集团股份有限公司 License plate detection method based on improved super-resolution deep convolution feature recognition
CN117333799A (en) * 2023-10-07 2024-01-02 淮阴工学院 Middle and primary school classroom behavior detection method and device based on deformable anchor frame

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110443258A (en) * 2019-07-08 2019-11-12 北京三快在线科技有限公司 Character detecting method, device, electronic equipment and storage medium
CN112132258A (en) * 2020-08-26 2020-12-25 中国海洋大学 Multi-task learning model construction and optimization method based on deformable convolution
CN112200192A (en) * 2020-12-03 2021-01-08 南京风兴科技有限公司 License plate recognition method and device
CN112241936A (en) * 2019-07-18 2021-01-19 杭州海康威视数字技术股份有限公司 Image processing method, device and equipment and storage medium
US20210056351A1 (en) * 2018-06-04 2021-02-25 Jiangnan University Multi-scale aware pedestrian detection method based on improved full convolutional network
CN112749644A (en) * 2020-12-30 2021-05-04 大连海事大学 Improved deformable convolution-based Faster RCNN fire smoke detection method
CN112966631A (en) * 2021-03-19 2021-06-15 浪潮云信息技术股份公司 License plate detection and identification system and method under unlimited security scene
CN113052170A (en) * 2021-03-22 2021-06-29 江苏东大金智信息系统有限公司 Small target license plate recognition method under unconstrained scene
CN113111727A (en) * 2021-03-19 2021-07-13 西北工业大学 Method for detecting rotating target in remote sensing scene based on feature alignment
CN113139539A (en) * 2021-03-16 2021-07-20 中国科学院信息工程研究所 Method and device for detecting characters of arbitrary-shaped scene with asymptotic regression boundary
US20210319231A1 (en) * 2020-10-23 2021-10-14 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and Apparatus for Detecting Target Object, Electronic Device and Storage Medium

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210056351A1 (en) * 2018-06-04 2021-02-25 Jiangnan University Multi-scale aware pedestrian detection method based on improved full convolutional network
CN110443258A (en) * 2019-07-08 2019-11-12 北京三快在线科技有限公司 Character detecting method, device, electronic equipment and storage medium
CN112241936A (en) * 2019-07-18 2021-01-19 杭州海康威视数字技术股份有限公司 Image processing method, device and equipment and storage medium
CN112132258A (en) * 2020-08-26 2020-12-25 中国海洋大学 Multi-task learning model construction and optimization method based on deformable convolution
US20210319231A1 (en) * 2020-10-23 2021-10-14 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and Apparatus for Detecting Target Object, Electronic Device and Storage Medium
CN112200192A (en) * 2020-12-03 2021-01-08 南京风兴科技有限公司 License plate recognition method and device
CN112749644A (en) * 2020-12-30 2021-05-04 大连海事大学 Improved deformable convolution-based Faster RCNN fire smoke detection method
CN113139539A (en) * 2021-03-16 2021-07-20 中国科学院信息工程研究所 Method and device for detecting characters of arbitrary-shaped scene with asymptotic regression boundary
CN112966631A (en) * 2021-03-19 2021-06-15 浪潮云信息技术股份公司 License plate detection and identification system and method under unlimited security scene
CN113111727A (en) * 2021-03-19 2021-07-13 西北工业大学 Method for detecting rotating target in remote sensing scene based on feature alignment
CN113052170A (en) * 2021-03-22 2021-06-29 江苏东大金智信息系统有限公司 Small target license plate recognition method under unconstrained scene

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
ANNA ZHU等: ""Scene Text Detection with Selected Anchors"", 《2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR)》 *
JIFENG DAI 等: ""Deformable convolutional networks"", 《PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION》 *
LANG REN等: ""A Transformer-Based Decoupled Attention Network for Text Recognition in Shopping Receipt Images"", 《NCAA 2021》 *
易鑫: ""基于深度学习的不规则文本识别算法研究"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
王雪娇: ""基于可变形卷积网络及推理网络的人体动作识别"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
邓霖杰: ""自然场景下端对端文本识别研究"", 《中国博士学位论文全文数据库 信息科技辑》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114677502A (en) * 2022-05-30 2022-06-28 松立控股集团股份有限公司 License plate detection method with any inclination angle
CN114677501A (en) * 2022-05-30 2022-06-28 松立控股集团股份有限公司 License plate detection method based on two-dimensional Gaussian bounding box overlapping degree measurement
CN115050028A (en) * 2022-06-15 2022-09-13 松立控股集团股份有限公司 Sample vehicle license plate detection method in severe weather
CN115050028B (en) * 2022-06-15 2024-03-29 松立控股集团股份有限公司 Small sample license plate detection method in severe weather
CN114882490A (en) * 2022-07-08 2022-08-09 松立控股集团股份有限公司 Unlimited scene license plate detection and classification method based on point-guided positioning
CN114882490B (en) * 2022-07-08 2022-09-20 松立控股集团股份有限公司 Unlimited scene license plate detection and classification method based on point-guided positioning
CN115063786A (en) * 2022-08-18 2022-09-16 松立控股集团股份有限公司 High-order distant view fuzzy license plate detection method
CN115346206A (en) * 2022-10-20 2022-11-15 松立控股集团股份有限公司 License plate detection method based on improved super-resolution deep convolution feature recognition
CN117333799A (en) * 2023-10-07 2024-01-02 淮阴工学院 Middle and primary school classroom behavior detection method and device based on deformable anchor frame
CN117333799B (en) * 2023-10-07 2024-08-06 淮阴工学院 Middle and primary school classroom behavior detection method and device based on deformable anchor frame

Also Published As

Publication number Publication date
CN113822278B (en) 2022-02-11

Similar Documents

Publication Publication Date Title
CN113822278B (en) License plate recognition method for unlimited scene
WO2024077812A1 (en) Single building three-dimensional reconstruction method based on point cloud semantic segmentation and structure fitting
CN111178213B (en) Aerial photography vehicle detection method based on deep learning
CN108427924B (en) Text regression detection method based on rotation sensitive characteristics
CN111126359B (en) High-definition image small target detection method based on self-encoder and YOLO algorithm
CN110929577A (en) Improved target identification method based on YOLOv3 lightweight framework
CN111882620B (en) Road drivable area segmentation method based on multi-scale information
CN108961235A (en) A kind of disordered insulator recognition methods based on YOLOv3 network and particle filter algorithm
CN113486886B (en) License plate recognition method and device in natural scene
CN111833237B (en) Image registration method based on convolutional neural network and local homography transformation
CN111582339B (en) Vehicle detection and recognition method based on deep learning
CN114519819B (en) Remote sensing image target detection method based on global context awareness
CN111242026B (en) Remote sensing image target detection method based on spatial hierarchy perception module and metric learning
CN113420619A (en) Remote sensing image building extraction method
CN109858327B (en) Character segmentation method based on deep learning
CN111046856A (en) Parallel pose tracking and map creating method based on dynamic and static feature extraction
CN115063786A (en) High-order distant view fuzzy license plate detection method
CN112329559A (en) Method for detecting homestead target based on deep convolutional neural network
CN117152625B (en) CoordConv and YOLOv 5-based remote sensing small target identification method, system, equipment and medium
CN111461113A (en) Large-angle license plate detection method based on deformed plane object detection network
CN114677501A (en) License plate detection method based on two-dimensional Gaussian bounding box overlapping degree measurement
CN114898327A (en) Vehicle detection method based on lightweight deep learning network
CN103353941A (en) Natural marker registration method based on viewpoint classification
CN110909623A (en) Three-dimensional target detection method and three-dimensional target detector
CN112784494B (en) Training method of false positive recognition model, target recognition method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant