CN114387571A

CN114387571A - Hierarchical matching-based method and device for unsupervised vehicle re-identification

Info

Publication number: CN114387571A
Application number: CN202111550495.1A
Authority: CN
Inventors: 李熙莹; 钟诚; 王宇航; 黄国煜; 徐勇志
Original assignee: Sun Yat Sen University
Current assignee: Sun Yat Sen University
Priority date: 2021-12-17
Filing date: 2021-12-17
Publication date: 2022-04-22
Anticipated expiration: 2041-12-17

Abstract

The invention discloses an unsupervised vehicle re-identification method and a device based on hierarchical matching, wherein the method comprises the following steps: acquiring a picture of a vehicle to be detected from a video source; extracting overall characteristics, vehicle local component information, vehicle body color information and vehicle type information of a vehicle from a picture of the vehicle to be detected; carrying out first processing on the overall characteristics of the vehicle to obtain the overall similarity of the vehicle; performing second processing on the vehicle local component information to obtain a vehicle local characteristic distance value; performing third processing on the vehicle body color information and the vehicle type information to obtain a color characteristic vector and a vehicle type characteristic vector; carrying out feature fusion on the overall vehicle similarity, the local vehicle feature distance value, the color feature vector and the vehicle type feature vector to obtain the vehicle similarity; and determining a target vehicle corresponding to the vehicle to be detected from all vehicles in the candidate vehicle set according to the vehicle similarity. The invention improves the accuracy of vehicle weight recognition and can be widely applied to the technical field of vehicle recognition.

Description

Hierarchical matching-based method and device for unsupervised vehicle re-identification

技术领域technical field

本发明涉及车辆识别技术领域，尤其是基于层次匹配的无监督车辆重识别方法及装置。The invention relates to the technical field of vehicle identification, in particular to an unsupervised vehicle re-identification method and device based on hierarchical matching.

背景技术Background technique

车辆重识别问题是指在特定的交通监控场景下，判断安装在不同区域的摄像头拍摄的图像中车辆是否属于同一辆车的检索问题。车辆重识别算法能够应用在车辆识别、追踪和定位等众多难点问题上，进一步提升监控系统的安全性和可靠性。Vehicle re-identification problem refers to the retrieval problem of judging whether the vehicles in the images captured by cameras installed in different areas belong to the same vehicle in a specific traffic monitoring scenario. The vehicle re-identification algorithm can be applied to many difficult problems such as vehicle identification, tracking and positioning, and further improves the security and reliability of the monitoring system.

由于交通场景复杂、摄像机视点有限、光照变化等约束的影响，车辆重识别仍是个极具挑战性的问题。目前，车牌识别技术是判断车辆身份最简单最直接的方法，但仅仅依靠此方法仍然存在明显的弊端。例如，交通监控由于摄像头位置的限制导致其拍摄视角相对固定，在拥堵排队、大型车遮挡或者驾驶员故意遮挡伪造号牌等复杂场景中，车牌检测的难度增加或不再适用。为了弥补车牌识别的缺点，越来越多的研究学者开始使用车辆除号牌以外的视觉信息来进行车辆检索任务。在过去视觉领域的研究中，车辆检索主要涉及车辆颜色或者型号等内容信息的检索识别，但这常常做不到精确地检索出特定车辆，而是得到与目标车辆具有相同内容(如颜色、型号)的车辆。近几年，车辆检索的子领域——车辆重识别的研究逐渐成为了研究重点。现有的车辆重识别技术的缺点主要体现在两个方面：一是在不同的摄像头视角下，相同的车辆呈现出来的特征差异较大，从而致使车辆重识别任务的数据的类内差异较大；二是具有相似全局特征(如颜色、车型等)的不同车辆在多视角下的差异被进一步缩小，从而致使车辆重识别任务的类间差异较小。Vehicle re-identification is still an extremely challenging problem due to constraints such as complex traffic scenes, limited camera viewpoints, and illumination changes. At present, license plate recognition technology is the simplest and most direct method to judge the identity of a vehicle, but there are still obvious drawbacks only relying on this method. For example, traffic monitoring has a relatively fixed viewing angle due to the limitation of the camera position. In complex scenes such as congestion queuing, blocking by large vehicles, or drivers deliberately blocking fake license plates, the difficulty of license plate detection increases or is no longer applicable. In order to make up for the shortcomings of license plate recognition, more and more researchers have begun to use the visual information of vehicles other than the number plate for vehicle retrieval tasks. In the past research in the field of vision, vehicle retrieval mainly involves the retrieval and identification of content information such as vehicle color or model, but this often fails to accurately retrieve a specific vehicle, but obtains the same content as the target vehicle (such as color, model )Vehicles. In recent years, the research on vehicle re-identification, a sub-field of vehicle retrieval, has gradually become the focus of research. The shortcomings of the existing vehicle re-identification technology are mainly reflected in two aspects: First, under different camera perspectives, the characteristics of the same vehicle are quite different, resulting in large intra-class differences in the data of the vehicle re-identification task. The second is that the differences between different vehicles with similar global features (such as color, model, etc.) under multi-view are further reduced, resulting in smaller inter-class differences in the vehicle re-identification task.

发明内容SUMMARY OF THE INVENTION

有鉴于此，本发明实施例提供一种准确度高的基于层次匹配的无监督车辆重识别方法及装置。In view of this, embodiments of the present invention provide a method and device for unsupervised vehicle re-identification based on hierarchical matching with high accuracy.

本发明的第一方面提供了基于层次匹配的无监督车辆重识别方法，包括：A first aspect of the present invention provides an unsupervised vehicle re-identification method based on hierarchical matching, including:

从视频源中获取待检测车辆图片；Obtain the image of the vehicle to be detected from the video source;

从所述待检测车辆图片中提取车辆整体特征、车辆局部构件信息、车身颜色信息和车型信息；Extracting overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the image of the vehicle to be detected;

对所述车辆整体特征进行第一处理，得到车辆整体相似度；Perform a first process on the overall characteristics of the vehicle to obtain the overall similarity of the vehicle;

对所述车辆局部构件信息进行第二处理，得到车辆局部特征距离值；performing a second process on the vehicle local component information to obtain a vehicle local feature distance value;

对所述车身颜色信息和所述车型信息进行第三处理，得到颜色特征向量和车型特征向量；performing a third process on the vehicle body color information and the vehicle model information to obtain a color feature vector and a vehicle model feature vector;

对所述车辆整体相似度、所述车辆局部特征距离值、所述颜色特征向量和所述车型特征向量进行特征融合，得到车辆相似度；Performing feature fusion on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain the vehicle similarity;

根据所述车辆相似度，从候选车辆集的所有车辆中确定与待检测车辆对应的目标车辆。According to the vehicle similarity, a target vehicle corresponding to the vehicle to be detected is determined from all vehicles in the candidate vehicle set.

可选地，所述从视频源中获取待检测车辆图片，包括：Optionally, the obtaining a picture of the vehicle to be detected from a video source includes:

按照预设的时间间隔，从所述视频源中获取图像帧；Acquiring image frames from the video source according to preset time intervals;

对提取到的图像帧进行目标检测，找到所述图像帧中的车辆目标；Perform target detection on the extracted image frame to find the vehicle target in the image frame;

将存在车辆目标的图像帧截取得到车辆集合，并将所述车辆集划分为查找图片集和候选图片集。A vehicle set is obtained by intercepting an image frame with a vehicle target, and the vehicle set is divided into a search picture set and a candidate picture set.

可选地，所述从视频源中获取待检测车辆图片这一步骤，还包括：Optionally, the step of acquiring the picture of the vehicle to be detected from the video source further includes:

计算任一当前图像帧的检测框和对应的上一图像帧的检测框之间的交并比；Calculate the intersection ratio between the detection frame of any current image frame and the detection frame of the corresponding previous image frame;

当所述交并比大于预设阈值时，将所述当前图像帧对应的检测结果剔除；When the intersection ratio is greater than a preset threshold, remove the detection result corresponding to the current image frame;

其中，所述交并比的计算公式为：Wherein, the calculation formula of the intersection ratio is:

其中，IoU代表交并比；R₁为上一图像帧的检测框；R₂为当前图像帧的检测框；area(R₁)∩area(R₂)求两个检测框之间的区域交集；area(R₁)∪area(R₂)求两个检测框的区域并集。Among them, IoU represents the intersection ratio; R ₁ is the detection frame of the previous image frame; R ₂ is the detection frame of the current image frame; area(R ₁ )∩area(R ₂ ) finds the area intersection between the two detection frames ;area(R ₁ )∪area(R ₂ ) finds the area union of the two detection boxes.

可选地，所述从所述待检测车辆图片中提取车辆整体特征、车辆局部构件信息、车身颜色信息和车型信息，包括：Optionally, the extraction of overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the picture of the vehicle to be detected includes:

通过ResNet50特征提取器将车辆分为车辆整体特征和车辆局部构件信息；The vehicle is divided into overall vehicle features and vehicle local component information through the ResNet50 feature extractor;

通过全局平均池化对待检测车辆图片进行处理，得到车辆整体特征；Through global average pooling, the image of the vehicle to be detected is processed to obtain the overall characteristics of the vehicle;

通过CNN网络输出所述车辆整体特征；Output the overall characteristics of the vehicle through the CNN network;

对车辆各个视角的特征进行比较，获取车辆各个视角下的视角可视性分数；Compare the features of each view angle of the vehicle to obtain the view angle visibility score under each view angle of the vehicle;

计算所述视角可视性分数的权重后，结合欧氏距离计算方法计算得到局部特征距离值，并确定车辆局部构件信息；After calculating the weight of the visual angle visibility score, the local feature distance value is obtained by combining with the Euclidean distance calculation method, and the local component information of the vehicle is determined;

通过Resnet50网络进行特征提取得到所有车辆的颜色特征向量和车型特征向量，确定所述车身颜色信息和所述车型信息。The color feature vector and the model feature vector of all vehicles are obtained by feature extraction through the Resnet50 network, and the vehicle body color information and the model information are determined.

可选地，所述从所述待检测车辆图片中提取车辆整体特征、车辆局部构件信息、车身颜色信息和车型信息，还包括：Optionally, the extraction of overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the picture of the vehicle to be detected further includes:

将所述待检测车辆图片中的车辆分为正面、顶面、背面和侧面四个方向；Divide the vehicles in the picture of the vehicle to be detected into four directions: front, top, back and side;

计算车辆的四个方向的面积值作为视角可视度分数，并将所述视角可视度分数作为车辆对应的局部特征的置信值，所述置信值为车辆四个局部特征的置信值，用于表征后续计算车辆间每个部件的特征距离占局部特征距离的权重。Calculate the area values of the four directions of the vehicle as the view angle visibility score, and use the view angle visibility score as the confidence value of the local features corresponding to the vehicle, and the confidence value is the confidence value of the four local features of the vehicle, using It is used to characterize the weight of the feature distance of each component between the vehicles in the subsequent calculation of the local feature distance.

可选地，所述计算所述视角可视性分数的权重的计算公式为：Optionally, the calculation formula for calculating the weight of the visual angle visibility score is:

其中，

代表所述视角可视性分数的权重；

代表待检测车辆的可视性分数；

代表图像库中对应的车辆的可视性分数；m,n代表待检测车辆与图像库中对应的车辆；i代表可视性分数和权重对应的区域；N代表待搜索的图像库中车辆数；in,

a weight representing the visibility score of said view;

represents the visibility score of the vehicle to be detected;

represents the visibility score of the corresponding vehicle in the image database; m, n represent the vehicle to be detected and the corresponding vehicle in the image database; i represents the area corresponding to the visibility score and weight; N represents the number of vehicles in the image database to be searched ;

所述局部特征距离值的计算公式为：The calculation formula of the local feature distance value is:

其中，D^m,n代表所述局部特征距离值；D代表车辆各个方向特征的欧氏距离；f_i ^m代表待检测车辆的特征信息；f_i ⁿ代表图像库中对应的车辆的特征信息。Among them, D ^m,n represents the local feature distance value; D represents the Euclidean distance of the vehicle in all directions; f _i ^m represents the feature information of the vehicle to be detected; f _i ⁿ represents the feature information of the corresponding vehicle in the image library.

可选地，所述对所述车辆整体相似度、所述车辆局部特征距离值、所述颜色特征向量和所述车型特征向量进行特征融合，得到车辆相似度，包括：Optionally, performing feature fusion on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain the vehicle similarity, including:

对所述车辆整体相似度、所述车辆局部特征距离值、所述颜色特征向量和所述车型特征向量进行L2范数规范化处理，得到规范化后的特征；Performing L2 norm normalization processing on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain normalized features;

根据所述规范化后的特征，计算待检测车辆与图像库中对应的车辆之间的初始重识别距离值；According to the normalized features, calculate the initial re-identification distance value between the vehicle to be detected and the corresponding vehicle in the image library;

对所述车辆整体相似度的初始重识别距离值、所述车辆局部特征距离值的初始重识别距离值、所述颜色特征向量的初始重识别距离值和所述车型特征向量的初始重识别距离值进行加权求和计算，得到总重识别距离值；The initial re-recognition distance value of the overall similarity of the vehicle, the initial re-recognition distance value of the vehicle local feature distance value, the initial re-recognition distance value of the color feature vector and the initial re-recognition distance of the vehicle type feature vector The value is weighted and summed to obtain the total weight identification distance value;

根据所述总重识别距离值确定车辆相似度。The vehicle similarity is determined according to the total weight identification distance value.

本发明实施例的另一方面还提供了一种基于层次匹配的无监督车辆重识别装置，包括：Another aspect of the embodiments of the present invention also provides an apparatus for unsupervised vehicle re-identification based on hierarchical matching, including:

第一模块，用于从视频源中获取待检测车辆图片；The first module is used to obtain the picture of the vehicle to be detected from the video source;

第二模块，用于从所述待检测车辆图片中提取车辆整体特征、车辆局部构件信息、车身颜色信息和车型信息；The second module is used for extracting overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the picture of the vehicle to be detected;

第三模块，用于对所述车辆整体特征进行第一处理，得到车辆整体相似度；The third module is used to perform the first processing on the overall characteristics of the vehicle to obtain the overall similarity of the vehicle;

第四模块，用于对所述车辆局部构件信息进行第二处理，得到车辆局部特征距离值；a fourth module, configured to perform second processing on the vehicle local component information to obtain a vehicle local characteristic distance value;

第五模块，用于对所述车身颜色信息和所述车型信息进行第三处理，得到颜色特征向量和车型特征向量；a fifth module, configured to perform third processing on the vehicle body color information and the vehicle model information to obtain a color feature vector and a vehicle model feature vector;

第六模块，用于对所述车辆整体相似度、所述车辆局部特征距离值、所述颜色特征向量和所述车型特征向量进行特征融合，得到车辆相似度；The sixth module is used to perform feature fusion on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain the vehicle similarity;

第七模块，用于根据所述车辆相似度，从候选车辆集的所有车辆中确定与待检测车辆对应的目标车辆。The seventh module is configured to determine, according to the vehicle similarity, a target vehicle corresponding to the vehicle to be detected from all vehicles in the candidate vehicle set.

本发明实施例的另一方面还提供了一种电子设备，包括处理器以及存储器；Another aspect of the embodiments of the present invention further provides an electronic device, including a processor and a memory;

所述存储器用于存储程序；the memory is used to store programs;

所述处理器执行所述程序实现如前面所述的方法。The processor executes the program to implement the method as described above.

本发明实施例的另一方面还提供了一种计算机可读存储介质，所述存储介质存储有程序，所述程序被处理器执行实现如前面所述的方法。Another aspect of the embodiments of the present invention further provides a computer-readable storage medium, where the storage medium stores a program, and the program is executed by a processor to implement the aforementioned method.

本发明实施例还公开了一种计算机程序产品或计算机程序，该计算机程序产品或计算机程序包括计算机指令，该计算机指令存储在计算机可读存储介质中。计算机设备的处理器可以从计算机可读存储介质读取该计算机指令，处理器执行该计算机指令，使得该计算机设备执行前面的方法。The embodiment of the present invention also discloses a computer program product or computer program, where the computer program product or computer program includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. The computer instructions can be read from the computer-readable storage medium by a processor of the computer device, and the processor executes the computer instructions to cause the computer device to perform the foregoing method.

本发明的实施例从视频源中获取待检测车辆图片；从所述待检测车辆图片中提取车辆整体特征、车辆局部构件信息、车身颜色信息和车型信息；对所述车辆整体特征进行第一处理，得到车辆整体相似度；对所述车辆局部构件信息进行第二处理，得到车辆局部特征距离值；对所述车身颜色信息和所述车型信息进行第三处理，得到颜色特征向量和车型特征向量；对所述车辆整体相似度、所述车辆局部特征距离值、所述颜色特征向量和所述车型特征向量进行特征融合，得到车辆相似度；根据所述车辆相似度，从候选车辆集的所有车辆中确定与待检测车辆对应的目标车辆。本发明提高了车辆重识别的准确度。The embodiment of the present invention obtains a picture of a vehicle to be detected from a video source; extracts overall vehicle features, vehicle local component information, body color information and vehicle type information from the picture of the vehicle to be detected; and performs first processing on the overall vehicle characteristics , obtain the overall similarity of the vehicle; perform the second processing on the local component information of the vehicle to obtain the local characteristic distance value of the vehicle; perform the third processing on the body color information and the vehicle type information to obtain the color feature vector and the vehicle type feature vector. ; Perform feature fusion on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain the vehicle similarity; In the vehicle, a target vehicle corresponding to the vehicle to be detected is determined. The invention improves the accuracy of vehicle re-identification.

附图说明Description of drawings

为了更清楚地说明本申请实施例中的技术方案，下面将对实施例描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本申请的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative effort.

图1为本发明实施例提供的整体步骤流程图。FIG. 1 is a flowchart of an overall step provided by an embodiment of the present invention.

具体实施方式Detailed ways

为了使本申请的目的、技术方案及优点更加清楚明白，以下结合附图及实施例，对本申请进行进一步详细说明。应当理解，此处所描述的具体实施例仅仅用以解释本申请，并不用于限定本申请。In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the present application will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.

针对现有技术存在的问题，本发明实施例提供了基于层次匹配的无监督车辆重识别方法，如图1所示，本发明的方法包括：In view of the problems existing in the prior art, an embodiment of the present invention provides an unsupervised vehicle re-identification method based on hierarchical matching. As shown in FIG. 1 , the method of the present invention includes:

其中，IoU代表交并比；R₁为上一图像帧的检测框；R₂为当前图像帧的检测框；area(R₁)∩area(R₂)求两个检测框之间的区域交集；area(R₁)∪area(R₂)求两个检测框的区域并集。Among them, IoU represents the intersection ratio; R ₁ is the detection frame of the previous image frame; R ₂ is the detection frame of the current image frame; area(R ₁ )∩area(R ₂ ) finds the area intersection between the two detection frames ; area(R ₁ )∪area(R ₂ ) finds the area union of the two detection boxes.

其中，

代表所述视角可视性分数的权重；

代表待检测车辆的可视性分数；

a weight representing the visibility score of said view;

represents the visibility score of the vehicle to be detected;

所述存储器用于存储程序；the memory is used to store programs;

下面结合说明书附图，对本发明的具体实现原理进行详细说明：The specific implementation principle of the present invention will be described in detail below in conjunction with the accompanying drawings:

针对现有技术存在的问题，本发明提供了一种基于多模型融合的车辆重识别方法，包括从视频车辆检测、提取、比对完整流程。首先通过YOLO检测算法从视频中检测提取车辆并保存，当车辆牌照清晰且能被识别时将优先使用车牌比对，剩余车牌清晰度低的车辆图片将经过后续重识别流程。重识别部分将车辆分为车辆整体特征、车辆局部构件、车身颜色、车型四个部分比对。车辆整体特征输入网络后利用CNN输出整体相似度；车辆局部构件特征通过引入视角可视度分数实现不同视角下的车辆特征比较，视角可视性分数将车辆分为正面、背面、侧面、顶部四个方向，统计车辆在这四个方向的面积并将其作为权重加权到之后的各部件欧氏距离值得到车辆局部特征距离值。车身颜色、车型则通过ResNet50网络进行分类得到车辆的颜色特征向量和车型特征向量S。最后通过对上述四个车辆特征融合得到最终的车辆相似度。旨在通过全局特征和车辆解析后的特征相融合，扩大数据的类间差异并缩小类内差异，从而达到提高车辆重识别准确度的效果。最后对目标车辆与候选车辆集中所有车辆的相似度进行排序，同时利用摄像头空间、车辆出现时间信息，筛选出相似度最高的前5(或10)辆车作为输出。In view of the problems existing in the prior art, the present invention provides a vehicle re-identification method based on multi-model fusion, including a complete process of vehicle detection, extraction and comparison from video. First, the vehicle is detected and extracted from the video by the YOLO detection algorithm and saved. When the vehicle license plate is clear and can be recognized, the license plate will be used first for comparison, and the remaining vehicle images with low license plate definition will go through the subsequent re-identification process. In the re-identification part, the vehicle is divided into four parts: the overall characteristics of the vehicle, the local components of the vehicle, the color of the body, and the model of the vehicle. After inputting the overall features of the vehicle into the network, the CNN is used to output the overall similarity; the local component features of the vehicle are used to compare the vehicle features under different perspectives by introducing the perspective visibility score. The perspective visibility score divides the vehicle into four categories: front, back, side, and top. In each direction, count the area of the vehicle in these four directions and use it as a weight to weight the Euclidean distance value of each component to obtain the local characteristic distance value of the vehicle. The body color and model are classified through the ResNet50 network to obtain the color feature vector of the vehicle and the model feature vector S. Finally, the final vehicle similarity is obtained by fusing the above four vehicle features. The purpose is to expand the inter-class difference of data and reduce the intra-class difference through the fusion of global features and vehicle parsed features, so as to achieve the effect of improving the accuracy of vehicle re-identification. Finally, the similarity between the target vehicle and all the vehicles in the candidate vehicle set is sorted, and the top 5 (or 10) vehicles with the highest similarity are screened out using the camera space and vehicle appearance time information as the output.

本发明实现了一个基于多特征融合的车辆重识别方法，通过对视频车辆进行检测提取，通过车辆颜色、车型分类，不同视角车辆构件特征比对，车辆出现位置比对，之后利用加权特征值，对整个车辆相似度值进行排序，选取相似度最高的车辆作为输出车辆。The invention realizes a vehicle re-identification method based on multi-feature fusion. By detecting and extracting video vehicles, classifying vehicle colors and vehicle types, comparing vehicle component features from different viewing angles, and comparing vehicle appearance positions, and then using weighted feature values, Sort the entire vehicle similarity value, and select the vehicle with the highest similarity as the output vehicle.

车辆检测和车牌识别部分：Vehicle detection and license plate recognition section:

车辆检测部分首先对给定的视频按一定的间隔取帧，通过对帧图像进行目标检测以判断场景中是否出现了车辆(包括汽车car、公共巴士bus和卡车truck三类)，检测结果与设定的取帧间隔、视频帧率以及运动物体的速度均有关。目标检测采用的是YOLO v4算法，它将目标检测转换为一个回归问题，在模型中分类和边框回归同时进行，输入图像经过主干网络(Backbone)CSPDarknet-53、颈部(Neck)金字塔池化(SPP)和路径聚合网络(PAN)以及头部(Head)YOLO v3来提取有效特征，再映射到一个张量，从而得到目标检测结果。方法对视频帧图像进行输入，然后检测图像中是否存在车辆目标。The vehicle detection part firstly takes the frame of the given video at a certain interval, and determines whether there is a vehicle (including three types of car, public bus and truck) in the scene by performing target detection on the frame image. The fixed frame interval, video frame rate and the speed of moving objects are all related. The target detection uses the YOLO v4 algorithm, which converts the target detection into a regression problem. In the model, classification and border regression are performed at the same time. The input image is passed through the backbone network (Backbone) CSPDarknet-53, neck (Neck) Pyramid pooling ( SPP) and Path Aggregation Network (PAN) and Head (Head) YOLO v3 to extract effective features, and then map to a tensor to get the target detection result. The method takes video frame images as input, and then detects whether there is a vehicle target in the image.

对于输入视频中可能存在的静止车辆或低速车辆重复检测问题，对当前检测框和上一次检测框计算其交并比(Intersection over Union，IoU)，剔除IoU大于0.5的检测结果，从而改善重复检测问题。For the problem of repeated detection of stationary vehicles or low-speed vehicles that may exist in the input video, calculate the Intersection over Union (IoU) between the current detection frame and the previous detection frame, and eliminate the detection results with IoU greater than 0.5, thereby improving the repeated detection. question.

其中，R₁为上一视频帧的检测框，R₂为当前视频帧的检测框；area(R₁)∩area(R₂)求两个检测框的区域交集，area(R₁)∪area(R₂)求两个检测框的区域并集。Among them, R ₁ is the detection frame of the previous video frame, R ₂ is the detection frame of the current video frame; area(R ₁ )∩area(R ₂ ) finds the area intersection of the two detection frames, area(R ₁ )∪area (R ₂ ) Find the area union of the two detection boxes.

在完成车辆检测后，将帧图像中的车辆检测部分截取并保存成车辆集，后续可根据用户需求划分为查找图片集(query set)和候选图片集(gallery set)，用于车辆重识别部分。After the vehicle detection is completed, the vehicle detection part in the frame image is intercepted and saved as a vehicle set, which can be divided into a query set (query set) and a candidate image set (gallery set) according to user needs, which are used for the vehicle re-identification part. .

车辆检测获得的车辆集除了直接用于车辆重识别部分，还用于车牌识别以作为车辆重识别的辅助参考信息。对于车牌的识别部分，使用车牌识别网络(License PlateRecognition Net，LPRNet)，输入车辆图片，通过由SqueezeNet Fire模块和Inception模块搭建的CNN模型对车牌部分进行识别。然后将车牌识别结果保存成csv结果文件，用于重识别过程中车牌信息比对。The vehicle set obtained by vehicle detection is not only directly used in the vehicle re-identification part, but also used for license plate recognition as auxiliary reference information for vehicle re-identification. For the recognition part of the license plate, the license plate recognition network (License Plate Recognition Net, LPRNet) is used to input the vehicle picture, and the license plate part is recognized by the CNN model built by the SqueezeNet Fire module and the Inception module. Then, the license plate recognition result is saved as a csv result file, which is used for the license plate information comparison in the re-recognition process.

在车辆重识别比对部分：先将车辆分为三个主要部分预测。在车辆构件特征比较部分利用视角分类网络将车辆分为正面、顶面、背面、侧面四个方向，统计车辆这四个方向的面积值作为视角可视度分数(vis_scores)，将该分数视为车辆对应局部特征的置信值。In the vehicle re-identification comparison part: first, the vehicle is divided into three main parts for prediction. In the vehicle component feature comparison part, the view classification network is used to divide the vehicle into four directions: front, top, back, and side, and the area values of the four directions of the vehicle are counted as the view visibility score (vis_scores), and the score is regarded as The confidence value of the vehicle corresponding to the local feature.

首先通过ResNet50特征提取器将车辆分为全局特征和车辆构件(局部)特征两部分。通过全局平均池化用于特征图得到车辆全局特征，通过CNN网络输出全局特征，在匹配相同视角车辆时，此全局特征可带来更高的检测精度。First, the vehicle is divided into two parts: global features and vehicle component (local) features by the ResNet50 feature extractor. The global average pooling is used for the feature map to obtain the global features of the vehicle, and the global features are output through the CNN network, which can bring higher detection accuracy when matching vehicles with the same viewing angle.

对于车辆各个视角的特征比较，在获得车辆各个视角的视角可视性分数后，可在此基础上将不同视角的车辆特征信息解耦为对应的局部特征实现各个视角下的特征比较。由于此时可视性分数为区域面积，因此需要再将其进行求权重：For the feature comparison of various perspectives of the vehicle, after obtaining the perspective visibility scores of the various perspectives of the vehicle, the vehicle feature information of different perspectives can be decoupled into corresponding local features to realize the feature comparison under each perspective. Since the visibility score is the area area at this time, it needs to be weighted again:

为权重，

和

为对应两辆车的可视性分数，i为可视性分数和权重对应的区域，m,n为对应待搜索车辆及图像库对应比对车辆。之后计算车辆各个方向特征的欧氏距离D并将可视性分数权重与其相乘得到最终的局部特征距离值：

is the weight,

and

is the visibility score corresponding to the two vehicles, i is the area corresponding to the visibility score and weight, m, n are the corresponding vehicle to be searched and the image library corresponding to the comparison vehicle. Then calculate the Euclidean distance D of the features in each direction of the vehicle and multiply the visibility score weight with it to get the final local feature distance value:

f_i ^m和f_i ⁿ为对比的两辆车对应特征，

为上述计算得到的权重。f _i ^m and f _i ⁿ are the corresponding features of the two vehicles for comparison,

is the weight calculated above.

上述两个部分对车辆整体特征以及局部特征求相似度，以下则通过车辆的车型、颜色特征分类进一步提高重识别准确度。车型和颜色均通过Resnet50网络进行特征提取得到所有车辆的颜色特征向量和车型特征向量S。The above two parts seek the similarity of the overall features and local features of the vehicle, and the following will further improve the re-identification accuracy by classifying the vehicle type and color features. The model and color are extracted through the Resnet50 network to obtain the color feature vector and model feature vector S of all vehicles.

此时，本实施例已经得到四个车辆特征值即车辆全局特征向量、车辆局部特征向量、车型特征向量、车身颜色特征向量，各特征向量通过L2范数规范化后进行query车辆与gallery车辆间的距离计算，其中局部距离和全局距离的计算方法如前面公式(1)和(2)所示，颜色特征距离和车型特征距离通过计算特征间欧氏距离得到，最后通过线性组合将其加权得到总体车辆重识别距离值D。At this point, four vehicle feature values have been obtained in this embodiment, namely the vehicle global feature vector, the vehicle local feature vector, the vehicle type feature vector, and the vehicle body color feature vector. Distance calculation, in which the calculation methods of local distance and global distance are shown in the previous formulas (1) and (2), the color feature distance and the model feature distance are obtained by calculating the Euclidean distance between the features, and finally weighted by linear combination to obtain the overall Vehicle re-identification distance value D.

其中，F′_i表示规范化后的特征，F_i为网络输出的特征向量。Among them, F′ _i represents the normalized feature, and F _i is the feature vector output by the network.

其中，F^m,Fⁿ代表不同车辆的特征向量，N代表特征向量的维度。Among them, F ^m , F ⁿ represent the eigenvectors of different vehicles, and N represents the dimension of the eigenvectors.

D＝mD_global+βD_local+γD_color+λD_type#(5)D=mD _global +βD _local +γD _color +λD _type #(5)

综上所述，本发明首先利用车辆识别技术对照片或者视频流中的车辆进行选取，再对所选的车辆照片分三部分进行预测。在车辆构件特征中利用视角分类网络对车辆的四个视角进行打分，作为车辆对应局部特征的置信值；再通过特征提取器将车辆分为全局特征和局部特征两部分，通过全局平均池化用于特征图得到车辆全局特征；最后通过车辆车型及颜色特征进行识别。最后将这四种特征计算特征距离，线性组合加权得到最后的总距离值，然后将得分最高的作为车辆重识别的结果输出，实现从视频到车辆比对的完整重识别流程，具有较高的实用性与可行性，最终检测准确度较高。To sum up, the present invention firstly selects the vehicle in the photo or video stream by using the vehicle identification technology, and then predicts the selected vehicle photo in three parts. In the feature of the vehicle components, the perspective classification network is used to score the four perspectives of the vehicle as the confidence value of the corresponding local features of the vehicle; then the vehicle is divided into two parts, the global feature and the local feature, by the feature extractor. The global features of the vehicle are obtained from the feature map; finally, the vehicle type and color features are used for identification. Finally, the feature distances of these four features are calculated, and the linear combination is weighted to obtain the final total distance value, and then the highest score is output as the result of vehicle re-identification to realize the complete re-identification process from video to vehicle comparison. Practicality and feasibility, and the final detection accuracy is high.

在一些可选择的实施例中，在方框图中提到的功能/操作可以不按照操作示图提到的顺序发生。例如，取决于所涉及的功能/操作，连续示出的两个方框实际上可以被大体上同时地执行或所述方框有时能以相反顺序被执行。此外，在本发明的流程图中所呈现和描述的实施例以示例的方式被提供，目的在于提供对技术更全面的理解。所公开的方法不限于本文所呈现的操作和逻辑流程。可选择的实施例是可预期的，其中各种操作的顺序被改变以及其中被描述为较大操作的一部分的子操作被独立地执行。In some alternative implementations, the functions/operations noted in the block diagrams may occur out of the order noted in the operational diagrams. For example, two blocks shown in succession may, in fact, be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/operations involved. Furthermore, the embodiments presented and described in the flowcharts of the present invention are provided by way of example in order to provide a more comprehensive understanding of the technology. The disclosed methods are not limited to the operations and logic flows presented herein. Alternative embodiments are contemplated in which the order of the various operations are altered and in which sub-operations described as part of larger operations are performed independently.

此外，虽然在功能性模块的背景下描述了本发明，但应当理解的是，除非另有相反说明，所述的功能和/或特征中的一个或多个可以被集成在单个物理装置和/或软件模块中，或者一个或多个功能和/或特征可以在单独的物理装置或软件模块中被实现。还可以理解的是，有关每个模块的实际实现的详细讨论对于理解本发明是不必要的。更确切地说，考虑到在本文中公开的装置中各种功能模块的属性、功能和内部关系的情况下，在工程师的常规技术内将会了解该模块的实际实现。因此，本领域技术人员运用普通技术就能够在无需过度试验的情况下实现在权利要求书中所阐明的本发明。还可以理解的是，所公开的特定概念仅仅是说明性的，并不意在限制本发明的范围，本发明的范围由所附权利要求书及其等同方案的全部范围来决定。Furthermore, while the invention is described in the context of functional modules, it is to be understood that, unless stated to the contrary, one or more of the described functions and/or features may be integrated in a single physical device and/or or software modules, or one or more functions and/or features may be implemented in separate physical devices or software modules. It will also be appreciated that a detailed discussion of the actual implementation of each module is not necessary to understand the present invention. Rather, given the attributes, functions, and internal relationships of the various functional modules in the apparatus disclosed herein, the actual implementation of the modules will be within the routine skill of the engineer. Accordingly, those skilled in the art, using ordinary skill, can implement the invention as set forth in the claims without undue experimentation. It is also to be understood that the specific concepts disclosed are illustrative only and are not intended to limit the scope of the invention, which is to be determined by the appended claims along with their full scope of equivalents.

所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时，可以存储在一个计算机可读取存储介质中。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括：U盘、移动硬盘、只读存储器(ROM，Read-Only Memory)、随机存取存储器(RAM，Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。The functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .

在流程图中表示或在此以其他方式描述的逻辑和/或步骤，例如，可以被认为是用于实现逻辑功能的可执行指令的定序列表，可以具体实现在任何计算机可读介质中，以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用，或结合这些指令执行系统、装置或设备而使用。就本说明书而言，“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。The logic and/or steps represented in flowcharts or otherwise described herein, for example, may be considered an ordered listing of executable instructions for implementing the logical functions, may be embodied in any computer-readable medium, For use with, or in conjunction with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from and execute instructions from an instruction execution system, apparatus, or apparatus) or equipment. For the purposes of this specification, a "computer-readable medium" can be any device that can contain, store, communicate, propagate, or transport the program for use by or in connection with an instruction execution system, apparatus, or apparatus.

计算机可读介质的更具体的示例(非穷尽性列表)包括以下：具有一个或多个布线的电连接部(电子装置)，便携式计算机盘盒(磁装置)，随机存取存储器(RAM)，只读存储器(ROM)，可擦除可编辑只读存储器(EPROM或闪速存储器)，光纤装置，以及便携式光盘只读存储器(CDROM)。另外，计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质，因为可以例如通过对纸或其他介质进行光学扫描，接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序，然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wiring (electronic devices), portable computer disk cartridges (magnetic devices), random access memory (RAM), Read Only Memory (ROM), Erasable Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program may be printed, as the paper or other medium may be optically scanned, for example, followed by editing, interpretation, or other suitable medium as necessary process to obtain the program electronically and then store it in computer memory.

应当理解，本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中，多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如，如果用硬件来实现，和在另一实施方式中一样，可用本领域公知的下列技术中的任一项或他们的组合来实现：具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路，具有合适的组合逻辑门电路的专用集成电路，可编程门阵列(PGA)，现场可编程门阵列(FPGA)等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above-described embodiments, various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following techniques known in the art: Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, Programmable Gate Arrays (PGA), Field Programmable Gate Arrays (FPGA), etc.

在本说明书的描述中，参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不一定指的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, description with reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples", etc., mean specific features described in connection with the embodiment or example , structure, material or feature is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

尽管已经示出和描述了本发明的实施例，本领域的普通技术人员可以理解：在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型，本发明的范围由权利要求及其等同物限定。Although embodiments of the present invention have been shown and described, it will be understood by those of ordinary skill in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, The scope of the invention is defined by the claims and their equivalents.

以上是对本发明的较佳实施进行了具体说明，但本发明并不限于所述实施例，熟悉本领域的技术人员在不违背本发明精神的前提下还可做出种种的等同变形或替换，这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the present invention is not limited to the described embodiments, those skilled in the art can also make various equivalent deformations or replacements under the premise of not violating the spirit of the present invention, These equivalent modifications or substitutions are all included within the scope defined by the claims of the present application.

Claims

1. An unsupervised vehicle re-identification method based on hierarchical matching, characterized in that it comprises:

Obtain the image of the vehicle to be detected from the video source;

Extracting overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the image of the vehicle to be detected;

Perform a first process on the overall characteristics of the vehicle to obtain the overall similarity of the vehicle;

performing a second process on the vehicle local component information to obtain a vehicle local feature distance value;

performing a third process on the vehicle body color information and the vehicle model information to obtain a color feature vector and a vehicle model feature vector;

Performing feature fusion on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain the vehicle similarity;

According to the vehicle similarity, a target vehicle corresponding to the vehicle to be detected is determined from all vehicles in the candidate vehicle set.

2. The unsupervised vehicle re-identification method based on hierarchical matching according to claim 1, wherein the obtaining a picture of a vehicle to be detected from a video source comprises:

Acquiring image frames from the video source according to preset time intervals;

Perform target detection on the extracted image frame to find the vehicle target in the image frame;

A vehicle set is obtained by intercepting an image frame with a vehicle target, and the vehicle set is divided into a search picture set and a candidate picture set.

3. The unsupervised vehicle re-identification method based on hierarchical matching according to claim 2, wherein the step of obtaining a picture of the vehicle to be detected from the video source further comprises:

Calculate the intersection ratio between the detection frame of any current image frame and the detection frame of the corresponding previous image frame;

When the intersection ratio is greater than a preset threshold, remove the detection result corresponding to the current image frame;

Wherein, the calculation formula of the intersection ratio is:

Among them, IoU represents the intersection ratio; R ₁ is the detection frame of the previous image frame; R ₂ is the detection frame of the current image frame; area(R ₁ )∩area(R ₂ ) finds the area intersection between the two detection frames ; area(R ₁ )∪area(R ₂ ) finds the area union of the two detection boxes.

4 . The method for unsupervised vehicle re-identification based on hierarchical matching according to claim 1 , wherein the extraction of overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the picture of the vehicle to be detected is performed. 5 . ,include:

The vehicle is divided into overall vehicle features and vehicle local component information through the ResNet50 feature extractor;

Through global average pooling, the image of the vehicle to be detected is processed to obtain the overall characteristics of the vehicle;

Output the overall characteristics of the vehicle through the CNN network;

Compare the features of each view angle of the vehicle to obtain the view angle visibility score under each view angle of the vehicle;

After calculating the weight of the visual angle visibility score, the local feature distance value is obtained by combining with the Euclidean distance calculation method, and the local component information of the vehicle is determined;

The color feature vector and the model feature vector of all vehicles are obtained by feature extraction through the Resnet50 network, and the vehicle body color information and the model information are determined.

5 . The method for unsupervised vehicle re-identification based on hierarchical matching according to claim 4 , wherein the extraction of overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the picture of the vehicle to be detected is performed. 6 . ,Also includes:

Divide the vehicles in the picture of the vehicle to be detected into four directions: front, top, back and side;

Calculate the area value of the four directions of the vehicle as the view angle visibility score, and use the view angle visibility score as the confidence value of the local features corresponding to the vehicle, and the confidence value is the confidence value of the four local features of the vehicle, using It is used to characterize the weight of the feature distance of each component between the vehicles in the subsequent calculation of the local feature distance.

6. The unsupervised vehicle re-identification method based on hierarchical matching according to claim 4, wherein,

The calculation formula for calculating the weight of the viewing angle visibility score is:

in,

a weight representing the visibility score of said view;

represents the visibility score of the vehicle to be detected;

The calculation formula of the local feature distance value is:

Among them, D ^m,n represents the local feature distance value; D represents the Euclidean distance of the vehicle in all directions; f _i ^m represents the feature information of the vehicle to be detected; f _i ⁿ represents the feature information of the corresponding vehicle in the image library.

7 . The method for unsupervised vehicle re-identification based on hierarchical matching according to claim 1 , wherein the overall similarity of the vehicle, the distance value of the local feature of the vehicle, the color feature vector and the The vehicle model feature vector is used for feature fusion to obtain vehicle similarity, including:

L2 norm normalization processing is performed on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain normalized features;

According to the normalized features, calculate the initial re-identification distance value between the vehicle to be detected and the corresponding vehicle in the image library;

The initial re-recognition distance value of the overall similarity of the vehicle, the initial re-recognition distance value of the vehicle local feature distance value, the initial re-recognition distance value of the color feature vector and the initial re-recognition distance of the vehicle type feature vector The value is weighted and summed to obtain the total weight identification distance value;

The vehicle similarity is determined according to the total weight identification distance value.

8. An unsupervised vehicle re-identification device based on hierarchical matching, characterized in that it comprises:

The first module is used to obtain the picture of the vehicle to be detected from the video source;

The second module is used for extracting overall vehicle features, vehicle local component information, vehicle body color information and vehicle type information from the picture of the vehicle to be detected;

The third module is used to perform the first processing on the overall characteristics of the vehicle to obtain the overall similarity of the vehicle;

a fourth module, configured to perform second processing on the vehicle local component information to obtain a vehicle local characteristic distance value;

a fifth module, configured to perform third processing on the vehicle body color information and the vehicle model information to obtain a color feature vector and a vehicle model feature vector;

A sixth module, configured to perform feature fusion on the overall similarity of the vehicle, the local feature distance value of the vehicle, the color feature vector and the vehicle type feature vector to obtain the vehicle similarity;

The seventh module is configured to determine, according to the vehicle similarity, a target vehicle corresponding to the vehicle to be detected from all vehicles in the candidate vehicle set.

9. An electronic device, comprising a processor and a memory;

the memory is used to store programs;

The processor executes the program to implement the method according to any one of claims 1 to 7 .

10. A computer-readable storage medium, wherein the storage medium stores a program, and the program is executed by a processor to implement the method according to any one of claims 1 to 7.