[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN112069940B - Cross-domain pedestrian re-identification method based on staged feature learning - Google Patents

Cross-domain pedestrian re-identification method based on staged feature learning Download PDF

Info

Publication number
CN112069940B
CN112069940B CN202010854297.3A CN202010854297A CN112069940B CN 112069940 B CN112069940 B CN 112069940B CN 202010854297 A CN202010854297 A CN 202010854297A CN 112069940 B CN112069940 B CN 112069940B
Authority
CN
China
Prior art keywords
pedestrian
sample
label
target domain
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN202010854297.3A
Other languages
Chinese (zh)
Other versions
CN112069940A (en
Inventor
种衍文
彭程威
潘少明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN202010854297.3A priority Critical patent/CN112069940B/en
Publication of CN112069940A publication Critical patent/CN112069940A/en
Application granted granted Critical
Publication of CN112069940B publication Critical patent/CN112069940B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/52Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • G06V20/53Recognition of crowd images, e.g. recognition of crowd congestion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention belongs to the technical field of pedestrian re-recognition and discloses a cross-domain pedestrian re-recognition method based on staged feature learning, which comprises the steps of obtaining a source domain sample with a tag based on a source domain data set with the tag, and inputting the source domain sample with the tag into a pedestrian general similarity model for training to obtain an initial model; obtaining a label-free target domain sample based on the label-free target domain data set, and inputting the label-free target domain sample into the initial model for feature extraction to obtain pedestrian features; calculating to obtain a pseudo label of the target domain sample by adopting a mutual neighbor pseudo label distribution method according to the characteristics of the pedestrian; forming a target domain sample with a label based on the pseudo label of the target domain sample, and inputting the target domain sample into a pedestrian general similarity model for training to obtain a deployment model; and inputting the sample to be queried into the deployment model for feature extraction to obtain the pedestrian feature to be queried and matching and outputting a retrieval result. The invention solves the problem of low accuracy of cross-domain pedestrian re-identification in the prior art, and can obtain the cross-domain performance with high accuracy.

Description

Cross-domain pedestrian re-identification method based on staged feature learning
Technical Field
The invention relates to the technical field of pedestrian re-identification, in particular to a cross-domain pedestrian re-identification method based on staged feature learning.
Background
The pedestrian Re-identification technology (Re-ID) is an important research direction in the field of computer vision and pattern recognition, and the goal thereof is to automatically retrieve a specific pedestrian from an image or video sequence with disjoint view angles, which has become a hot research problem in recent years due to the large-scale application of monitoring networks and the enhancement of public safety awareness.
The early pedestrian re-identification method belongs to a supervised method, and the performance of the model can be ensured only by carrying out supervised training on a large amount of label data acquired from an application scene; and when the trained model is deployed in other application scenes, the accuracy of the model is greatly reduced due to data distribution difference. Therefore, such methods require sufficient amounts of labeled data to be collected for each deployment scenario to be trained, which greatly increases deployment costs.
With the interest of researchers and increasing real-world demands, cross-domain pedestrian re-identification has been proposed. Under the cross-domain setting, data from two domains is needed: one is source domain data, i.e., existing annotated data, which may be from an open academic dataset, with low acquisition costs, and the other is target domain data, i.e., non-annotated data from an application scenario acquisition. Different from the previous supervision method, the target domain data does not need to be subjected to complicated labeling work, so that the deployment cost is greatly reduced.
At present, cross-domain pedestrian re-recognition is mainly divided into two types of methods, the first type is a method based on style migration, the data distribution difference between two domains is reduced by using a generative countermeasure network, and a pedestrian re-recognition model suitable for a target domain is trained by migrating source domain data to the style of the target domain; the second type is a method based on pseudo label training, the method firstly uses source domain data training to obtain an initial model, then uses the initial model to calculate characteristics of target domain label-free samples and cluster and endow pseudo labels, and finally uses the pseudo labels to train and fine tune the initial model.
The two methods have some problems respectively, the quality of style migration in the first method is difficult to control, and misleading samples are generated to cause error of supervision information; the second method is that the initial model is not specially designed, so that the performance of the target domain is low, and then the pseudo label obtained by clustering calculation is unreliable. The same result they do is that cross-domain pedestrian re-identification accuracy is not high.
Disclosure of Invention
The embodiment of the application solves the problem of low cross-domain pedestrian re-identification accuracy in the prior art by providing the cross-domain pedestrian re-identification method based on staged feature learning.
The embodiment of the application provides a cross-domain pedestrian re-identification method based on staged feature learning, which comprises the following steps of:
the method comprises the following steps that 1, a source domain sample with a label is obtained based on a source domain data set with the label, the source domain sample with the label is input into a pedestrian general similarity model for training, the pedestrian general similarity model which is trained by using source domain data is obtained, and the pedestrian general similarity model which is trained by using the source domain data is used as an initial model;
step 2, obtaining a label-free target domain sample based on a label-free target domain data set, inputting the label-free target domain sample into the initial model for feature extraction, and obtaining pedestrian features formed by splicing pedestrian local features and pedestrian global features; calculating to obtain a pseudo label of the target domain sample by adopting a mutual neighbor pseudo label distribution method according to the pedestrian characteristics;
step 3, forming a labeled target domain sample based on the pseudo label of the target domain sample, inputting the labeled target domain sample into the pedestrian general similarity model for training to obtain a pedestrian general similarity model trained by using target domain data, wherein the pedestrian general similarity model trained by using the target domain data is used as a deployment model;
and 4, inputting the sample to be queried into the deployment model for feature extraction to obtain the pedestrian feature to be queried, comparing the features of the pedestrian to be queried in a pedestrian database, and matching and outputting a retrieval result.
Preferably, the pedestrian general similarity model selects ResNet50 as a component of the style-independent backbone network, selects a part before layer4 to construct the style-independent backbone network, and selects layer4 as a residual block, and normalizes the examples and adds the examples into the residual block of the style-independent backbone network.
Preferably, the pedestrian general similarity model comprises a style-independent backbone network, a first branch and a second branch, wherein the first branch and the second branch are respectively connected with the output end of the style-independent backbone network;
the first branch comprises an attention model, a residual block, a global maximum pooling layer and a bottleneck layer which are connected in sequence;
the second branch comprises a residual block, a global maximum pooling layer, a background elimination module and a bottleneck layer; the output end of the residual block is respectively connected with the input ends of the global maximum pooling layer and the background elimination module, the output end of the background elimination module is connected with the input end of the global maximum pooling layer, and the output end of the global maximum pooling layer is connected with the input end of the bottleneck layer;
the first branch is used for obtaining the local features of the pedestrians, and the second branch is used for obtaining the global features of the pedestrians.
Preferably, the attention model is constructed by a spatial transformation network, the bottleneck layer is composed of a full connection layer and batch normalization, and the background elimination module adopts an SCDA (sparse code division multiple access) in the field of fine-grained image retrieval to realize background elimination.
Preferably, the pedestrian local feature and the pedestrian global feature are both subjected to the constraint of triple loss, and both adopt cross entropy loss as an optimization function.
Preferably, the mutual neighbor pseudo label allocation method is implemented as follows:
calculating each sample according to the Euclidean distance between the characteristics of the samples to obtain k neighbors, if the two samples a and b are the k neighbors of the other side, the two samples a and b meet the mutual neighbor relation, and the two samples a and b meeting the strong constraint relation of the mutual neighbor relation belong to the same pedestrian;
and propagating the mutual neighbor relation, and when the two samples a and c meet the mutual neighbor relation and the two samples a and b meet the mutual neighbor relation, determining that the three samples a, b and c belong to the same individual.
Preferably, in the step 1, before inputting the labeled source domain sample into a pedestrian general similarity model for training, preprocessing the labeled source domain sample; the preprocessing comprises the steps of scaling the size of a sample image, converting the sample image into a tensor type specified by a pytorech framework, and normalizing;
in the step 2, the unlabeled target domain sample is preprocessed before being input into the initial model for feature extraction; the preprocessing comprises the steps of scaling the size of the sample image, converting the sample image into a tensor type specified by the pytorech framework, and normalizing.
One or more technical solutions provided in the embodiments of the present application have at least the following technical effects or advantages:
in the embodiment of the application, aiming at the problem that the pedestrian features hardly meet the regional differentiation and the inter-domain generalization simultaneously, the method provides the steps of dividing the cross-domain pedestrian re-identification into two stages of the region invariant feature learning and the region specific feature learning, and designs each stage, so that the high-accuracy cross-domain performance is realized. In addition, aiming at the problem of low reliability of the target domain pseudo label, the invention designs a mutual neighbor-based pseudo label distribution method, and effectively solves the problem that the conventional clustering algorithm is difficult to effectively cluster the same pedestrian with huge appearance difference, so that the high-quality pseudo label is provided to meet the requirement of realizing domain specific feature learning on the label-free sample of the target domain.
Drawings
FIG. 1 is an overall frame diagram of a cross-domain pedestrian re-identification method based on staged feature learning according to the present invention;
FIG. 2 is a schematic diagram of an improved residual block of the present invention;
FIG. 3 is an illustration of parameters in an attention model of the present invention;
FIG. 4 is an overall flowchart of a cross-domain pedestrian re-identification method based on staged feature learning according to the present invention;
fig. 5 is a cross-domain pedestrian re-recognition result obtained by the cross-domain pedestrian re-recognition method based on staged feature learning provided by the invention.
Detailed Description
In order to better understand the technical solution, the technical solution will be described in detail with reference to the drawings and the specific embodiments.
The invention provides a cross-domain pedestrian re-identification method based on staged feature learning, which mainly comprises the following steps of:
the method comprises the following steps of 1, obtaining a source domain sample with a label based on a source domain data set with the label, inputting the source domain sample with the label into a pedestrian general similarity model for training, obtaining the pedestrian general similarity model trained by using the source domain data, and taking the pedestrian general similarity model trained by using the source domain data as an initial model.
Step 2, obtaining a label-free target domain sample based on a label-free target domain data set, inputting the label-free target domain sample into the initial model for feature extraction, and obtaining pedestrian features formed by splicing pedestrian local features and pedestrian global features; and calculating the pseudo label of the target domain sample by adopting a mutual neighbor pseudo label distribution method according to the pedestrian characteristics.
And 3, forming a labeled target domain sample based on the pseudo label of the target domain sample, inputting the labeled target domain sample into the pedestrian general similarity model for training to obtain the pedestrian general similarity model trained by using the target domain data, and taking the pedestrian general similarity model trained by using the target domain data as a deployment model.
And 4, inputting the sample to be queried into the deployment model for feature extraction to obtain the pedestrian feature to be queried, comparing the features of the pedestrian to be queried in a pedestrian database, and matching and outputting a retrieval result.
The pedestrian general similarity model selects ResNet50 as a component of the style-independent backbone network, selects a part before layer4 to construct the style-independent backbone network, uses layer4 as a residual block, and normalizes the examples and adds the examples into the residual block of the style-independent backbone network.
Referring to fig. 1, the pedestrian general similarity model includes a style-independent backbone network, a first branch, and a second branch, where the first branch and the second branch are respectively connected to an output end of the style-independent backbone network; the first branch comprises an attention model, a residual block, a global maximum pooling layer and a bottleneck layer which are connected in sequence; the second branch comprises a residual block, a global maximum pooling layer, a background elimination module and a bottleneck layer; the output end of the residual block is respectively connected with the input ends of the global maximum pooling layer and the background elimination module, the output end of the background elimination module is connected with the input end of the global maximum pooling layer, and the output end of the global maximum pooling layer is connected with the input end of the bottleneck layer; the first branch is used for obtaining the local features of the pedestrians, and the second branch is used for obtaining the global features of the pedestrians.
The attention model is constructed by a space transformation network, the bottleneck layer is composed of a full connection layer and batch normalization, and the background elimination module adopts SCDA in the field of fine-grained image retrieval to realize background elimination.
And the pedestrian local feature and the pedestrian global feature are both subjected to the constraint of triple loss, and cross entropy loss is adopted as an optimization function.
The mutual neighbor pseudo label allocation method is specifically implemented as follows: calculating each sample according to the Euclidean distance between the characteristics of the samples to obtain k neighbors, if the two samples a and b are the k neighbors of the other side, the two samples a and b meet the mutual neighbor relation, and the two samples a and b meeting the strong constraint relation of the mutual neighbor relation belong to the same pedestrian; and propagating the mutual-neighbor relation, and if two samples a and c satisfy the mutual-neighbor relation and two samples a and b satisfy the mutual-neighbor relation, determining that the three samples a, b and c belong to the same individual.
The present invention is further described below.
The embodiment provides a cross-domain pedestrian re-identification method based on staged feature learning, wherein a pyrrch is used as a convolutional neural network building frame, and a staged learning process of a domain invariant feature and a domain specific feature is built, so that efficient automatic retrieval of a target pedestrian is realized. Specifically, in consideration that the pedestrian features hardly satisfy inter-domain generalization and intra-domain discrimination at the same time, the cross-domain pedestrian feature learning is divided into two stages, namely, domain invariant feature learning and domain specific feature learning. In the first stage of domain invariant feature learning, the pedestrian features with strong universality are learned from source domain data through network structure design, and the model is used as an initial model. In the second stage of domain specific feature learning, the method utilizes the naturally existing similarity among the samples of the target domain, provides a novel pseudo label assignment method according to the general pedestrian feature extracted by the initial model, and then uses the pseudo label as supervision information to realize the supervision training on the samples of the target domain and learn the domain specific feature from the target domain.
The method mainly comprises two parts, namely a pedestrian general similarity model designed for domain invariant feature learning and a mutual neighbor pseudo label distribution method designed for domain specific feature learning.
In order to ignore the characteristics of the data set and learn the pedestrian characteristics with strong universality, the pedestrian general similarity model designed by the scheme is shown as the attached drawing 1. In the present solution, two main factors that are concerned with influencing the generalization of the model are the style difference of the camera and the background information.
In the domain invariant feature learning, the pedestrian features which are irrelevant to the style are extracted from a source domain sample through a style-irrelevant backbone network, so that the pedestrian features caused by the style difference of cameras are avoided being different. Classical ResNet50 was chosen as a component of the style independent backbone, specifically, the part before layer4 was chosen to build the style independent backbone, while layer4 was used as the residual block shown in FIG. 1. ResNet50 is not specifically designed for style issues, and the present invention adds Instance Normalization to the Residual Block of a style-independent backbone, as shown in FIG. 2. The Instance Normalization eliminates the intra-sample commonality by carrying out statistics on each sample independently, and endows the network with the capability of extracting style-independent features to a certain extent.
After the style-independent features are extracted, the network is divided into an upper branch and a lower branch, in the upper branch, the network automatically finds the pedestrian region with the most distinguishing force through an attention model, the attention model is constructed by STN (Spatial transform Networks), and specific parameters are shown in figure 3. And then obtaining the local features of the pedestrians through the residual blocks and the global maximum pooling and bottleneck layers, wherein the local features represent the features which have the most distinguishing capability for identifying the pedestrians. The bottleneck layer consists of a 1024-dimensional fully connected layer and batch normalization. The lower branch does not go through the attention model to learn the global features of the pedestrian. In addition, the following branches are different in training and reasoning, and the global features of the pedestrians can be obtained through background elimination in reasoning. Background elimination is done using classical working scda (selective relational Descriptor aggregation) in the field of fine-grained search. The basic principle is that the average response of the features extracted by the trained model on the channels has higher intensity in the main body area, and the error response of some channels on the background area can be filtered by taking the average response intensity as a threshold value, so that the influence of background information is ignored. On one hand, the local features and the global features are directly restricted by triple loss, on the other hand, prediction categories are output through a classifier and cross entropy loss is calculated through the labels, and in the domain invariant feature learning at the stage, real labels from a source domain are used as labels needed for supervision.
In the domain specific feature learning, firstly, a pseudo label is obtained by calculating an unlabeled sample in a target domain, and then the model is retrained according to the above process by using the pseudo label as supervision information. In order to obtain the pseudo label, the pedestrian general similarity model is used for reasoning on the unlabeled sample of the target domain to respectively obtain the local characteristic and the global characteristic of the pedestrian, and the local characteristic and the global characteristic of the pedestrian are directly connected in series to form the final pedestrian characteristic. After extracting features of all target domain samples, calculating the pseudo labels by using the mutual neighbor pseudo label distribution method provided by the scheme.
The specific operation of the mutual neighbor pseudo label allocation method is as follows: firstly, k neighbor is obtained by calculating Euclidean distance between characteristics of each sample, and if two samples of a and b are the k neighbor of the other side, the a and b satisfy the mutual neighbor relation. The samples of a and b which satisfy the strong constraint relationship of mutual neighbor belong to the same pedestrian, and the mutual neighbor relationship is propagated, namely when a and c satisfy the mutual neighbor and a and b satisfy the mutual neighbor, the a, b and c are considered to belong to the same individual.
The method has the advantages that the visual difference caused by different angles of the pedestrians is considered, if a is a front photograph, b is a side photograph and c is a back photograph of a certain pedestrian, the conventional clustering algorithm is difficult to judge that a and c with large appearance difference belong to the same person, but the front photograph a and the back photograph c respectively have certain similarity with the side photograph b. This pseudo label is then used to supervise model training.
Through the above domain invariant feature learning and the domain specific feature learning, a final cross-domain pedestrian re-identification model is obtained. Deploying the model to a target domain scene, receiving pedestrian picture input, executing inference to obtain pedestrian features, then performing Euclidean distance comparison between the features in a pedestrian database, and outputting a retrieval result according to the ascending order of the distances, as shown in figure 5.
The cross-domain pedestrian re-identification process is further described below.
With 384 × 128 × 3 images as training and testing images, referring to fig. 1 and 4, the main steps include:
(1) domain invariant feature learning
The image size of the source domain data set (e.g., Market1501) is first scaled to 384 × 128 × 3, then converted to the tensor type specified by the pytorech framework, and normalized. The batch size used for training is 32, the epoch is 64, input samples pass through a training process in the pedestrian general similarity model in the attached figure 1 to obtain global features and local features, the global features and the local features calculate triple losses and cross entropy losses according to real labels provided by source domain data, then a random gradient descent method is used for model parameter optimization, domain invariant feature learning is completed, and a model trained by the source domain data is obtained.
(2) Pseudo label assignment
An image of the target domain data (e.g., DukeMTMC) is subjected to the same data preprocessing as in step (1). Then inputting each sample into a pedestrian general similarity model which is trained by source domain data in the attached figure 1, executing a test process to obtain pedestrian characteristics formed by splicing local characteristics and global characteristics, and then calculating to obtain a pseudo label of a target domain sample according to a mutual neighbor pseudo label distribution method provided by the invention.
(3) Domain specific feature learning
And (3) using the pseudo label obtained in the step (2) as supervision information, inputting a target domain sample, training the target domain sample according to the process in the step (1), completing the learning of domain specific characteristics, and obtaining a model trained by using target domain data, wherein the model is used as a deployment model.
(4) Pedestrian retrieval
When the query occurs, preprocessing the query sample by the same data as the data in the step (1), inputting the preprocessed query sample into the deployment model to perform reasoning to obtain the pedestrian characteristics, comparing the characteristics in a pedestrian database by using the pedestrian characteristics, and outputting a retrieval result in an ascending Euclidean distance manner.
In order to verify the performance of the invention, the cross-domain performance of the scheme is respectively tested on the Market1501 and DukeMTMC-reiD data sets, and the results are as follows:
(1) market1501 data set: rank-1 was 82.2 and mAP was 58.6.
(2) DukeMTMC-reiD dataset: rank-1 was 75.7 and mAP was 58.0.
The above data are superior to the cross-domain performance of existing schemes.
The cross-domain pedestrian re-identification method based on staged feature learning provided by the embodiment of the invention at least comprises the following technical effects:
(1) aiming at the problem that the pedestrian characteristics are difficult to simultaneously meet the regional component and inter-domain generalization in the region, the invention provides the method for identifying the pedestrian across the region into two stages of learning the region-invariant characteristics and learning the region-specific characteristics, and each stage is specially designed, so that the cross-region performance with high accuracy is realized.
(2) Aiming at the problem that the pedestrian characteristics are greatly influenced by the difference between the camera style and the background, the style-independent backbone network is constructed by introducing Instance Normalization and the SCDA is introduced to eliminate the picture background so as to learn the pedestrian characteristics with strong domain invariance in a source domain sample; meanwhile, an attention mechanism is introduced, and a network architecture combining global features and local features is constructed to enhance feature differentiation force.
(3) Aiming at the problem of low reliability of the target domain pseudo label, the invention designs a mutual neighbor-based pseudo label distribution method, which effectively solves the problem that the existing clustering algorithm is difficult to effectively cluster the same pedestrian with huge appearance difference, thereby providing a high-quality pseudo label to meet the requirement of realizing domain specific feature learning on the label-free sample of the target domain.
Finally, it should be noted that the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting, and although the present invention is described in detail with reference to examples, it should be understood by those skilled in the art that modifications or equivalent substitutions may be made to the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention, which should be covered by the claims of the present invention.

Claims (5)

1. A cross-domain pedestrian re-identification method based on staged feature learning is characterized by comprising the following steps of:
the method comprises the following steps that 1, a source domain sample with a label is obtained based on a source domain data set with the label, the source domain sample with the label is input into a pedestrian general similarity model for training, the pedestrian general similarity model which is trained by using source domain data is obtained, and the pedestrian general similarity model which is trained by using the source domain data is used as an initial model;
step 2, obtaining a label-free target domain sample based on a label-free target domain data set, inputting the label-free target domain sample into the initial model for feature extraction, and obtaining a pedestrian feature formed by splicing a pedestrian local feature and a pedestrian global feature; calculating to obtain a pseudo label of the target domain sample by adopting a mutual neighbor pseudo label distribution method according to the pedestrian characteristics;
step 3, forming a labeled target domain sample based on the pseudo label of the target domain sample, inputting the labeled target domain sample into the pedestrian general similarity model for training to obtain a pedestrian general similarity model trained by using target domain data, wherein the pedestrian general similarity model trained by using the target domain data is used as a deployment model;
step 4, inputting a sample to be queried into the deployment model for feature extraction to obtain pedestrian features to be queried, performing feature comparison in a pedestrian database through the pedestrian features to be queried, and matching and outputting a retrieval result;
the pedestrian general similarity model comprises a style-independent backbone network, a first branch and a second branch, wherein the first branch and the second branch are respectively connected with the output end of the style-independent backbone network; the first branch comprises an attention model, a residual block, a global maximum pooling layer and a bottleneck layer which are connected in sequence; the second branch comprises a residual block, a global maximum pooling layer, a background elimination module and a bottleneck layer; the output end of the residual block is respectively connected with the input ends of the global maximum pooling layer and the background elimination module, the output end of the background elimination module is connected with the input end of the global maximum pooling layer, and the output end of the global maximum pooling layer is connected with the input end of the bottleneck layer; the first branch is used for obtaining the local features of the pedestrians, and the second branch is used for obtaining the global features of the pedestrians;
the mutual neighbor pseudo label allocation method is specifically implemented as follows: calculating the Euclidean distance between the characteristics of each sample to obtain k neighbor, if the two samples a and b are the k neighbor of the other side, the a and b satisfy the mutual neighbor relation, and the two samples a and b satisfying the strong constraint relation of the mutual neighbor relation belong to the same pedestrian; and propagating the mutual neighbor relation, and when the two samples a and c meet the mutual neighbor relation and the two samples a and b meet the mutual neighbor relation, determining that the three samples a, b and c belong to the same individual.
2. The method for cross-domain pedestrian re-identification based on staged feature learning according to claim 1, wherein the pedestrian general similarity model selects ResNet50 as a component of a style-independent backbone network, selects a part before layer4 to construct the style-independent backbone network, and selects layer4 as a residual block, and normalizes instances to be added into the residual block of the style-independent backbone network.
3. The method according to claim 1, wherein the attention model is constructed by a space transformation network, the bottleneck layer is composed of a full connection layer and batch normalization, and the background elimination module adopts SCDA in the fine-grained image retrieval field to realize background elimination.
4. The method for cross-domain pedestrian re-identification based on staged feature learning according to claim 1, wherein the pedestrian local features and the pedestrian global features are both subject to triple loss constraint and both adopt cross entropy loss as an optimization function.
5. The method for cross-domain pedestrian re-recognition based on staged feature learning according to claim 1, wherein in the step 1, the labeled source domain samples are preprocessed before being input into a pedestrian general similarity model for training; the preprocessing comprises the steps of scaling the size of a sample image, converting the sample image into a tensor type specified by a pytorech framework, and normalizing;
in the step 2, before inputting the unlabeled target domain sample into the initial model for feature extraction, preprocessing the unlabeled target domain sample; the preprocessing comprises the steps of scaling the size of the sample image, converting the sample image into a tensor type specified by the pytorech framework, and normalizing.
CN202010854297.3A 2020-08-24 2020-08-24 Cross-domain pedestrian re-identification method based on staged feature learning Expired - Fee Related CN112069940B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010854297.3A CN112069940B (en) 2020-08-24 2020-08-24 Cross-domain pedestrian re-identification method based on staged feature learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010854297.3A CN112069940B (en) 2020-08-24 2020-08-24 Cross-domain pedestrian re-identification method based on staged feature learning

Publications (2)

Publication Number Publication Date
CN112069940A CN112069940A (en) 2020-12-11
CN112069940B true CN112069940B (en) 2022-09-13

Family

ID=73659218

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010854297.3A Expired - Fee Related CN112069940B (en) 2020-08-24 2020-08-24 Cross-domain pedestrian re-identification method based on staged feature learning

Country Status (1)

Country Link
CN (1) CN112069940B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112906704A (en) * 2021-03-09 2021-06-04 深圳海翼智新科技有限公司 Method and apparatus for cross-domain target detection
CN113095221B (en) * 2021-04-13 2022-10-18 电子科技大学 Cross-domain pedestrian re-identification method based on attribute feature and identity feature fusion
CN113221656B (en) * 2021-04-13 2022-07-22 电子科技大学 Cross-domain pedestrian re-identification device and method based on domain invariant features
CN113158881B (en) * 2021-04-19 2022-06-14 电子科技大学 Cross-domain pedestrian re-identification method based on attention mechanism
CN113449676B (en) * 2021-07-13 2024-05-10 凌坤(南通)智能科技有限公司 Pedestrian re-identification method based on two-way interaction-based disentanglement learning
CN113627380B (en) * 2021-08-20 2024-03-15 中国矿业大学 Cross-vision pedestrian re-identification method and system for intelligent security and early warning
CN113627383A (en) * 2021-08-25 2021-11-09 中国矿业大学 Pedestrian loitering re-identification method for panoramic intelligent security
CN114092964A (en) * 2021-10-19 2022-02-25 杭州电子科技大学 Cross-domain pedestrian re-identification method based on attention guidance and multi-scale label generation
CN113902613B (en) * 2021-11-19 2024-08-06 江苏科技大学 Image style migration system and method based on three-branch clustering semantic segmentation
CN114973350B (en) * 2022-03-24 2024-02-27 西北工业大学 Cross-domain facial expression recognition method irrelevant to source domain data
CN116168348B (en) * 2023-04-21 2024-01-30 成都睿瞳科技有限责任公司 Security monitoring method, system and storage medium based on image processing
CN116258751A (en) * 2023-05-08 2023-06-13 中国人民解放军国防科技大学 Security patrol management method, system and equipment based on unsupervised learning
CN117274619B (en) * 2023-11-21 2024-02-09 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) Cross-domain target recognition method based on style fusion contrast learning

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2564668B (en) * 2017-07-18 2022-04-13 Vision Semantics Ltd Target re-identification
CN110942025A (en) * 2019-11-26 2020-03-31 河海大学 Unsupervised cross-domain pedestrian re-identification method based on clustering
CN111027442A (en) * 2019-12-03 2020-04-17 腾讯科技(深圳)有限公司 Model training method, recognition method, device and medium for pedestrian re-recognition
CN111476168B (en) * 2020-04-08 2022-06-21 山东师范大学 Cross-domain pedestrian re-identification method and system based on three stages

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Learning Discriminative Features with Multiple Granularities for Person Re-Identification;Guanshuo Wang et al.;《https://arxiv.org/abs/1804.01438》;20180831;全文 *
基于深度学习的行人再识别技术研究;余忠永;《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》;20200215;第2020年卷(第2期);第I138-1280页 *

Also Published As

Publication number Publication date
CN112069940A (en) 2020-12-11

Similar Documents

Publication Publication Date Title
CN112069940B (en) Cross-domain pedestrian re-identification method based on staged feature learning
CN110414368B (en) Unsupervised pedestrian re-identification method based on knowledge distillation
CN111259786B (en) Pedestrian re-identification method based on synchronous enhancement of appearance and motion information of video
Dong et al. Deep metric learning-based for multi-target few-shot pavement distress classification
CN111898736B (en) Efficient pedestrian re-identification method based on attribute perception
CN112784728B (en) Multi-granularity clothes changing pedestrian re-identification method based on clothing desensitization network
CN110633632A (en) Weak supervision combined target detection and semantic segmentation method based on loop guidance
CN111767927A (en) Lightweight license plate recognition method and system based on full convolution network
CN109063649B (en) Pedestrian re-identification method based on twin pedestrian alignment residual error network
CN112990282B (en) Classification method and device for fine-granularity small sample images
CN111797814A (en) Unsupervised cross-domain action recognition method based on channel fusion and classifier confrontation
CN111612051A (en) Weak supervision target detection method based on graph convolution neural network
CN108681696A (en) Merge the infrared target robustness recognition methods of multiple features dimensionality reduction and transfer learning
CN115063832B (en) Global and local feature-based anti-learning cross-modal pedestrian re-identification method
CN112819065A (en) Unsupervised pedestrian sample mining method and unsupervised pedestrian sample mining system based on multi-clustering information
Symeonidis et al. Neural attention-driven non-maximum suppression for person detection
CN112507778B (en) Loop detection method of improved bag-of-words model based on line characteristics
CN108416795B (en) Video action identification method based on sorting pooling fusion space characteristics
CN113947101A (en) Unsupervised pedestrian re-identification method and system based on softening similarity learning
CN113139501A (en) Pedestrian multi-attribute identification method combining local area detection and multi-level feature capture
CN113222072A (en) Lung X-ray image classification method based on K-means clustering and GAN
CN113158901A (en) Domain-adaptive pedestrian re-identification method
CN110110598A (en) The pedestrian of a kind of view-based access control model feature and space-time restriction recognition methods and system again
CN113705348A (en) Pedestrian re-identification method integrating multiple attention mechanisms
CN114529578A (en) Multi-target tracking method based on comparison learning mode

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220913