CN113947704A - Adversarial sample defense system and method based on attention ranking - Google Patents
Adversarial sample defense system and method based on attention ranking
- Publication number
- CN113947704A CN113947704A CN202111175218.7A CN202111175218A CN113947704A CN 113947704 A CN113947704 A CN 113947704A CN 202111175218 A CN202111175218 A CN 202111175218A CN 113947704 A CN113947704 A CN 113947704A
- Authority
- CN
- China
- Prior art keywords
- attention
- feature
- sample
- model
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Abstract
The invention provides an adversarial sample defense system and method based on attention ranking. The method comprises the following steps: S101, pre-training a classification model to obtain adversarial samples; S102, preprocessing the adversarial samples through the first module of the adversarial sample defense model, a discrete cosine transform layer, to eliminate part of the attack; S103, passing the feature map through an attention module to obtain attention weights; S104, recovering a feature vector with global features according to the attention weights; and S105, training the network according to the loss function and saving the model. By modifying the network itself, the defense method alleviates the large hardware-resource and time requirements of model training in the prior art, and takes the influence of key points into account when the network is modified.
Description
Technical Field
The invention relates to the technical field of computer attack and defense, and in particular to an adversarial sample defense system and method based on attention ranking.
Background
In real life, images are often contaminated with noise. Noise has many sources, such as channel instability during image transmission and imperfections in system equipment. Deep neural networks are vulnerable to adversarial noise: by adding subtle perturbations to the input that are indistinguishable to humans, an attacker creates adversarial samples that fool the network into producing wrong outputs with high probability, threatening the security of deep learning applications.
There are two main approaches for generating adversarial attacks against deep neural networks: sign-gradient-based methods and optimization-based methods. In the face of these powerful attacks, effective defenses are crucial.
However, from the perspective of data preprocessing, existing defense methods require more hardware resources and longer training time, and also add considerable time to the testing stage.
Disclosure of Invention
The invention aims to provide an adversarial sample defense system and method based on attention ranking, which address the large hardware-resource and time requirements of model training in the prior art while taking the key points targeted by attacks into account.
In order to achieve the above purpose, the invention provides the following technical scheme:
An adversarial sample defense method based on attention ranking, which specifically comprises the following steps:
S101, pre-training a classification model to obtain adversarial samples;
S102, preprocessing the adversarial samples through the first module of the adversarial sample defense model, a discrete cosine transform layer, to eliminate part of the attack;
S103, passing the feature map through an attention module to obtain attention weights;
S104, recovering a feature vector with global features according to the attention weights;
and S105, training the network according to the loss function, and saving the model.
On the basis of the technical scheme, the invention can be further improved as follows:
Further, S101 specifically comprises:
S1011, inputting the original images of the training set and obtaining weights through pre-training;
and S1012, generating adversarial samples using the weights.
Further, S102 specifically comprises:
S1021, applying a Discrete Cosine Transform (DCT) after the adversarial sample passes through the first convolutional layer;
S1022, eliminating part of the attack in the frequency domain using the designed activation function η(x);
and S1023, restoring to the spatial domain using the inverse discrete cosine transform to obtain a feature map with part of the attack eliminated.
Further, S103 specifically comprises:
S1031, passing the feature map sequentially through four residual blocks to obtain the feature vector sets L_1, L_2, L_3, L_4;
S1032, obtaining a global vector g from L_4 through average pooling, and simultaneously mapping the dimension of L_s, s ∈ {1,2,3}, into the dimension n of g;
S1033, connecting each local feature l_s^i with the global feature g by element-wise addition, and mapping the result into an attention weight C through a learned FC mapping, the calculation of the attention weight C being defined as Formula 2;
where L_s = {l_s^1, l_s^2, …, l_s^n} denotes the set of feature vectors extracted after each residual block, s ∈ {1,2,3}; l_s^i denotes the i-th of the n feature vectors in L_s; g denotes the global feature vector output by the fully-connected layer in FIG. 7; and l_s^i and g both have dimension n.
Further, S104 specifically comprises:
S1041, reshaping the feature map L_s and the predicted attention map C_s into one-dimensional arrays {L_s(n)} and {C_s(n)}, respectively, where n = H × W and {L_s(n)} is a feature vector of channel size C;
S1042, selecting the first K values {C_s(k)} from the attention map {C_s(n)}, recording their point positions, and extracting the feature points {L_s(k)} at these positions from the feature map {L_s(n)};
and S1043, calculating a key-point-based global vector g_s through Formula 3:
g_s = C_s(k) ⊙ L_s(k)    (Formula 3)
where ⊙ denotes the inner product between the key points {C_s(k)} and each channel of the feature map {L_s(k)}.
Further, S105 specifically comprises:
S1051, calculating the loss according to the classification result, back-propagating to update the model weights, and fine-tuning the model, wherein the loss function uses the cross-entropy loss of Formula 4;
and S1052, loading the test set, generating adversarial samples through the trained model, calculating the accuracy to evaluate the performance of the model, and saving the optimal model weights.
An adversarial sample defense system based on attention ranking, comprising:
The adversarial sample defense system modifies the base model ResNet18. The whole system is divided into three modules: a discrete cosine transform activation module, an attention module, and a key point ranking module. A Discrete Cosine Transform (DCT) activation layer is introduced after the first convolutional layer to effectively suppress noise patterns based on gradient attacks. In the attention module and the key point ranking module, key points are dynamically selected on the feature map for classification based on an attention mechanism, so as to reduce the influence of other attacked pixels.
Further, the discrete cosine transform activation module is configured to:
apply a Discrete Cosine Transform (DCT) after the adversarial samples pass through the first convolutional layer;
eliminate part of the attack in the frequency domain using the designed activation function η(x);
and restore to the spatial domain using the inverse discrete cosine transform to obtain a feature map with part of the attack eliminated.
Further, the attention module is configured to:
map the dimension of L_s into the dimension n of g before calculating the attention weight;
and connect each local feature l_s^i with the global feature g by element-wise addition, mapping the result into an attention weight C through a learned FC mapping, the calculation of the attention weight C being defined as Formula 2;
where L_s = {l_s^1, l_s^2, …, l_s^n} denotes the set of feature vectors extracted after each residual block, s ∈ {1,2,3}; l_s^i denotes the i-th of the n feature vectors in L_s; g denotes the global feature vector output by the fully-connected layer in FIG. 7; and l_s^i and g both have dimension n.
Further, the key point ranking module is configured to:
reshape the feature map L_s and the predicted attention map C_s into one-dimensional arrays {L_s(n)} and {C_s(n)}, respectively, where n = H × W and {L_s(n)} is a feature vector of channel size C;
select the first K values {C_s(k)} from the attention map {C_s(n)}, record their point positions, and extract the feature points {L_s(k)} at these positions from the feature map {L_s(n)};
and calculate a key-point-based global vector g_s through Formula 3:
g_s = C_s(k) ⊙ L_s(k)    (Formula 3)
where ⊙ denotes the inner product between the key points {C_s(k)} and each channel of the feature map {L_s(k)}.
The invention has the following advantages:
The attention-ranking-based adversarial sample defense method introduces a discrete cosine transform activation layer, which eliminates part of the adversarial attack and at the same time effectively blocks the back-propagation of gradients to a white-box attacker. The remaining adversarial noise is eliminated by the dynamic key point selection of the attention mechanism, yielding a correct classification result. The new ResNet18 network architecture provided by the invention defends well against attacks, alleviates the large hardware-resource and time requirements of existing defense models during training, and takes the influence of key points into account when the network is modified.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is a flow chart of the adversarial sample defense method in an embodiment of the invention;
FIG. 2 is a flowchart illustrating an embodiment of S101;
FIG. 3 is a flowchart illustrating the detailed process of S102 according to an embodiment of the present invention;
FIG. 4 is a flowchart illustrating the detailed process of S103 according to an embodiment of the present invention;
FIG. 5 is a flowchart illustrating S104 according to an embodiment of the present invention;
FIG. 6 is a flowchart illustrating an embodiment of S105;
FIG. 7 is a block diagram of the attention-ranking adversarial sample defense network framework in an embodiment of the invention.
Detailed Description
The technical solutions of the present invention will be described clearly and completely with reference to the following embodiments, and it should be understood that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, the adversarial sample defense method based on attention ranking specifically comprises:
S101, obtaining adversarial samples;
in this step, a classification model is pre-trained to obtain adversarial samples;
S102, preprocessing the adversarial samples;
in this step, the adversarial samples are preprocessed through the first module of the adversarial sample defense model, a discrete cosine transform layer, to eliminate part of the attack;
S103, obtaining attention weights;
in this step, the feature map passes through an attention module to obtain attention weights;
S104, obtaining a feature vector with global features;
in this step, a feature vector with global features is recovered according to the attention weights;
and S105, training the network according to the loss function, and saving the model.
The discrete cosine transform activation layer introduced after the convolutional layer eliminates part of the adversarial attack and at the same time effectively blocks the back-propagation of gradients to a white-box attacker. The remaining adversarial noise is eliminated by the dynamic key point selection of the attention mechanism, yielding a correct classification result. The new ResNet18 network architecture provided by the invention defends well against attacks.
The adversarial attack referred to in the invention is: a method of breaking a recognition system by exploiting the weaknesses of deep learning. In general, an adversarial attack makes targeted changes to the object to be recognized so that no abnormality is visible to the human eye, yet the recognition model fails.
The adversarial sample referred to in the invention is: an original sample to which a perturbation imperceptible to the human eye has been added (such a perturbation does not affect human recognition but easily fools the model), causing the machine to make a wrong decision.
The adversarial defense referred to in the invention is: a series of defense strategies developed to resist adversarial attacks.
The attention model referred to in the invention is: the attention model in deep learning is similar in nature to the selective visual attention mechanism of human beings; its core goal is to select, from the many pieces of available information, the information most critical to the current task and to suppress other useless information, thereby improving the efficiency and accuracy of information processing.
The Discrete Cosine Transform (DCT) referred to in the invention is: a transform defined on real signals that yields a real signal in the frequency domain. The DCT also has a very important energy-compaction property: after the discrete cosine transform, the energy of most natural signals (audio and video) is concentrated in the low-frequency part, so the DCT is widely used for (audio and video) data compression.
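The energy-compaction property described above can be checked numerically. The sketch below uses SciPy's `dctn`; the smooth 2-D test signal is an arbitrary stand-in for a natural image, not data from the patent:

```python
import numpy as np
from scipy.fft import dctn

# A smooth 2-D test signal standing in for a natural image: after the DCT,
# most of its energy should be concentrated in the low-frequency corner.
x = np.linspace(0.0, 1.0, 32)
img = np.outer(np.sin(np.pi * x), np.cos(np.pi * x / 2))

coeffs = dctn(img, norm="ortho")          # orthonormal 2-D DCT-II
total_energy = np.sum(coeffs ** 2)
low_energy = np.sum(coeffs[:8, :8] ** 2)  # lowest 8x8 of the 32x32 frequencies

print(f"share of energy in the lowest frequencies: {low_energy / total_energy:.4f}")
```

With `norm="ortho"` the transform is orthonormal, so total energy is preserved (Parseval) and the ratio directly measures how much of the signal lives in the low frequencies.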
The invention provides an adversarial sample defense method based on attention key point ranking, which defends against attacks and improves the robustness of deep neural networks. Existing methods add an external model to preprocess the input image, and for better defense these external models become increasingly complex. The invention instead improves the robustness of the deep neural network by modifying the network itself and selecting key points on the attention map, and introduces a discrete cosine transform and an activation function into the network to effectively suppress noise patterns based on gradient attacks.
The invention is motivated by the fact that the human eye classifies images and recognizes objects without analyzing them pixel by pixel on a grid. Instead, humans quickly detect discriminative key points (or object parts) in the image and extract features from them to make decisions for image classification or object detection. The invention therefore extracts key points on the attention map and designs a new network architecture that supports dynamic key point selection by the attention mechanism, improving the robustness of the deep learning network. Meanwhile, the discrete cosine transform and activation function introduced into the network effectively suppress noise patterns based on gradient attacks.
On the basis of the technical scheme, the invention can be further improved as follows:
as shown in fig. 2, further, the S101 specifically includes:
s1011, obtaining the weight;
in the step, an original image of a training set is input, and a weight is obtained through pre-training;
s1012, generating a confrontation sample;
in this step, the weights are used to generate confrontation samples for training the robustness network.
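The background section names sign-gradient methods as one of the two main attack families. As a concrete illustration of S1012, the sketch below generates adversarial samples for a linear softmax classifier with an FGSM-style sign-gradient step; the weight matrix `W`, the budget `eps`, and the toy data are illustrative assumptions, not values from the patent:

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def fgsm(W, x, y, eps=8 / 255):
    """Sign-gradient (FGSM-style) adversarial samples for a linear softmax
    classifier; W and eps stand in for the pre-trained weights and
    perturbation budget, which the patent text does not specify."""
    p = softmax(x @ W)                          # (B, K) class probabilities
    grad_logits = p.copy()
    grad_logits[np.arange(len(y)), y] -= 1.0    # d(cross-entropy)/d(logits)
    grad_x = grad_logits @ W.T                  # chain rule back to the input
    return np.clip(x + eps * np.sign(grad_x), 0.0, 1.0)

W = rng.normal(size=(64, 10))
x = rng.uniform(size=(4, 64))                   # 4 flattened "images" in [0, 1]
y = rng.integers(0, 10, size=4)
x_adv = fgsm(W, x, y)
print(x_adv.shape, np.abs(x_adv - x).max())
```

The perturbation is bounded by `eps` per pixel, matching the idea of a change imperceptible to the human eye.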
As shown in fig. 3, S102 specifically comprises:
S1021, applying a Discrete Cosine Transform (DCT) after the adversarial sample passes through the first convolutional layer;
S1022, eliminating part of the attack in the frequency domain using the designed activation function η(x);
and S1023, restoring to the spatial domain using the inverse discrete cosine transform to obtain a feature map with part of the attack eliminated.
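A minimal sketch of steps S1021–S1023 (DCT, frequency-domain activation, inverse DCT) using SciPy. The patent does not reproduce η(x) in this text, so a hypothetical hard-threshold that zeroes small-amplitude coefficients stands in for it here:

```python
import numpy as np
from scipy.fft import dctn, idctn

def dct_activation_layer(feature_map, thresh=0.1):
    """Sketch of S1021-S1023: DCT -> activation in the frequency domain ->
    inverse DCT. The threshold activation below is a stand-in for the
    undisclosed eta(x); it zeroes small (typically high-frequency,
    low-amplitude) coefficients."""
    freq = dctn(feature_map, norm="ortho", axes=(-2, -1))   # per-channel 2-D DCT
    freq = np.where(np.abs(freq) < thresh, 0.0, freq)       # stand-in for eta(x)
    return idctn(freq, norm="ortho", axes=(-2, -1))         # back to spatial domain

fm = np.random.default_rng(1).normal(size=(16, 8, 8))       # (C, H, W) feature map
out = dct_activation_layer(fm)
print(out.shape)
```

The transform acts on the last two (spatial) axes only, so channels are processed independently, as a convolutional feature map would be.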
As shown in fig. 4, S103 specifically comprises:
S1031, passing the feature map sequentially through four residual blocks to obtain the feature vector sets L_1, L_2, L_3, L_4;
S1032, obtaining a global vector g from L_4 through average pooling, and simultaneously mapping the dimension of L_s, s ∈ {1,2,3}, into the dimension n of g;
S1033, connecting each local feature l_s^i with the global feature g by element-wise addition, and mapping the result into an attention weight C through a learned FC mapping, the calculation of the attention weight C being defined as Formula 2;
where L_s = {l_s^1, l_s^2, …, l_s^n} denotes the set of feature vectors extracted after each residual block, s ∈ {1,2,3}; l_s^i denotes the i-th of the n feature vectors in L_s; g denotes the global feature vector output by the fully-connected layer in FIG. 7; and l_s^i and g both have dimension n.
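A sketch of the attention-weight computation of S1033. Formula 2 itself is not reproduced in this text, so the sketch assumes the FC mapping with weight u scores each element-wise sum l_s^i + g by a dot product; the sizes are illustrative:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 32           # shared dimension of l_s^i and g after the dimension mapping
num_points = 49  # number of local feature points at this stage (e.g. a 7x7 map)

L_s = rng.normal(size=(num_points, n))  # local features l_s^i, already mapped to dim n
g = rng.normal(size=n)                  # global vector from average pooling + FC
u = rng.normal(size=n)                  # learnable weight of the FC attention mapping

# Element-wise ("item-by-item") addition of each local feature with g, then a
# learned FC mapping u yields one attention score per feature point. The dot
# product form is an assumption standing in for the undisclosed Formula 2.
C_s = (L_s + g) @ u
print(C_s.shape)
```

Each entry of `C_s` plays the role of the discriminative weight of one feature point, to be ranked in the key-point module.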
As shown in fig. 5, further, S104 specifically comprises:
S1041, reshaping the feature map L_s and the predicted attention map C_s into one-dimensional arrays {L_s(n)} and {C_s(n)}, respectively, where n = H × W and {L_s(n)} is a feature vector of channel size C;
S1042, selecting the first K values {C_s(k)} from the attention map {C_s(n)}, recording their point positions, and extracting the feature points {L_s(k)} at these positions from the feature map {L_s(n)};
and S1043, calculating a key-point-based global vector g_s through Formula 3:
g_s = C_s(k) ⊙ L_s(k)    (Formula 3)
where ⊙ denotes the inner product between the key points {C_s(k)} and each channel of the feature map {L_s(k)}.
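Steps S1041–S1043 can be sketched directly in NumPy; the shapes (C = 16, H = W = 8) and K = 10 are illustrative choices, not values from the patent:

```python
import numpy as np

rng = np.random.default_rng(3)
C, H, W = 16, 8, 8
K = 10

feature_map = rng.normal(size=(C, H, W))   # L_s
attention = rng.normal(size=(H, W))        # C_s, same spatial size as L_s

# S1041: reshape to one-dimensional arrays of length n = H*W.
L_flat = feature_map.reshape(C, H * W)     # {L_s(n)}, channel size C
C_flat = attention.reshape(H * W)          # {C_s(n)}

# S1042: take the K highest attention values and record their positions.
top_k = np.argsort(C_flat)[::-1][:K]
C_k = C_flat[top_k]                        # {C_s(k)}
L_k = L_flat[:, top_k]                     # {L_s(k)}: features at those positions

# S1043 (Formula 3): inner product of the key-point attention values with
# each channel of the extracted features gives the global vector g_s.
g_s = L_k @ C_k                            # shape (C,)
print(g_s.shape)
```

The selection is input-dependent: a different attention map yields different `top_k` positions, which is exactly the dynamic key-point selection the method relies on.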
As shown in fig. 6, S105 specifically comprises:
S1051, calculating the loss according to the classification result, back-propagating to update the model weights, and fine-tuning the model, wherein the loss function uses the cross-entropy loss of Formula 4;
and S1052, loading the test set, generating adversarial samples through the trained model, calculating the accuracy to evaluate the performance of the model, and saving the optimal model weights.
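A minimal sketch of the loss computation of S1051, following the description's note that the cross-entropy losses of the original image and the adversarial sample are computed separately and then summed; the logits here are random stand-ins for model outputs:

```python
import numpy as np

def cross_entropy(logits, labels):
    """Mean cross-entropy loss from raw logits (numerically stable log-softmax)."""
    z = logits - logits.max(axis=1, keepdims=True)
    log_p = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    return -log_p[np.arange(len(labels)), labels].mean()

rng = np.random.default_rng(4)
labels = rng.integers(0, 10, size=8)
logits_clean = rng.normal(size=(8, 10))  # model output on the original images
logits_adv = rng.normal(size=(8, 10))    # model output on the adversarial samples

# Losses on the clean and adversarial batches are computed separately,
# then summed to form the training objective that is back-propagated.
total_loss = cross_entropy(logits_clean, labels) + cross_entropy(logits_adv, labels)
print(round(float(total_loss), 4))
```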
The invention provides a new model algorithm, an adversarial sample defense method based on attention ranking, shown in fig. 7. The proposed adversarial defense network based on ranking the key points of an attention map has two main ideas: (1) a Discrete Cosine Transform (DCT) activation layer is introduced after the first convolutional layer to effectively suppress noise patterns based on gradient attacks. The motivation for this design is that adversarial noise is gradually amplified during convolution and leads to misclassification (the so-called error amplification effect), which is why the DCT is not applied directly to the original image. (2) To eliminate the remaining interference of adversarial noise, key points are dynamically selected on the feature map for classification based on the attention mechanism, reducing the influence of other attacked pixels.
The overall process is as follows: the image X* is first preprocessed by a convolutional layer and passed to the Discrete Cosine Transform (DCT) activation layer, where most of the adversarial noise is destroyed; the feature map X̃ is then recovered by the Inverse Discrete Cosine Transform (IDCT). X̃ continues into the backbone of ResNet18, whose four residual layers output L_1, L_2, L_3, L_4. After average pooling, L_4 yields an n-dimensional feature vector g that carries the global information of the whole image. Next, instead of following the ResNet18 network, which would fully connect the global vector g to obtain the classification category, the global vector g is fused with L_1, L_2, L_3 respectively to obtain the discrimination weights c_1, c_2, c_3, i.e. the attention maps. Key point detection and ranking is then performed on c_1, c_2, c_3 with L_1, L_2, L_3 respectively, and the corresponding multiplications yield a new feature vector g_1, g_2, g_3 for each layer at its own scale; g_1, g_2, g_3 are concatenated and passed through a fully-connected layer to obtain the classification category.
An adversarial sample defense system based on attention ranking, comprising:
The adversarial sample defense system modifies the base model ResNet18. The whole system is divided into three modules: a discrete cosine transform activation module, an attention module, and a key point ranking module. A Discrete Cosine Transform (DCT) activation layer is introduced after the first convolutional layer to effectively suppress noise patterns based on gradient attacks. In the attention module and the key point ranking module, key points are dynamically selected on the feature map for classification based on an attention mechanism, so as to reduce the influence of other attacked pixels.
Further, the discrete cosine transform activation module is configured to:
apply a Discrete Cosine Transform (DCT) after the adversarial samples pass through the first convolutional layer;
eliminate part of the attack in the frequency domain using the designed activation function η(x);
and restore to the spatial domain using the inverse discrete cosine transform to obtain a feature map with part of the attack eliminated.
Further, the attention module is configured to:
map the dimension of L_s into the dimension n of g before calculating the attention weight;
and connect each local feature l_s^i with the global feature g by element-wise addition, mapping the result into an attention weight C through a learned FC mapping, the calculation of the attention weight C being defined as Formula 2;
where L_s = {l_s^1, l_s^2, …, l_s^n} denotes the set of feature vectors extracted after each residual block, s ∈ {1,2,3}; l_s^i denotes the i-th of the n feature vectors in L_s; g denotes the global feature vector output by the fully-connected layer in FIG. 7; and l_s^i and g both have dimension n.
The key point ranking module is further configured to:
reshape the feature map L_s and the predicted attention map C_s into one-dimensional arrays {L_s(n)} and {C_s(n)}, respectively, where n = H × W and {L_s(n)} is a feature vector of channel size C;
select the first K values {C_s(k)} from the attention map {C_s(n)}, record their point positions, and extract the feature points {L_s(k)} at these positions from the feature map {L_s(n)};
and calculate a key-point-based global vector g_s through Formula 3:
g_s = C_s(k) ⊙ L_s(k)    (Formula 3)
where ⊙ denotes the inner product between the key points {C_s(k)} and each channel of the feature map {L_s(k)}.
The Discrete Cosine Transform (DCT) activation layer, the attention module, and the key point ranking are designed for and used by the invention.
1. Discrete Cosine Transform (DCT) activation layer
In the spatial domain, the adversarial sample and the original image are visually very similar, so separating the adversarial noise from the original image is very difficult; conventional image denoising usually solves this by converting to the frequency domain. We therefore choose a block-wise two-dimensional Discrete Cosine Transform (DCT) [34]. The standard two-dimensional DCT of an N × N block f(x, y) is defined as
F(u, v) = α(u) α(v) Σ_{x=0}^{N−1} Σ_{y=0}^{N−1} f(x, y) cos[(2x+1)uπ / (2N)] cos[(2y+1)vπ / (2N)],
where α(0) = √(1/N) and α(u) = √(2/N) for u > 0.
After this transform, the noise of the image lies in the high-frequency part of the DCT result, and the amplitude of the high-frequency part is generally small. The activation function η(x) is therefore designed to act on these coefficients.
After the DCT-transformed image passes through the activation function η(x), on the one hand the adversarial noise is partially removed; on the other hand, the model is not easily attacked by existing white-box attack methods. This is because most existing white-box attacks rely on gradient back-propagation, and gradients cannot easily propagate through the Discrete Cosine Transform (DCT) activation layer.
2. Attention module
Let L_s = {l_s^1, l_s^2, …, l_s^n} denote the set of feature vectors extracted after each residual block, where s ∈ {1,2,3}, and let l_s^i denote the i-th of the n feature vectors in L_s; g denotes the global feature vector output by the fully-connected layer in fig. 7. Each local feature l_s^i is connected with the global feature g by element-wise addition, and the result is mapped into an attention weight C by a learned FC mapping (an FC layer with weight u); the attention weight C is then calculated as defined in Formula 2.
Here l_s^i and g both have dimension n. Since the number of channels differs for the feature vectors of different s, the dimension of L_s is mapped into the dimension n of g before the attention weight is calculated.
The attention module thus produces the attention weights. The attention weights identify salient image regions and amplify their influence while suppressing irrelevant and potentially confusing information in other regions; in this way, the score plays the role of attention.
3. Key point ranking
And dynamically selecting the most discriminating characteristic points from the generated characteristic map. The selection process is dynamic and is influenced by the input conditions. The selected feature points may differ for different images.
Attention weight learned via network attention moduleRepresenting the discriminative weight of each feature point. Specifically, the secondary size is [ W, H, C]Input feature map L ofsIn this way, it predicts a same space size [ W, H ] by the attention module]Attention diagram CsIn which C iss(i, j) represents a feature Ls(i, j) point of attention. From the feature map LsIn (1), our goal is to select the C with the highest attentionsThe first K characteristic points of (i, j) are obtained again to obtain a new globalAnd (5) vector quantity.
This keypoint detection and ranking comprises the following main steps:
The feature map L^s and the predicted attention map C^s are each reshaped into a one-dimensional array, denoted {L^s(n)} and {C^s(n)}, respectively. Here n = H × W, and each {L^s(n)} is a feature vector of channel size C.
From the attention map {C^s(n)}, the first K values {C^s(k)} are selected and their positions recorded; the feature points {L^s(k)} at these positions are then extracted from the feature map {L^s(n)}.
The keypoint-based global vector g^s is computed as

g^s = C^s(k) ⊙ L^s(k)   (Formula 3)

where ⊙ denotes the inner product between the keypoints {C^s(k)} and each channel of the feature map {L^s(k)} (each reshaped into a one-dimensional array).
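The reshape, top-K selection, and channel-wise inner product described in the steps above can be sketched as follows (function and variable names are our own; NumPy is used for illustration).

```python
import numpy as np

def keypoint_global_vector(L, Cmap, K):
    """L: [H, W, C] feature map; Cmap: [H, W] predicted attention map.
    Flattens both, keeps the K positions with highest attention, and
    forms g_s as the attention-weighted combination of the selected
    feature vectors (one inner product per channel)."""
    H, W, C = L.shape
    L_flat = L.reshape(H * W, C)          # {L_s(n)}, n = H * W
    C_flat = Cmap.reshape(H * W)          # {C_s(n)}
    topk = np.argsort(C_flat)[::-1][:K]   # positions of the first K attention values
    return C_flat[topk] @ L_flat[topk]    # g_s = C_s(k) . L_s(k), per channel
```

Because `topk` depends on the predicted attention map, the selected points change from image to image, which is the dynamic behavior described above.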
Finally, we train the modified network using a cross-entropy loss function. The losses of the original image and of the confrontation sample are computed separately and then summed.
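A minimal sketch of the summed loss described above, computing cross-entropy from raw logits for the original image and the confrontation sample and adding the two (equal weighting of the two terms is an assumption; names are our own).

```python
import numpy as np

def cross_entropy(logits, label):
    """Cross-entropy of a single sample from raw logits (stable log-softmax)."""
    z = logits - logits.max()
    log_probs = z - np.log(np.exp(z).sum())
    return -log_probs[label]

def combined_loss(logits_clean, logits_adv, label):
    """Total training loss: loss on the original image plus loss on the
    confrontation sample, summed as described above."""
    return cross_entropy(logits_clean, label) + cross_entropy(logits_adv, label)
```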
The present invention designs a new network architecture that supports attention-based dynamic keypoint selection, so as to improve the robustness of deep learning networks. A discrete cosine transform and an activation function are introduced into the network to effectively suppress gradient-attack-based noise patterns.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed, or elements inherent to such process, method, article, or apparatus. Without further limitation, an element introduced by the phrase "comprising a ..." does not exclude the presence of other like elements in the process, method, article, or apparatus that comprises the element.
One or more embodiments of the present description may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. One or more embodiments of the specification may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices.
The embodiments in this specification are described in a progressive manner; for identical or similar parts among the embodiments, reference may be made from one to another, and each embodiment focuses on its differences from the others. In particular, since the system embodiment is substantially similar to the method embodiment, its description is brief; for relevant details, reference may be made to the corresponding parts of the description of the method embodiment.
The above description is only an example of the present document and is not intended to limit it. Various modifications and changes may occur to those skilled in the art. Any modifications, equivalent replacements, improvements, and the like made within the spirit and principle of this disclosure are intended to be included within the scope of the claims of this document.
Claims (10)
1. An attention-ranking-based confrontation sample defense method, characterized by specifically comprising the following steps:
S101, pre-training a classification model to obtain confrontation samples;
S102, preprocessing the confrontation samples through a discrete cosine transform layer, the first module of the confrontation sample defense model, to eliminate part of the attacks;
S103, passing the feature map through the attention module to obtain attention weights;
S104, obtaining a new feature vector with global features according to the attention weights;
and S105, training the network according to the loss function, and saving the model.
2. The attention-ranking-based confrontation sample defense method according to claim 1, wherein S101 specifically comprises:
S1011, inputting the original images of the training set and obtaining weights through pre-training;
and S1012, generating confrontation samples using the weights.
3. The attention-ranking-based confrontation sample defense method according to claim 1, wherein S102 specifically comprises:
S1021, after the confrontation sample passes through the first convolutional layer, applying the Discrete Cosine Transform (DCT);
S1022, using the designed activation function η(x) in the frequency domain to eliminate part of the attack, wherein η(x) is as follows:
and S1023, restoring to the spatial domain using the inverse discrete cosine transform to obtain a feature map with part of the attack eliminated.
4. The attention-ranking-based confrontation sample defense method according to claim 1, wherein S103 specifically comprises:
S1031, passing the feature map sequentially through four residual blocks to obtain the feature vector sets L^s;
S1032, obtaining a global vector g from L^4 through average pooling, and simultaneously mapping L^s, s ∈ {1, 2, 3}, to n, the dimension of g;
and S1033, combining the local features l_i^s with the global feature g by item-by-item addition, mapping the resulting feature to an attention weight C through a learned FC mapping, the attention weight C being calculated and defined as Formula 2.
5. The attention-ranking-based confrontation sample defense method according to claim 1, wherein S104 specifically comprises:
S1041, reshaping the feature map L^s and the predicted attention map C^s each into a one-dimensional array, denoted {L^s(n)} and {C^s(n)}, respectively; wherein n = H × W, and {L^s(n)} is a feature vector of channel size C;
S1042, selecting the first K values {C^s(k)} from the attention map {C^s(n)}, recording their positions, and extracting the feature points {L^s(k)} at these positions from the feature map {L^s(n)};
and S1043, calculating the keypoint-based global vector g^s through Formula 3:
g^s = C^s(k) ⊙ L^s(k)   (Formula 3);
wherein ⊙ denotes the inner product between the keypoints {C^s(k)} and each channel of the feature map {L^s(k)}.
6. The attention-ranking-based confrontation sample defense method according to claim 1, wherein S105 specifically comprises:
S1051, calculating the loss according to the classification result, back-propagating to update the model weights, and fine-tuning the model, wherein the cross-entropy loss function of Formula 4 is used as the loss function;
and S1052, loading the test set through the trained model to generate confrontation samples, calculating the accuracy to evaluate the model's performance, and saving the optimal model weights.
7. An attention-ranking-based confrontation sample defense system, characterized by comprising:
the confrontation sample defense system modifies the base model ResNet18; the whole system is divided into three modules: a discrete cosine transform activation module, an attention module, and a keypoint ranking module; a Discrete Cosine Transform (DCT) activation layer is introduced after the first convolutional layer to effectively suppress gradient-attack-based noise patterns; and in the attention module and the keypoint ranking module, keypoints on the feature map are dynamically selected for classification based on an attention mechanism, so as to reduce the influence of other attacked pixel points.
8. The attention-ranking-based confrontation sample defense system according to claim 7, wherein the discrete cosine transform activation module is configured to:
apply the Discrete Cosine Transform (DCT) after the confrontation sample passes through the first convolutional layer;
eliminate part of the attack using the designed activation function η(x) in the frequency domain, wherein η(x) is as follows:
and restore to the spatial domain using the inverse discrete cosine transform to obtain a feature map with part of the attack eliminated.
9. The attention-ranking-based confrontation sample defense system according to claim 7, wherein the attention module is configured to:
map L^s to n, the dimension of g, before calculating the attention weight;
and combine the local features l_i^s with the global feature g by item-by-item addition, map the resulting feature to an attention weight C through a learned FC mapping, the attention weight C being calculated and defined as Formula 2;
wherein L^s represents the set of feature vectors extracted after each residual block, s ∈ {1, 2, 3}; l_i^s represents the i-th of the n feature vectors in the set L^s; g represents the global feature vector output by the fully connected layer in Fig. 7; and the dimensions of l_i^s and g are both n.
10. The attention-ranking-based confrontation sample defense system according to claim 7, wherein the keypoint ranking module is further configured to:
reshape the feature map L^s and the predicted attention map C^s each into a one-dimensional array, denoted {L^s(n)} and {C^s(n)}, respectively; wherein n = H × W, and {L^s(n)} is a feature vector of channel size C;
select the first K values {C^s(k)} from the attention map {C^s(n)}, record their positions, and extract the feature points {L^s(k)} at these positions from the feature map {L^s(n)};
and calculate the keypoint-based global vector g^s through Formula 3:
g^s = C^s(k) ⊙ L^s(k)   (Formula 3);
wherein ⊙ denotes the inner product between the keypoints {C^s(k)} and each channel of the feature map {L^s(k)}.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111175218.7A CN113947704A (en) | 2021-10-09 | 2021-10-09 | Confrontation sample defense system and method based on attention ranking |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113947704A true CN113947704A (en) | 2022-01-18 |
Family
ID=79329423
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111175218.7A Pending CN113947704A (en) | 2021-10-09 | 2021-10-09 | Confrontation sample defense system and method based on attention ranking |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113947704A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111598805A (en) * | 2020-05-13 | 2020-08-28 | 华中科技大学 | Confrontation sample defense method and system based on VAE-GAN |
WO2020263389A1 (en) * | 2019-06-26 | 2020-12-30 | Hrl Laboratories, Llc | System and method for detecting backdoor attacks in convolutional neural networks
US20210012188A1 (en) * | 2019-07-09 | 2021-01-14 | Baidu Usa Llc | Systems and methods for defense against adversarial attacks using feature scattering-based adversarial training |
WO2021051561A1 (en) * | 2019-09-18 | 2021-03-25 | 平安科技(深圳)有限公司 | Adversarial defense method and apparatus for image classification network, electronic device, and computer-readable storage medium |
CN112560901A (en) * | 2020-12-01 | 2021-03-26 | 南京航空航天大学 | Method for defending and confronting sample based on combination of image preprocessing and confronting training |
CN112949822A (en) * | 2021-02-02 | 2021-06-11 | 中国人民解放军陆军工程大学 | Low-perceptibility confrontation sample forming method based on double attention mechanism |
WO2021169292A1 (en) * | 2020-02-24 | 2021-09-02 | 上海理工大学 | Adversarial optimization method for training process of generative adversarial neural network |
Non-Patent Citations (3)
Title |
---|
THI THU THAO KHONG et al.: "Flexible Bayesian Inference by Weight Transfer for Robust Deep Neural Networks", IEICE TRANS. INF. & SYST., 28 July 2021 (2021-07-28) * |
ZHIQUN ZHAO et al.: "Removing Adversarial Noise via Low-Rank Completion of High-Sensitivity Points", IEEE, 10 June 2021 (2021-06-10) * |
LIU Ximeng; XIE Lehui; WANG Yaopeng; LI Xuru: "Adversarial attacks and defenses in deep learning", Chinese Journal of Network and Information Security, no. 05, 13 October 2020 (2020-10-13) * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114596277A (en) * | 2022-03-03 | 2022-06-07 | 北京百度网讯科技有限公司 | Method, device, equipment and storage medium for detecting countermeasure sample |
CN114742170A (en) * | 2022-04-22 | 2022-07-12 | 马上消费金融股份有限公司 | Countermeasure sample generation method, model training method, image recognition method and device |
CN114742170B (en) * | 2022-04-22 | 2023-07-25 | 马上消费金融股份有限公司 | Countermeasure sample generation method, model training method, image recognition method and device |
CN114912550A (en) * | 2022-07-14 | 2022-08-16 | 南京理工大学 | Countermeasure sample detection and identification method based on frequency domain transformation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||