
CN112215788A - Multi-focus image fusion algorithm based on an improved generative adversarial network - Google Patents

Multi-focus image fusion algorithm based on an improved generative adversarial network Download PDF

Info

Publication number
CN112215788A
CN112215788A (application CN202010966366.XA)
Authority
CN
China
Prior art keywords
image
network
generator
discriminator
focus
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010966366.XA
Other languages
Chinese (zh)
Inventor
王娟
柯聪
袁旭亮
丁畅
何宇
刘远远
张鑫午
刘敏
刘聪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei University of Technology
Original Assignee
Hubei University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei University of Technology filed Critical Hubei University of Technology
Priority to CN202010966366.XA priority Critical patent/CN112215788A/en
Publication of CN112215788A publication Critical patent/CN112215788A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a multi-focus image fusion algorithm based on an improved generative adversarial network (GAN), applied to target images captured by focusing at different positions in the same scene. First, a generator network and a discriminator network are designed; to avoid the information loss an image suffers while passing through the network model, the pooling layers are removed from the network structure and image features are extracted by stacked convolutions. Second, a loss function for the generative adversarial network is constructed and the network parameters are optimized to obtain the optimal network model. Finally, the acquired target images are input into the trained model to obtain the fused image. During fusion, the generator of the GAN produces a fused image, and the generated image and the source images are input into the discriminator; if the discriminator cannot tell them apart, the generated image is the optimal fused image.

Description

Multi-focus image fusion algorithm based on an improved generative adversarial network
Technical Field
The invention relates to the technical field of image processing, and in particular to a multi-focus image fusion algorithm based on an improved generative adversarial network.
Background
With the rapid development of technologies such as computers and sensors, these devices have greatly improved the convenience of daily life. Digital images, a product of these new technologies, have gradually permeated everyday life and play an important role in communication between people. As the amount of image information people obtain grows, processing that information becomes increasingly important. Because the focal length of an optical lens is fixed within a certain range, only objects within the depth of field appear sharp in a picture, while other objects may appear blurred. A common technique for acquiring a fully focused image is to fuse multiple images of the same scene taken under different focal settings, i.e., multi-focus image fusion. This technique fuses images focused at different depths so that the fused image retains the detail of the source images to the greatest extent, providing richer information for practical applications such as military reconnaissance, medical diagnosis, and target recognition.
Disclosure of Invention
The present invention is directed to solving, at least to some extent, one of the technical problems in the related art. Accordingly, an object of the present invention is to provide a multi-focus image fusion algorithm based on an improved generative adversarial network, which processes target images captured by focusing at different positions in the same scene to obtain a fused image containing rich detail information.
According to an embodiment of the invention, the multi-focus image fusion algorithm based on an improved generative adversarial network is applied to target images captured by focusing at different positions in the same scene, and comprises the following steps:
s1: designing the network structure of the generator and the discriminator in the generative adversarial network, removing the pooling layers from the network structure, and extracting image features with stacked convolutions;
s2: constructing the objective function of the network model from the structure of the generative adversarial network;
s3: training on a training set to obtain the optimal generative adversarial network model;
s4: applying the generator model obtained in step S3, inputting the source images into the generator to obtain the generated image, and updating the generated image according to the discriminator's judgment of the generated image against the source images.
Preferably, the generator in step S1 is a 5-layer convolutional neural network: the first and second layers use 5x5 convolution kernels, the third and fourth layers use 3x3 kernels, and the last layer uses a 1x1 kernel. The stride of every convolution kernel is set to 1, and the input of the generator is formed by concatenating the two multi-focus images, i.e., the input has 2 channels.
The discriminator in step S1 is an 8-layer convolutional neural network: every layer uses 3x3 convolution kernels, the stride of the second, third, and fourth layers is set to 2, and the stride of the remaining layers is 1.
Preferably, the objective function of the generative adversarial network model constructed in step S2 comprises the objective function of the generator network and the objective function of the discriminator network:
L_GAN = {min(L_G), min(L_D)};
The loss function of the generator comprises two parts: the adversarial loss between the generator and the discriminator, denoted V, and the content loss of image detail information during generation, denoted L_content:
(formula image BDA0002682464810000021)
(formula image BDA0002682464810000022)
L_G can then be expressed as
L_G = V + αL_content
The loss function of the discriminator is denoted L_D:
(formula image BDA0002682464810000023)
preferably, in step S3, an optimal generative confrontation network model is obtained through training of a training set, 50 pairs of multi-focus images are used as the training set of the experiment, each pair of multi-focus images is divided into sub-blocks by a sliding window with a step size of 14 and a size of 64x64, the sub-blocks are expanded to a size of 76x76 in a filling manner and are used as the input of the generator, the size of the fused image output by the generator is still 64x64, and the generated fused image is used as the input of the discriminator and the Adam optimization algorithm is used until the maximum training times are reached.
Preferably, in step S4, the fused image is obtained by the generator and updated through the discriminator. The two input source images I_1 and I_2 pass through the generator G to obtain the fused image I_f. The discriminator D judges, from the extracted image features, whether the fused image I_f contains the detail information of the source images I_1 and I_2. If the discriminator can tell the fused image apart, I_f continues to be updated; if the discriminator cannot, the image generated by the generator is the optimal fused image.
The invention provides a multi-focus image fusion algorithm based on an improved generative adversarial network, which uses the GAN to extract image information at different focus positions and generates a fused image containing rich detail information. First, a generator network and a discriminator network are designed; to avoid the information loss an image suffers while passing through the network model, the pooling layers are removed from the network structure and image features are extracted by stacked convolutions. Second, a loss function for the generative adversarial network is constructed and the network parameters are optimized to obtain the optimal network model. Finally, the acquired target images are input into the trained model to obtain the fused image.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a block diagram of the multi-focus image fusion algorithm based on an improved generative adversarial network according to the present invention;
FIG. 2 is a diagram of the generator network structure in the multi-focus image fusion algorithm based on an improved generative adversarial network according to the present invention;
FIG. 3 is a diagram of the discriminator network structure in the multi-focus image fusion algorithm based on an improved generative adversarial network according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments.
Examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
Example 1:
as shown in fig. 1-3, an image fusion algorithm based on an improved generative adversarial network is provided. First, a generator network and a discriminator network are designed; to avoid the information loss an image suffers while passing through the network model, the pooling layers are removed from the network structure and image features are extracted by stacked convolutions. Second, a loss function for the generative adversarial network is constructed and the network parameters are optimized to obtain the optimal network model. Finally, the acquired target images are input into the trained model to obtain the fused image.
The image fusion algorithm based on the improved generative adversarial network specifically comprises the following steps:
s1: designing the network structure of the generator and the discriminator in the generative adversarial network, removing the pooling layers from the network structure, and extracting image features with stacked convolutions;
s2: constructing the objective function of the network model from the structure of the generative adversarial network;
s3: training on a training set to obtain the optimal generative adversarial network model;
s4: applying the generator model obtained in step S3, inputting the source images into the generator to obtain the generated image, and updating the generated image according to the discriminator's judgment of the generated image against the source images.
Example 2:
as shown in fig. 1 to 3, following the steps of embodiment 1, the network structure of the generator and the discriminator in the generative adversarial network is designed in S1. The generator aims to extract more detail information from the source images and generate a fused image rich in detail. The generator is a 5-layer convolutional neural network: the first and second layers use 5x5 convolution kernels, the third and fourth layers use 3x3 kernels, and the last layer uses a 1x1 kernel. The stride of every layer's convolution kernel is set to 1, and the input of the generator is formed by concatenating the two multi-focus images, i.e., the input has 2 channels. The purpose of the discriminator is to decide whether the target image is an image generated by the generator or a real image, classifying it from its extracted features. The discriminator is an 8-layer convolutional neural network: every layer uses 3x3 convolution kernels, the stride of the second, third, and fourth layers is set to 2, and the stride of the remaining layers is 1.
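The generator geometry above can be checked with simple arithmetic. The sketch below rests on one illustrative assumption — that the convolutions are unpadded ("valid"), which is consistent with embodiment 4, where a 76x76 padded patch comes out of the generator as a 64x64 fused patch; the kernel sizes and strides are taken directly from the text.

```python
def conv_out(size, kernel, stride=1):
    """Spatial output size of an unpadded ("valid") convolution."""
    return (size - kernel) // stride + 1

# Generator: 5 layers with kernels 5, 5, 3, 3, 1, all stride 1.
GEN_KERNELS = [5, 5, 3, 3, 1]

def generator_output_size(size):
    for k in GEN_KERNELS:
        size = conv_out(size, k)
    return size

# Each 5x5 layer trims 4 pixels, each 3x3 layer trims 2, and the 1x1
# layer trims none, for a total of 12: hence 76x76 in, 64x64 out.
print(generator_output_size(76))  # 64
```

The discriminator's stride-2 kernels in layers 2-4 instead downsample the feature maps by roughly a factor of 8, which suits its classification role.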
Example 3:
as shown in FIGS. 1 to 3, following the steps of embodiment 1, the objective function of the generative adversarial network model built in S2 comprises the objective function of the generator network and the objective function of the discriminator network:
L_GAN = {min(L_G), min(L_D)};
The generator loss function comprises two parts: the adversarial loss between the generator and the discriminator, denoted V, and the content loss during generation of image detail information, denoted L_content:
(formula image BDA0002682464810000041)
(formula image BDA0002682464810000042)
L_G can then be expressed as
L_G = V + αL_content
To generate a better fused image, a discriminator is introduced. The loss function of the discriminator is denoted L_D:
(formula image BDA0002682464810000051)
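Since the exact formulas appear only as images in the published text, the numpy sketch below is an assumption: it uses the least-squares adversarial terms common in GAN-based fusion work, with a pixel-plus-gradient content loss as one plausible form of L_content. Only the overall structure L_G = V + αL_content and the roles of V, L_content, and L_D come from the text; the helper names, the labels a and b, and the xi weight are hypothetical.

```python
import numpy as np

def adversarial_loss_G(d_fused, a=1.0):
    """V: push the discriminator's scores on fused images toward the 'real' label a."""
    return float(np.mean((d_fused - a) ** 2))

def content_loss(fused, src1, src2, xi=1.0):
    """L_content (assumed form): keep the pixel intensities of one source
    and the spatial gradients (detail) of the other."""
    gy, gx = np.gradient(fused)
    sy, sx = np.gradient(src2)
    grad_term = np.mean((gy - sy) ** 2 + (gx - sx) ** 2)
    pixel_term = np.mean((fused - src1) ** 2)
    return float(pixel_term + xi * grad_term)

def generator_loss(d_fused, fused, src1, src2, alpha=100.0):
    """L_G = V + alpha * L_content."""
    return adversarial_loss_G(d_fused) + alpha * content_loss(fused, src1, src2)

def discriminator_loss(d_real, d_fused, a=1.0, b=0.0):
    """L_D: score source images as real (label a) and fused images as fake (label b)."""
    return float(np.mean((d_real - a) ** 2) + np.mean((d_fused - b) ** 2))
```

Minimizing L_G and L_D alternately realizes the min-min game written as L_GAN above.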
example 4:
as shown in fig. 1 to 3, following the steps of embodiment 1, the optimal generative adversarial network model is obtained in S3 by training on the training set. 50 pairs of multi-focus images were used as the training set for the experiment. To train the model better, each pair of multi-focus images was divided into sub-blocks by a 64x64 sliding window with a step size of 14, and these sub-blocks were expanded by padding to 76x76 and used as the input of the generator. The fused image output by the generator is still 64x64. The resulting fused image is used as the input of the discriminator, and the Adam optimization algorithm is applied until the maximum number of training iterations is reached.
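The patch preparation described above can be sketched as follows. The window size, step, and padded size come from the text; the reflect padding mode is an assumption, since the text only says the sub-blocks are "expanded by padding".

```python
import numpy as np

PATCH, STEP, PADDED = 64, 14, 76  # sliding-window parameters from the text

def extract_patches(img):
    """Cut one grayscale image into 64x64 sub-blocks and pad each to 76x76."""
    h, w = img.shape
    pad = (PADDED - PATCH) // 2  # 6 pixels on every side
    patches = []
    for y in range(0, h - PATCH + 1, STEP):
        for x in range(0, w - PATCH + 1, STEP):
            block = img[y:y + PATCH, x:x + PATCH]
            patches.append(np.pad(block, pad, mode="reflect"))
    return np.stack(patches)

demo = extract_patches(np.zeros((256, 256)))
print(demo.shape)  # (196, 76, 76) for a 256x256 input image
```

In training, the same windows would be cut from both images of a pair so that corresponding patches stay aligned.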
Example 5:
as shown in fig. 1 to 3, following the steps of embodiment 1, the fused image is obtained by the generator in S4 and updated through the discriminator. The two input source images I_1 and I_2 pass through the generator G to obtain the fused image I_f. The discriminator D judges, from the extracted image features, whether the fused image I_f contains the detail information of the source images I_1 and I_2. If the discriminator can tell the fused image apart, I_f continues to be updated; if the discriminator cannot, the image generated by the generator is the optimal fused image.
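The update-until-indistinguishable logic of step S4 can be sketched as a simple loop. Here `generator`, `discriminator`, and `update` are hypothetical stand-ins for the trained networks and their update rule, and the boolean "can discriminate" test is a simplification of the discriminator's real-valued output; the iteration budget is an added safeguard not stated in the text.

```python
def fuse(src1, src2, generator, discriminator, update, max_iters=100):
    """Generate a fused image, then refine it until the discriminator can
    no longer tell it apart from the sources (or the budget runs out)."""
    fused = generator(src1, src2)
    for _ in range(max_iters):
        if not discriminator(fused, src1, src2):  # D cannot discriminate:
            break                                 # fused image is accepted
        fused = update(fused)                     # otherwise keep refining
    return fused
```

With the loss of embodiment 3, `update` would be one gradient step on L_G.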
In summary, the multi-focus image fusion algorithm based on an improved generative adversarial network achieves end-to-end adaptive fusion and avoids the design of complicated fusion rules. Through the design of the generator and discriminator networks, image features are extracted by stacked convolutions. Next, a loss function for the generative adversarial network is constructed and the network parameters are optimized to obtain the optimal network model. Finally, the acquired target images are input into the trained model to obtain the fused image. The algorithm extracts the detail information and edge features of the two source images well and achieves a better fusion effect.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The above description covers only preferred embodiments of the present invention, but the scope of the present invention is not limited thereto; any equivalent substitution or modification of the technical solutions and inventive concepts described herein that a person skilled in the art could readily conceive shall fall within the scope of the present invention.

Claims (5)

1. A multi-focus image fusion algorithm based on an improved generative adversarial network, applied to target images captured by focusing at different positions in the same scene, characterized by comprising the following steps:
s1: designing the network structure of the generator and the discriminator in the generative adversarial network, removing the pooling layers from the network structure, and extracting image features with stacked convolutions;
s2: constructing the objective function of the network model from the structure of the generative adversarial network;
s3: training on a training set to obtain the optimal generative adversarial network model;
s4: applying the generator model obtained in step S3, inputting the source images into the generator to obtain the generated image, and updating the generated image according to the discriminator's judgment of the generated image against the source images.
2. The multi-focus image fusion algorithm based on an improved generative adversarial network of claim 1, wherein: the generator in step S1 is a 5-layer convolutional neural network; the first and second layers both use 5x5 convolution kernels, the third and fourth layers use 3x3 kernels, and the last layer uses a 1x1 kernel; the stride of each layer's convolution kernel is set to 1, and the input of the generator is formed by concatenating the two multi-focus images, that is, the input has 2 channels;
the discriminator in step S1 is an 8-layer convolutional neural network; every layer uses 3x3 convolution kernels, the stride of the second, third, and fourth layers is set to 2, and the stride of the remaining layers is 1.
3. The multi-focus image fusion algorithm based on an improved generative adversarial network of claim 1, wherein: the objective function of the generative adversarial network model constructed in step S2 comprises the objective function of the generator network and the objective function of the discriminator network:
L_GAN = {min(L_G), min(L_D)};
the loss function of the generator comprises two parts: the adversarial loss between the generator and the discriminator, denoted V, and the content loss of image detail information during generation, denoted L_content:
(formula image FDA0002682464800000011)
(formula image FDA0002682464800000012)
L_G can then be expressed as
L_G = V + αL_content
the loss function of the discriminator is denoted L_D:
(formula image FDA0002682464800000021)
4. the multi-focus image fusion algorithm based on an improved generative adversarial network of claim 1, wherein: in step S3, the optimal generative adversarial network model is obtained by training on a training set; 50 pairs of multi-focus images are used as the training set for the experiment; each pair of multi-focus images is divided into sub-blocks by a 64x64 sliding window with a step size of 14; the sub-blocks are expanded by padding to 76x76 and used as the input of the generator; the fused image output by the generator is still 64x64; the generated fused image is used as the input of the discriminator, and the Adam optimization algorithm is applied until the maximum number of training iterations is reached.
5. The multi-focus image fusion algorithm based on an improved generative adversarial network of claim 1, wherein: in step S4, the fused image is obtained through the generator and updated through the discriminator; the two input source images I_1 and I_2 pass through the generator G to obtain the fused image I_f; the discriminator D judges, from the extracted image features, whether the fused image I_f contains the detail information of the source images I_1 and I_2; if the discriminator can tell the fused image apart, I_f continues to be updated; if the discriminator cannot, the image generated by the generator is the optimal fused image.
CN202010966366.XA 2020-09-15 2020-09-15 Multi-focus image fusion algorithm based on an improved generative adversarial network Pending CN112215788A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010966366.XA CN112215788A (en) 2020-09-15 2020-09-15 Multi-focus image fusion algorithm based on an improved generative adversarial network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010966366.XA CN112215788A (en) 2020-09-15 2020-09-15 Multi-focus image fusion algorithm based on an improved generative adversarial network

Publications (1)

Publication Number Publication Date
CN112215788A true CN112215788A (en) 2021-01-12

Family

ID=74049550

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010966366.XA Pending CN112215788A (en) 2020-09-15 2020-09-15 Multi-focus image fusion algorithm based on an improved generative adversarial network

Country Status (1)

Country Link
CN (1) CN112215788A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113112439A (en) * 2021-04-14 2021-07-13 展讯半导体(南京)有限公司 Image fusion method, training method, device and equipment of image fusion model
CN113610732A (en) * 2021-08-10 2021-11-05 大连理工大学 Full-focus image generation method based on interactive adversarial learning
CN113723470A (en) * 2021-08-09 2021-11-30 北京工业大学 Pollen image synthesis method and device fusing multilayer information and electronic equipment
CN116597268A (en) * 2023-07-17 2023-08-15 中国海洋大学 Efficient multi-focus image fusion method and model building method thereof
CN114782297B (en) * 2022-04-15 2023-12-26 电子科技大学 Image fusion method based on motion-friendly multi-focus fusion network

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109325931A (en) * 2018-08-22 2019-02-12 中北大学 Multi-modality image fusion method based on a generative adversarial network and a super-resolution network

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109325931A (en) * 2018-08-22 2019-02-12 中北大学 Multi-modality image fusion method based on a generative adversarial network and a super-resolution network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王娟 et al.: "Multi-focus image fusion based on an improved generative adversarial network" (基于改进生成对抗网络的多聚焦图像融合), Science Technology and Engineering (《科学技术与工程》) *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113112439A (en) * 2021-04-14 2021-07-13 展讯半导体(南京)有限公司 Image fusion method, training method, device and equipment of image fusion model
CN113723470A (en) * 2021-08-09 2021-11-30 北京工业大学 Pollen image synthesis method and device fusing multilayer information and electronic equipment
CN113723470B (en) * 2021-08-09 2024-08-27 北京工业大学 Pollen image synthesis method and device integrating multilayer information and electronic equipment
CN113610732A (en) * 2021-08-10 2021-11-05 大连理工大学 Full-focus image generation method based on interactive adversarial learning
CN113610732B (en) * 2021-08-10 2024-02-09 大连理工大学 Full-focus image generation method based on interactive adversarial learning
CN114782297B (en) * 2022-04-15 2023-12-26 电子科技大学 Image fusion method based on motion-friendly multi-focus fusion network
CN116597268A (en) * 2023-07-17 2023-08-15 中国海洋大学 Efficient multi-focus image fusion method and model building method thereof
CN116597268B (en) * 2023-07-17 2023-09-22 中国海洋大学 Efficient multi-focus image fusion method and model building method thereof

Similar Documents

Publication Publication Date Title
CN112215788A (en) Multi-focus image fusion algorithm based on an improved generative adversarial network
Wang et al. Selective light field refocusing for camera arrays using bokeh rendering and superresolution
US10645368B1 (en) Method and apparatus for estimating depth of field information
CN110555434B (en) Method for detecting visual saliency of three-dimensional image through local contrast and global guidance
EP3236391B1 (en) Object detection and recognition under out of focus conditions
CN107122796B (en) A kind of remote sensing image classification method based on multiple-limb network integration model
Ma et al. Defocus image deblurring network with defocus map estimation as auxiliary task
JP6011862B2 (en) 3D image capturing apparatus and 3D image capturing method
Jung et al. Active confocal imaging for visual prostheses
Li et al. End-to-end learning of deep convolutional neural network for 3D human action recognition
WO2019019086A1 (en) Image contrast enhancement method and device, and storage medium
CN108805889A (en) The fining conspicuousness method for segmenting objects of margin guide and system, equipment
CN111612711A (en) Improved picture deblurring method based on generation countermeasure network
Anwar et al. Deblur and deep depth from single defocus image
CN111047543A (en) Image enhancement method, device and storage medium
CN114187221A (en) Infrared and visible light image fusion method based on adaptive weight learning
Vadathya et al. Learning light field reconstruction from a single coded image
CN112241940B (en) Fusion method and device for multiple multi-focus images
CN113112439B (en) Image fusion method, training method, device and equipment of image fusion model
CN115965844B (en) Multi-focus image fusion method based on visual saliency priori knowledge
Li et al. No-reference stereoscopic image quality assessment based on local to global feature regression
Cherian et al. Image Augmentation Using Hybrid RANSAC Algorithm
Czúni et al. Depth-based indexing and retrieval of photographic images
Frechette et al. Gradient based multifocus video image fusion
Xu et al. Multi-focus image fusing based on non-negative matrix factorization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210112

WD01 Invention patent application deemed withdrawn after publication