
CN110956617A - Brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model - Google Patents

Brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model

Info

Publication number
CN110956617A
CN110956617A
Authority
CN
China
Prior art keywords
network
glimpse
magnetic resonance
model
core network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911129218.6A
Other languages
Chinese (zh)
Inventor
柯丰恺
刘欢平
赵大兴
孙国栋
冯维
柳晨康
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei University of Technology
Original Assignee
Hubei University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei University of Technology
Priority to CN201911129218.6A
Publication of CN110956617A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/0002 Inspection of images, e.g. flaw detection
    • G06T7/0012 Biomedical image inspection
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/54 Browsing; Visualisation therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10072 Tomographic images
    • G06T2207/10088 Magnetic resonance imaging [MRI]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30004 Biomedical image processing
    • G06T2207/30016 Brain

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Quality & Reliability (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Magnetic Resonance Imaging Apparatus (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model. The method collects brain nuclear magnetic resonance abnormal images of patients as training samples, trains the recurrent attention model on these samples to obtain trained network parameters, and then uses the trained RAM model to perform visual detection of brain nuclear magnetic resonance abnormal images. The model is simple and fast, achieves high precision, and has stronger detection capability than a conventional CNN.

Description

Brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model
Technical Field
The invention belongs to the technical field of nuclear magnetic resonance abnormal image visualization, and in particular relates to a brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model.
Background
The most common deep-learning models are network models based on CNNs and RNNs. When deep learning is used to detect defects in brain magnetic resonance images, the usual approach is to extract low-level features from the input magnetic resonance image through operations such as convolution and pooling, apply nonlinear transformations, identify the defects in the image, and finally classify them. However, as the model input grows, the training data for CNNs and RNNs grows linearly and the training time multiplies. To address this problem, the Recurrent Attention Model (RAM) was proposed. Built on the serialized input of an RNN, it imitates the attention mechanism of human vision and selectively processes only a region of interest rather than the whole image, which greatly reduces the model's computation time and its computational complexity.
The conventional attention mechanism generally takes two forms, soft attention and hard attention. Soft attention is mainly built from saliency maps of the visual signal. A saliency map is obtained by processing low-level image features and cannot take deeper features, such as the semantic features of the image, into account, so models built on it do not handle brain magnetic resonance defects well. Although CNN-based approaches can extract more high-level features from the original image, such models are complex and each pass processes an enormous amount of information.
Disclosure of Invention
The aim of the invention is to overcome the shortcomings of the above techniques by providing a fast, high-precision brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model.
To achieve this aim, the brain nuclear magnetic resonance abnormal image visualization method based on the recurrent attention model comprises the following steps:
1) acquiring brain nuclear magnetic resonance abnormal images of patients as training samples;
2) training the recurrent attention model with the training samples to obtain trained network parameters;
2.1) constructing a RAM model and randomly initializing network parameters
A RAM model is constructed. The RAM model is a new location-based hard attention algorithm that combines reinforcement learning with a recurrent neural network. The RAM model comprises five parts, a Glimpse network, a Core network, an Action network, a Location network and a Baseline network, and is randomly initialized, i.e. the parameters of the Glimpse network, the Core network and the Action network, together with those of the Location network l(h_t; θ_l) and the Baseline network b(h_t; θ_b), are set to random initial values;
2.2) training the RAM model
2.2.1) randomly initialize the first attention position l_0;
2.2.2) according to the first attention position l_0, obtain the Glimpse feature g_0;
2.2.3) initialize the first hidden state h_0 of the time-series Core network to 0;
2.2.4) take the hidden state h_0 of the Core network and the feature g_0 of the Glimpse network as the Core network input, and output a new hidden state h_1;
2.2.5) take the new hidden state h_1 of the Core network as the input of the Action network, and output the predicted classification result a_1;
2.2.6) take the new hidden state h_1 of the Core network as the input of the Baseline network, and output a one-dimensional vector b_1; the Baseline network reduces the variance of the gradient to within a threshold range ε;
2.2.7) take the new hidden state h_1 of the Core network as the input of the Location network; the Location network uses a policy-gradient algorithm to determine the attention position of the next time step, and outputs the next attention position l_1;
2.2.8) loop over steps 2.2.2) to 2.2.7), repeating T times;
2.2.9) constructing a loss function for the network
Define the overall loss function of the RAM training model as:

Loss(θ) = L_action + L_location + L_baseline

where the three terms are the Action-network cross-entropy loss, the Location-network policy-gradient loss and the Baseline-network mean-square-error loss described below;
2.2.10) back-propagate through the neural network according to the loss function, thereby updating the parameters of the network
According to the loss function Loss(θ) from step 2.2.9), the Action network is trained and its error is back-propagated to the Glimpse network and the Core network. During back-propagation, the Location network and the Baseline network are trained with the policy gradient of reinforcement learning; although their input is the hidden state of the Core network, their error is not back-propagated to the Core network and the Glimpse network. The reinforcement-learning weight update is:

θ_{t+1} = θ_t + α γ^t R_t ∇_θ log π(A_t | S_t, θ)

where θ is the parameter of the policy gradient, and the parameters θ_fg, θ_fh and θ_l of the Glimpse network, the Core network and the Location network are all policy parameters controlled by θ; in this way the parameters of the whole network are updated;
2.2.11) repeat the training of steps 2.2.1) to 2.2.10) M times to obtain the final network parameters;
3) use the trained RAM model to perform visual detection on brain nuclear magnetic resonance abnormal images.
Further, in step 2.2), the Glimpse network comprises a Glimpse sensor. The Glimpse sensor samples the brain magnetic resonance image x of the patient to be processed around the first attention position: taking the first attention position as the center of the gaze region, it obtains 4 square images of different side lengths and uniformly transforms them into a group of 32 × 32 images using nearest-neighbor interpolation, so that the middle region around the first attention position l_0 is a higher-resolution image and the larger regions outward from the middle are progressively lower-resolution images. The Glimpse sensor then performs feature extraction on the obtained group of images and the first attention position l_0, and the feature g_0 output by the Glimpse network is obtained through fully connected layers;
Further, in step 2.2.4), the Core network is in fact an RNN. At each time step, the hidden state h_0 output by the Core network at the previous time step and the feature g_0 currently output by the Glimpse network are combined as the input of the Core network, and the Core network outputs the new hidden state h_1 of the RNN;
Further, in step 2.2.5), the Action network takes the new hidden state h_1 output by the Core network as input. The training target is the detection of the patient's brain nuclear magnetic resonance image; the Action network is a classification network and outputs the predicted classification result a_1. A reward function is then obtained from the predicted classification result a_1 and the actual label of the image: if the classification result a is correct the reward is 1, otherwise it is 0.
Further, in step 2.2.9), the loss function is composed of three parts:
The first part is the cross-entropy loss between the final classification result a_T and the actual category of the image:

L_action = -Σ_i y_i log(ŷ_i)

where y_i is the true label value of the input magnetic resonance image and ŷ_i is the label value predicted by the model;
the second part is a loss function of the Location network policy gradient algorithm, and the loss function formula is as follows:
Figure BDA0002277812300000043
wherein theta is a parameter of the strategy gradient, and parameters of the Glimpse network, the Core network and the Location network
Figure BDA0002277812300000044
θlAre all the strategy parameters controlled by the parameter theta of the strategy gradient,
Figure BDA0002277812300000045
the reward obtained for each sample in reinforcement learning,
Figure BDA0002277812300000046
the attenuation coefficient of the obtained reward is between 0 and 1,
Figure BDA0002277812300000047
the action behaviors sampled by the policy gradient algorithm,
Figure BDA0002277812300000048
the behavior states sampled by the policy gradient algorithm,
Figure BDA0002277812300000049
performing an action behavior
Figure BDA00022778123000000416
Accumulated awards obtained, biIs a reward benchmark value that depends on the behavior state
Figure BDA00022778123000000410
Independent of motion behavior
Figure BDA00022778123000000411
Figure BDA00022778123000000412
The representative is that the strategy gradient algorithm directly parameterizes the strategy, namely, the strategy is represented by a parameterized function to seek the optimal strategy;
the loss function of the third part is based on the classification result aTThe reward obtained whether the reward is correct and the baseline form a mean square error loss function, and the loss function formula is as follows:
Figure BDA00022778123000000413
using error to obtain training reference value biThe increase of the baseline network can reduce the probability of behavior actions below a baseline value and increase the probability of behavior actions above the baseline value, wherein,
Figure BDA00022778123000000417
the reward obtained for each sample in reinforcement learning if the classification result aTIf correct, the reward function is
Figure BDA00022778123000000414
Otherwise the reward function is
Figure BDA00022778123000000415
biA baseline value predicted for each sample; according to the loss functions of the first part, the second part and the third part, the loss function of the RAM training model is defined as follows:
Figure BDA0002277812300000051
compared with the prior art, the invention has the following advantages: the brain nuclear magnetic resonance abnormal image visualization method based on the circulation attention model is simple in model, high in speed and precision and stronger in detection capability than the traditional CNN network.
Drawings
FIG. 1 is a schematic diagram of the recurrent attention model of the invention.
Detailed Description
The invention is described in further detail below with reference to the figures and the specific embodiments.
A brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model comprises the following specific steps:
1) acquiring brain nuclear magnetic resonance abnormal images of patients as training samples;
2) training the recurrent attention model (RAM) with the training samples to obtain trained network parameters;
2.1) constructing a RAM model and randomly initializing network parameters
A RAM model is constructed. The RAM model is a new location-based hard attention algorithm that combines reinforcement learning with a recurrent neural network. As shown in FIG. 1, the RAM model comprises five parts, a Glimpse network, a Core network, an Action network, a Location network and a Baseline network, and is randomly initialized, i.e. the parameters of the Glimpse network, the Core network and the Action network, together with those of the Location network l(h_t; θ_l) and the Baseline network b(h_t; θ_b), are set to random initial values;
2.2) training the RAM model
2.2.1) randomly initialize the first attention position l_0;
2.2.2) according to the first attention position l_0, obtain the Glimpse feature g_0;
The Glimpse network comprises a Glimpse sensor. The Glimpse sensor samples the brain nuclear magnetic resonance image x of the patient to be processed around the attention position, obtaining 4 square images of different side lengths centered on it as the center of the gaze region, and then uniformly transforms them into a group of 32 × 32 images using nearest-neighbor interpolation; the middle region around the first attention position l_0 is a higher-resolution image, and the larger regions outward from the middle are progressively lower-resolution images. The Glimpse sensor then performs feature extraction on the obtained group of images and the first attention position l_0, and the feature g_0 output by the Glimpse network is obtained through fully connected layers;
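The patent gives no code, but the sampling just described can be sketched as follows. This is a minimal PyTorch sketch; the framework choice, the 32-pixel base patch side and the batch layout are illustrative assumptions, not taken from the patent:

```python
import torch
import torch.nn.functional as F

def glimpse_sensor(x, loc, base=32, num_scales=4, out=32):
    # x: (B, 1, H, W) brain MR images; loc: (B, 2) attention position in [-1, 1].
    # Extract 4 concentric square patches of growing side length centered at loc,
    # then resize each to out x out by nearest-neighbor interpolation (step 2.2.2).
    B, _, H, W = x.shape
    patches, size = [], base
    for _ in range(num_scales):
        half = size // 2
        xp = F.pad(x, (half, half, half, half))            # keep border crops in range
        cx = ((loc[:, 0] + 1.0) / 2.0 * W).long() + half   # pixel coords in padded image
        cy = ((loc[:, 1] + 1.0) / 2.0 * H).long() + half
        crop = torch.stack([xp[b, :, cy[b] - half:cy[b] + half,
                                      cx[b] - half:cx[b] + half] for b in range(B)])
        patches.append(F.interpolate(crop, size=(out, out), mode="nearest"))
        size *= 2
    # the smallest patch keeps full resolution at the center; larger ones are coarser
    return torch.cat(patches, dim=1)                       # (B, num_scales, out, out)
```

The stack of patches would then be flattened and, together with l_0, passed through fully connected layers to produce g_0, as described above.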
2.2.3) initialize the first hidden state h_0 of the time-series Core network to 0;
2.2.4) take the hidden state h_0 of the Core network and the feature g_0 of the Glimpse network as the Core network input, and output a new hidden state h_1;
The Core network is in fact an RNN. At each time step, the hidden state h_0 output by the Core network at the previous time step and the feature g_0 currently output by the Glimpse network are combined as the input of the Core network, and the Core network outputs the new hidden state h_1 of the RNN;
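One conventional way to realize this recurrence, sketched under the same PyTorch assumption (the 256-unit width and the ReLU combination of the two linear layers are illustrative choices, not specified by the patent):

```python
import torch
import torch.nn as nn

class CoreNetwork(nn.Module):
    # Step 2.2.4): combine the previous hidden state h_{t-1} with the current
    # glimpse feature g_t and output the new hidden state h_t.
    def __init__(self, g_dim=256, h_dim=256):
        super().__init__()
        self.fc_g = nn.Linear(g_dim, h_dim)
        self.fc_h = nn.Linear(h_dim, h_dim)

    def forward(self, g_t, h_prev):
        return torch.relu(self.fc_g(g_t) + self.fc_h(h_prev))
```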
2.2.5) take the new hidden state h_1 of the Core network as the input of the Action network, and output the predicted classification result a_1;
The Action network takes the new hidden state h_1 output by the Core network as input. The training target is the detection of the patient's brain nuclear magnetic resonance image; the Action network is a classification network and outputs the predicted classification result a_1. A reward function is then obtained from the predicted classification result a_1 and the actual label of the image: if the classification result a is correct the reward is 1, otherwise it is 0;
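A matching classification head and the 0/1 reward could look as follows (PyTorch assumed; the two-class normal/abnormal output is an illustrative assumption):

```python
import torch.nn as nn

class ActionNetwork(nn.Module):
    # Step 2.2.5): map the hidden state to class logits; the reward is 1 when
    # the predicted class matches the true label and 0 otherwise.
    def __init__(self, h_dim=256, num_classes=2):
        super().__init__()
        self.fc = nn.Linear(h_dim, num_classes)

    def forward(self, h_t):
        return self.fc(h_t)

# logits = action_net(h_T)
# reward = (logits.argmax(dim=1) == labels).float()   # 1 if correct, else 0
```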
2.2.6) take the new hidden state h_1 of the Core network as the input of the Baseline network, and output a one-dimensional vector b_1; the Baseline network reduces the variance of the gradient to within a threshold range ε; a minimal sketch of such a network follows;
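The Baseline network can be as small as a single linear unit; in this sketch the hidden state is detached so that, as required in step 2.2.10), no gradient flows from the Baseline network back into the Core network:

```python
import torch.nn as nn

class BaselineNetwork(nn.Module):
    # Step 2.2.6): predict a scalar baseline b_t from the hidden state.
    def __init__(self, h_dim=256):
        super().__init__()
        self.fc = nn.Linear(h_dim, 1)

    def forward(self, h_t):
        return self.fc(h_t.detach()).squeeze(1)  # no gradient into the Core network
```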
2.2.7) take the new hidden state h_1 of the Core network as the input of the Location network; the Location network uses a policy-gradient algorithm to determine the attention position of the next time step, and outputs the next attention position l_1;
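A stochastic location policy of this kind is commonly realized as a Gaussian over image coordinates whose mean is predicted from the hidden state; the sketch below follows that convention. The fixed standard deviation and the tanh squashing are assumptions, and the detach again keeps the policy gradient out of the Core network, as step 2.2.10) requires:

```python
import torch
import torch.nn as nn

class LocationNetwork(nn.Module):
    # Step 2.2.7): sample the next attention position l_t and keep
    # log pi(l_t | s_t; theta) for the REINFORCE term of the loss.
    def __init__(self, h_dim=256, std=0.15):
        super().__init__()
        self.fc = nn.Linear(h_dim, 2)
        self.std = std

    def forward(self, h_t):
        mu = torch.tanh(self.fc(h_t.detach()))       # mean in [-1, 1]
        dist = torch.distributions.Normal(mu, self.std)
        l_t = dist.sample().clamp(-1.0, 1.0)
        log_pi = dist.log_prob(l_t).sum(dim=1)       # sum over the 2 coordinates
        return l_t, log_pi
```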
2.2.8) loop over steps 2.2.2) to 2.2.7), repeating T times;
2.2.9) constructing a loss function for the network
The loss function consists of three parts:
the first part is the last classification result aTAnd the cross entropy loss function formed by the actual category of the image, wherein the loss function formula is as follows:
Figure BDA0002277812300000071
wherein y isiFor the input magnetic resonance image true tag values,
Figure BDA0002277812300000072
a predicted tag value for the target;
the second part is a loss function of the Location network policy gradient algorithm, and the loss function formula is as follows:
Figure BDA0002277812300000073
wherein theta is a parameter of the strategy gradient, and parameters of the Glimpse network, the Core network and the Location network
Figure BDA0002277812300000074
θlAre all the strategy parameters controlled by the parameter theta of the strategy gradient,
Figure BDA0002277812300000075
the reward obtained for each sample in reinforcement learning,
Figure BDA0002277812300000076
the attenuation coefficient of the obtained reward is between 0 and 1,
Figure BDA0002277812300000077
the action behaviors sampled by the policy gradient algorithm,
Figure BDA0002277812300000078
the behavior states sampled by the policy gradient algorithm,
Figure BDA0002277812300000079
performing an action behavior
Figure BDA00022778123000000710
Accumulated awards obtained, biIs a reward benchmark value that depends on the behavior state
Figure BDA00022778123000000711
Independent of motion behavior
Figure BDA00022778123000000712
Figure BDA00022778123000000713
The representative is that the strategy gradient algorithm directly parameterizes the strategy, namely, the strategy is represented by a parameterized function to seek the optimal strategy;
the loss function of the third part is based on the classification result aTThe reward obtained whether the reward is correct and the baseline form a mean square error loss function, and the loss function formula is as follows:
Figure BDA00022778123000000714
using error to obtain training reference value biThe increase of the baseline network can reduce the probability of behavior actions below a baseline value and increase the probability of behavior actions above the baseline value, wherein,
Figure BDA00022778123000000715
the reward obtained for each sample in reinforcement learning if the classification result aTIf correct, the reward function is
Figure BDA00022778123000000716
Otherwise the reward function is
Figure BDA00022778123000000717
biA baseline value predicted for each sample;
according to the loss functions of the first part, the second part and the third part, the loss function of the RAM training model is defined as follows:
Figure BDA0002277812300000081
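Under the same PyTorch assumptions, the three parts combine into a single objective. A sketch follows; since the reward here is granted only for the final classification, the decay coefficient γ does not appear explicitly:

```python
import torch
import torch.nn.functional as F

def ram_loss(logits, labels, log_pis, baselines):
    # Step 2.2.9): hybrid loss = cross-entropy (Action) + REINFORCE with
    # baseline (Location) + mean-square error (Baseline).
    # log_pis, baselines: (B, T) stacked over the T glimpses.
    reward = (logits.argmax(dim=1) == labels).float()      # R in {0, 1}
    returns = reward.unsqueeze(1).expand_as(baselines)     # same terminal R at each step
    loss_action = F.cross_entropy(logits, labels)
    loss_baseline = F.mse_loss(baselines, returns)
    advantage = (returns - baselines).detach()             # (R - b_i), no grad to baseline
    loss_location = -(log_pis * advantage).mean()
    return loss_action + loss_location + loss_baseline
```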
2.2.10) back-propagate through the neural network according to the loss function, thereby updating the parameters of the network
According to the loss function Loss(θ) from step 2.2.9), the Action network is trained and its error is back-propagated to the Glimpse network and the Core network. It should be noted that throughout back-propagation the Location network and the Baseline network are trained with the policy gradient of reinforcement learning; although their input is the hidden state of the Core network, their error is not back-propagated to the Core network and the Glimpse network, because otherwise the algorithm is difficult to converge and performs poorly. The reinforcement-learning weight update is:

θ_{t+1} = θ_t + α γ^t R_t ∇_θ log π(A_t | S_t, θ)

where θ is the parameter of the policy gradient, and the parameters θ_fg, θ_fh and θ_l of the Glimpse network, the Core network and the Location network are all policy parameters controlled by θ; in this way the parameters of the whole network are updated;
2.2.11) repeat the training of steps 2.2.1) to 2.2.10) M times to obtain the final network parameters;
3) the trained RAM model can then be used to perform visual detection of brain nuclear magnetic resonance abnormal images.
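Putting the pieces together, the training of steps 2.2.1) to 2.2.11) might be driven by a loop like the following. This sketch assumes the components above are collected in a model object exposing glimpse (the sensor plus its fully connected layers), core, baseline, location and action, and reuses the ram_loss sketch; the Adam optimizer and learning rate are illustrative assumptions (M = 60000 and T = 7 match the experimental parameters below):

```python
import torch

def train_ram(model, loader, T=7, M=60000, h_dim=256, lr=1e-4):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    batches = iter(loader)
    for step in range(M):                              # 2.2.11) repeat M times
        try:
            x, y = next(batches)
        except StopIteration:
            batches = iter(loader)
            x, y = next(batches)
        B = x.size(0)
        l_t = torch.empty(B, 2).uniform_(-1, 1)        # 2.2.1) random first position
        h_t = torch.zeros(B, h_dim)                    # 2.2.3) h_0 = 0
        log_pis, baselines = [], []
        for _ in range(T):                             # 2.2.8) loop T times
            g_t = model.glimpse(x, l_t)                # 2.2.2) glimpse feature
            h_t = model.core(g_t, h_t)                 # 2.2.4) new hidden state
            baselines.append(model.baseline(h_t))      # 2.2.6) baseline b_t
            l_t, log_pi = model.location(h_t)          # 2.2.7) next position
            log_pis.append(log_pi)
        logits = model.action(h_t)                     # 2.2.5) final prediction a_T
        loss = ram_loss(logits, y,                     # 2.2.9) hybrid loss
                        torch.stack(log_pis, dim=1),
                        torch.stack(baselines, dim=1))
        opt.zero_grad()
        loss.backward()                                # 2.2.10) update all parameters
        opt.step()
```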
Experimental data:
the experimental data is brain nuclear magnetic resonance abnormal images of patients with the size of about 12 thousands and the size of 128 x 128 as training samples; parameters in the experiment: the training times M of the RAM model are 60000 times, T is 7 times, batch is 256 groups, and the experimental results are as follows:
Figure BDA0002277812300000083
the training comparison experiment is carried out according to two different models in the table 1, and compared with a Convolutional Neural Network (CNN), the RAM model provided by the text has the advantages that the CNN network detection capability is poor, the RAM model detection operation speed is high, the accuracy is high, and the experiment effect is good.

Claims (5)

1. A brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model, characterized in that the method comprises the following steps:
1) acquiring brain nuclear magnetic resonance abnormal images of patients as training samples;
2) training the recurrent attention model with the training samples to obtain trained network parameters;
2.1) constructing a RAM model and randomly initializing network parameters
A RAM model is constructed. The RAM model is a new location-based hard attention algorithm that combines reinforcement learning with a recurrent neural network. The RAM model comprises five parts, a Glimpse network, a Core network, an Action network, a Location network and a Baseline network, and is randomly initialized, i.e. the parameters of the Glimpse network, the Core network and the Action network, together with those of the Location network l(h_t; θ_l) and the Baseline network b(h_t; θ_b), are set to random initial values;
2.2) training the RAM model
2.2.1) randomly initialize the first attention position l_0;
2.2.2) according to the first attention position l_0, obtain the Glimpse feature g_0;
2.2.3) initialize the first hidden state h_0 of the time-series Core network to 0;
2.2.4) take the hidden state h_0 of the Core network and the feature g_0 of the Glimpse network as the Core network input, and output a new hidden state h_1;
2.2.5) take the new hidden state h_1 of the Core network as the input of the Action network, and output the predicted classification result a_1;
2.2.6) take the new hidden state h_1 of the Core network as the input of the Baseline network, and output a one-dimensional vector b_1; the Baseline network reduces the variance of the gradient to within a threshold range ε;
2.2.7) take the new hidden state h_1 of the Core network as the input of the Location network; the Location network uses a policy-gradient algorithm to determine the attention position of the next time step, and outputs the next attention position l_1;
2.2.8) loop over steps 2.2.2) to 2.2.7), repeating T times;
2.2.9) constructing a loss function for the network
Define the overall loss function of the RAM training model as:

Loss(θ) = L_action + L_location + L_baseline

where the three terms are the Action-network cross-entropy loss, the Location-network policy-gradient loss and the Baseline-network mean-square-error loss;
2.2.10) back-propagating the neural network according to the loss function, thereby updating the parameters of the network
According to the loss function Loss(θ) from step 2.2.9), the Action network is trained and its error is back-propagated to the Glimpse network and the Core network. During back-propagation, the Location network and the Baseline network are trained with the policy gradient of reinforcement learning; although their input is the hidden state of the Core network, their error is not back-propagated to the Core network and the Glimpse network. The reinforcement-learning weight update is:

θ_{t+1} = θ_t + α γ^t R_t ∇_θ log π(A_t | S_t, θ)

where θ is the parameter of the policy gradient, and the parameters θ_fg, θ_fh and θ_l of the Glimpse network, the Core network and the Location network are all policy parameters controlled by θ; in this way the parameters of the whole network are updated;
2.2.11) repeat the training of steps 2.2.1) to 2.2.10) M times to obtain the final network parameters;
3) use the trained RAM model to perform visual detection on brain nuclear magnetic resonance abnormal images.
2. The brain nuclear magnetic resonance abnormal image visualization method based on the recurrent attention model according to claim 1, characterized in that: in step 2.2), the Glimpse network comprises a Glimpse sensor; the Glimpse sensor samples the brain magnetic resonance image x of the patient to be processed around the first attention position, obtaining 4 square images of different side lengths with the first attention position as the center of the gaze region, and then uniformly transforms them into a group of 32 × 32 images using nearest-neighbor interpolation; the middle region around the first attention position l_0 is a higher-resolution image, and the larger regions outward from the middle are progressively lower-resolution images; the Glimpse sensor then performs feature extraction on the obtained group of images and the first attention position l_0, and the feature g_0 output by the Glimpse network is obtained through fully connected layers.
3. The brain nuclear magnetic resonance abnormal image visualization method based on the recurrent attention model according to claim 1, characterized in that: in step 2.2.4), the Core network is in fact an RNN; at each time step, the hidden state h_0 output by the Core network at the previous time step and the feature g_0 currently output by the Glimpse network are combined as the input of the Core network, and the Core network outputs the new hidden state h_1 of the RNN.
4. The brain nuclear magnetic resonance abnormal image visualization method based on the recurrent attention model according to claim 1, characterized in that: in step 2.2.5), the Action network takes the new hidden state h_1 output by the Core network as input; the training target is the detection of the patient's brain nuclear magnetic resonance image; the Action network is a classification network and outputs the predicted classification result a_1; a reward function is then obtained from the predicted classification result a_1 and the actual label of the image, where if the classification result a is correct the reward function is 1, otherwise it is 0.
5. The brain nuclear magnetic resonance abnormal image visualization method based on the recurrent attention model according to claim 1, characterized in that: in step 2.2.9), the loss function is composed of three parts:
The first part is the cross-entropy loss between the final classification result a_T and the actual category of the image:

L_action = -Σ_i y_i log(ŷ_i)

where y_i is the true label value of the input magnetic resonance image and ŷ_i is the label value predicted by the model;
the second part is a loss function of the Location network policy gradient algorithm, and the loss function formula is as follows:
Figure FDA0002277812290000033
wherein theta is a parameter of the strategy gradient, and parameters of the Glimpse network, the Core network and the Location network
Figure FDA0002277812290000034
θlAre all the strategy parameters controlled by the parameter theta of the strategy gradient,
Figure FDA0002277812290000035
the reward obtained for each sample in reinforcement learning,
Figure FDA0002277812290000036
the attenuation coefficient of the obtained reward is between 0 and 1,
Figure FDA0002277812290000037
the action behaviors sampled by the policy gradient algorithm,
Figure FDA0002277812290000038
the behavior states sampled by the policy gradient algorithm,
Figure FDA0002277812290000039
performing an action behavior
Figure FDA00022778122900000310
Accumulated awards obtained, biIs a reward benchmark value that depends on the behavior state
Figure FDA0002277812290000041
Independent of motion behavior
Figure FDA0002277812290000042
Figure FDA0002277812290000043
The representative is that the strategy gradient algorithm directly parameterizes the strategy, namely, the strategy is represented by a parameterized function to seek the optimal strategy;
the loss function of the third part is based on the classification result aTThe reward obtained whether the reward is correct and the baseline form a mean square error loss function, and the loss function formula is as follows:
Figure FDA0002277812290000044
using error to obtain training reference value biThe increase of the baseline network can reduce the probability of behavior actions below a baseline value and increase the probability of behavior actions above the baseline value, wherein,
Figure FDA0002277812290000045
obtained for each sample in reinforcement learningAwarding if the classification result aTIf correct, the reward function is
Figure FDA0002277812290000046
Otherwise the reward function is
Figure FDA0002277812290000047
biA baseline value predicted for each sample; according to the loss functions of the first part, the second part and the third part, the loss function of the RAM training model is defined as follows:
Figure FDA0002277812290000048
CN201911129218.6A 2019-11-18 2019-11-18 Brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model Pending CN110956617A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911129218.6A CN110956617A (en) 2019-11-18 2019-11-18 Brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911129218.6A CN110956617A (en) 2019-11-18 2019-11-18 Brain nuclear magnetic resonance abnormal image visualization method based on circulation attention model

Publications (1)

Publication Number Publication Date
CN110956617A true CN110956617A (en) 2020-04-03

Family

ID=69977767

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911129218.6A Pending CN110956617A (en) 2019-11-18 2019-11-18 Brain nuclear magnetic resonance abnormal image visualization method based on a recurrent attention model

Country Status (1)

Country Link
CN (1) CN110956617A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021243724A1 (en) * 2020-06-05 2021-12-09 北京嘀嘀无限科技发展有限公司 Image processing method, electronic device, vehicle traveling data recorder, and server
CN114743053A (en) * 2022-04-14 2022-07-12 电子科技大学 Magnetic resonance image auxiliary processing system based on graph neural network and self attention
CN116363512A (en) * 2023-03-24 2023-06-30 北京城市轨道交通咨询有限公司 Method and device for detecting stability of surrounding rock


Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106022244A (en) * 2016-05-16 2016-10-12 广东工业大学 Unsupervised crowd abnormity monitoring and positioning method based on recurrent neural network modeling
CN106096729A (en) * 2016-06-06 2016-11-09 天津科技大学 A kind of towards the depth-size strategy learning method of complex task in extensive environment
CN107346430A (en) * 2017-06-29 2017-11-14 济南浪潮高新科技投资发展有限公司 A kind of nuclear magnetic resonance image auto-screening method
CN108170736A (en) * 2017-12-15 2018-06-15 南瑞集团有限公司 A kind of document based on cycle attention mechanism quickly scans qualitative method
US20190318648A1 (en) * 2018-04-12 2019-10-17 Baidu Usa Llc Systems and methods for interactive language acquisition with one-shot visual concept learning through a conversational game
CN109325517A (en) * 2018-08-14 2019-02-12 浙江大学 A kind of figure classification method of the Recognition with Recurrent Neural Network model based on Attention

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
VOLODYMYR MNIH et al.: "Recurrent Models of Visual Attention", Advances in Neural Information Processing Systems, 24 June 2014, pages 1-8 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021243724A1 (en) * 2020-06-05 2021-12-09 北京嘀嘀无限科技发展有限公司 Image processing method, electronic device, vehicle traveling data recorder, and server
CN114743053A (en) * 2022-04-14 2022-07-12 电子科技大学 Magnetic resonance image auxiliary processing system based on graph neural network and self attention
CN114743053B (en) * 2022-04-14 2023-04-25 电子科技大学 Magnetic resonance image auxiliary processing system based on graph neural network and self-attention
CN116363512A (en) * 2023-03-24 2023-06-30 北京城市轨道交通咨询有限公司 Method and device for detecting stability of surrounding rock


Legal Events

Code | Description
PB01 | Publication
SE01 | Entry into force of request for substantive examination
RJ01 | Rejection of invention patent application after publication (application publication date: 20200403)