[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN117078780A - Deep learning-based micro-fossil CT image preprocessing method and device - Google Patents

Deep learning-based micro-fossil CT image preprocessing method and device Download PDF

Info

Publication number
CN117078780A
CN117078780A CN202310863239.0A CN202310863239A CN117078780A CN 117078780 A CN117078780 A CN 117078780A CN 202310863239 A CN202310863239 A CN 202310863239A CN 117078780 A CN117078780 A CN 117078780A
Authority
CN
China
Prior art keywords
image
super
resolution
network
micro
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310863239.0A
Other languages
Chinese (zh)
Inventor
侯叶茂
尹鹏飞
崔心东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute Of Vertebrate Paleontology And Paleoanthropology
Original Assignee
Institute Of Vertebrate Paleontology And Paleoanthropology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute Of Vertebrate Paleontology And Paleoanthropology filed Critical Institute Of Vertebrate Paleontology And Paleoanthropology
Priority to CN202310863239.0A priority Critical patent/CN117078780A/en
Publication of CN117078780A publication Critical patent/CN117078780A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/003Reconstruction from projections, e.g. tomography
    • G06T11/005Specific pre-processing for tomographic reconstruction, e.g. calibration, source positioning, rebinning, scatter correction, retrospective gating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/40Scaling of whole images or parts thereof, e.g. expanding or contracting
    • G06T3/4053Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10072Tomographic images
    • G06T2207/10081Computed x-ray tomography [CT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a deep learning-based micro-fossil CT image preprocessing method and device, wherein the method comprises the following steps: acquiring CT images of the micro fossil; obtaining a super-resolution image of the CT image based on the CT image super-resolution reconstruction model; and obtaining a CT image with high signal-to-noise ratio corresponding to the super-resolution image based on the denoising network model. The invention can efficiently and objectively perform super-resolution reconstruction and noise removal functions on the micro-fossil CT image so as to solve the problems of low resolution of the micro-fossil CT image, low signal-to-noise ratio of the CT image and the like.

Description

Deep learning-based micro-fossil CT image preprocessing method and device
Technical Field
The invention relates to the technical field of micro-fossil CT image preprocessing, in particular to a micro-fossil CT image preprocessing method and device based on deep learning.
Background
The microvesicle fossil is a microscopic ancient organism remains or remains stored in a stratum and cannot be directly observed by naked eyes. The micro-fossil contains a great deal of abundant geological information and has very important significance for the scientific research of the earth, life and environment. In recent years, thanks to the wide use of the micro-CT technique in micro-ancient biologies, ancient biologists can obtain more comprehensive three-dimensional data information of micro-fossils, including three-dimensional microstructures from the inside to the surface of the micro-fossils, through micro-CT scanning. However, in the process of acquiring the micro-fossil CT image, the micro-CT image is limited by software, hardware and environmental factors of the micro-CT device, and various problems that CT image noise and CT image resolution cannot meet the identification requirement of the micro-fossil structure are unavoidable.
The resolution of the micro-fossil CT image acquired by the micro-CT system is limited by factors such as the size of the focal spot of the X-ray source, the pixel of the imaging detector, the size of the micro-fossil sample, and the like. Thus, the archaeological specialist can only trade off between the micro-foster CT image field of view and the image voxel size. However, CT image voxel size is critical to describe the microstructure of the microposts (e.g., texture, medullary cavity holes, ridges, etc.). At present, the ancient biological specialist wants to observe the characteristics or further observe the super-microscopic structure of the micro-fossils, even the ancient tissue details, and the expensive hardware equipment is needed or the traditional image interpolation-based method is adopted to increase the voxel size of the CT image, so that the super-resolution reconstruction of the CT image of the micro-fossils is realized.
The signal-to-noise ratio of the micro fossil CT image acquired by the micro CT system is limited by factors such as the exposure time of an X-ray machine, the number of superimposed frames, the number of projection images and the like. Therefore, the archaeological specialist can only trade off between the time of CT scanning of the micro-fossils and the high signal-to-noise ratio of CT images. However, CT images with high signal-to-noise ratios are critical to distinguishing the texture detail features of the micro-fossils. Currently, ancient biological specialists want to acquire high signal-to-noise ratio micro-fossil CT images, and generally adopt a traditional denoising method based on artificial features, for example, a discrete cosine transform, a wavelet transform and the like.
The conventional micro-fossil CT image preprocessing method has the following problems and disadvantages: the traditional image interpolation-based method is adopted to realize super-resolution reconstruction of the micro-fossil CT image, although the method is simple and rough and has small operand, the pixel values of adjacent pixel points can be directly copied for filling, but the reconstruction effect is poor and the blocking effect is obvious by the method of moving surrounding pixels; the conventional denoising method generally needs to adjust parameters, so that experienced staff is required to repeatedly adjust the parameters to exert the maximum performance of the algorithm. In addition, the calculation amount of some algorithms is huge, and the application of the algorithms in the pretreatment of the micro-fossil CT images is limited.
Disclosure of Invention
In order to overcome the defects of the traditional micro-fossil CT image preprocessing method, the invention provides a novel micro-fossil CT image preprocessing method and device based on deep learning, which can efficiently and objectively perform super-resolution reconstruction and noise removal functions on micro-fossil CT images so as to solve the problems of low resolution of the micro-fossil CT images, low signal-to-noise ratio of the CT images and the like.
The technical scheme of the invention comprises the following steps:
a deep learning-based micro-fossils CT image preprocessing method, the method comprising:
acquiring CT images of the micro fossil;
based on a CT image super-resolution reconstruction model, obtaining a super-resolution image of the CT image; the CT image super-resolution reconstruction model is obtained by calculating perception loss, antagonism loss, total variation loss and content loss by using a high-definition-low-definition original CT image pair, wherein the perception loss is obtained based on a feature vector distance between a high-definition original CT image and a corresponding super-resolution image, the antagonism loss is obtained by discriminating a high-definition original CT image and a reconstruction result of the super-resolution network model, the total variation loss is obtained based on a sharpness difference between image gradients of the high-definition original CT image and the corresponding super-resolution image due to regional variation, and the content loss is obtained based on a reconstruction result of the super-resolution network model and a pixel level distance between the high-definition original CT image;
and obtaining the CT image with high signal-to-noise ratio corresponding to the super-resolution image based on the denoising network model.
Further, the backbone network of the CT image super-resolution reconstruction model is a WSRGAN network, and the WSRGAN network comprises: a generation network and a countermeasure network, the generation network being comprised of an encoding structure and a decoding structure, the encoding structure comprising a plurality of encoding modules, each encoding module comprising: residual block, convolution layer, model weight normalization layer, reLU activation layer, channel attention block, spatial attention block and upsampling function; the decoding structure comprises a plurality of decoding modules, and the countermeasure network establishes a structure comprising a VGG19 network model, a LeakyReLU activation function, a convolution layer and a model weight normalization layer based on a VGG architecture.
Further, training the CT image super-resolution reconstruction model includes:
constructing an experimental data set based on a plurality of high-definition-low-definition original CT image pairs, and dividing all images in the experimental data set into a plurality of batches;
loading a pre-trained WSRGAN network original weight file;
inputting a low-definition original CT image to a generating network so that the encoding module extracts a feature vector from the low-definition original CT image, and generates an output vector to be input to a next encoding module, wherein the encoding module extracts the feature vector from the output vector; the feature vector output by the encoding module of the next stage obtains higher-dimensional image feature information than the feature vector output by the encoding module of the previous stage;
inputting the final feature vector extracted by the coding structure into a decoding structure, and generating a super-resolution image corresponding to the original CT image by a last decoding module, and obtaining perception loss based on the Euclidean distance of the feature vector between the high-definition original CT image and the corresponding super-resolution image;
inputting the low-definition original CT image into a generating network to obtain a reconstruction result of a super-resolution network model, inputting the reconstruction result and the high-definition original CT image into an countermeasure network together, and distinguishing the reconstruction image of the high-definition original CT image and the super-resolution network model based on a discriminator to obtain countermeasure loss;
using variable step sizesAnd updating the weight W of each batch by a random gradient descent algorithm i+1 Restarting the optimization weight by using the LRS strategy to obtain an optimal weight file of the WSRGAN network model; wherein, table iShowing batch, j represents algebra of training;
the total variation loss is obtained by minimizing the total variation of gradient operators in the horizontal direction and the vertical direction of the pixel level of the super-resolution reconstructed image and the high-definition original CT image;
calculating a pixel-level distance between a super-resolution reconstructed image result and a high-definition original CT image based on the mean square error loss to obtain content loss;
and training and optimizing the WSRGAN network according to the perception loss, the antagonism loss, the total variation loss and the content loss, so as to obtain a CT image super-resolution reconstruction model.
Further, the variable step sizeWherein alpha is max Representing a variable step +.>Maximum value range of alpha min Representing a variable step +.>C is the cycle length of the cos function, S is the number of iterations of the restart step, b i The number of micro-fostered CT images per batch.
Further, the weightWherein (1)>Gradient value, X for the ith generation of training j For the high-definition original CT image set, Y j For a high resolution image set generated based on the low definition raw CT image set, B represents the total batch of the ith generation of training.
Further, the obtaining the CT image with high signal-to-noise ratio corresponding to the super-resolution image based on the denoising network model includes:
constructing a denoising network model, wherein a backbone network of the denoising network model is a modified Noise to Noise network, and the modified Noise to Noise network comprises: a downsampling portion, a skip connecting portion, and an upsampling portion; the feature images of the same stage of the downsampling part and the upsampling part have the same size and the same channel number, the downsampling part solves gradient disappearance and network degradation by introducing a residual error module, and the jump connection part fuses different layers of multi-scale feature images extracted by the convolutional neural network by introducing an attention mechanism;
training the improved Noise to Noise network by inputting micro-fossil CT images with different levels of Gaussian random Noise with zero mean as input data and tag data respectively, and monitoring the training process of the improved Noise to Noise network by the signal-to-Noise ratio of the denoised images to obtain denoising loss L noise
According to denoising loss L noise Performing the improved Noise to Noise network training and optimization to obtain a denoising network model;
and inputting the super-resolution image into a denoising network model to obtain a CT image with high signal-to-noise ratio.
Further, the denoising penaltyWherein y is t Represents the t-th noisy micro-body fossil CT data, θ represents network parameters, +.>Another noisy distribution of the t-th noisy micro-body CT data is represented, and N represents the total number of noisy micro-body CT data.
A deep learning based micro-fossils CT image preprocessing device, the device comprising:
the acquisition module is used for acquiring CT images of the micro fossil;
the generation module is used for obtaining a super-resolution image of the CT image based on the CT image super-resolution reconstruction model; the CT image super-resolution reconstruction model is obtained by calculating perception loss, antagonism loss, total variation loss and content loss by using a high-definition-low-definition original CT image pair, wherein the perception loss is obtained based on a feature vector distance between a high-definition original CT image and a corresponding super-resolution image, the antagonism loss is obtained by identifying a reconstruction result of the high-definition original CT image and the super-resolution network model, the total variation loss is obtained based on a sharpness difference generated by image gradients of the high-definition original CT image and the corresponding super-resolution image due to region change, and the content loss is obtained based on a reconstruction result of the super-resolution network model and a pixel level distance between the high-definition original CT image;
and the denoising module is used for obtaining the CT image with high signal-to-noise ratio corresponding to the super-resolution image based on the denoising network model.
A computer device, the computer device comprising: a processor and a memory storing computer program instructions; the processor, when executing the computer program instructions, implements the deep learning-based micro-fossils CT image preprocessing method of any one of the above.
A computer readable storage medium, wherein computer program instructions are stored on the computer readable storage medium, and when executed by a processor, the computer program instructions implement the deep learning-based micro-fossils CT image preprocessing method according to any one of the above.
Compared with the prior art, the invention has the following advantages:
1) The deep learning technology is applied to develop super-resolution reconstruction work of the micro-fossil CT image, so that the spatial resolution and the density resolution of the micro-fossil CT image are improved, and the research requirement of ancient specialists on fine observation of the micro-fossil three-dimensional microstructure is met;
2) The traditional method for removing the micro-fossil CT image noise needs experienced staff to repeatedly adjust parameters to exert the maximum performance of the algorithm. The CT image denoising method based on the deep learning can play the maximum performance of the deep learning method without constructing a noiseless data set.
Drawings
Fig. 1 is a flowchart of a method for preprocessing a CT image of a micro-fossil according to an embodiment of the present invention.
Fig. 2 is a flow chart for creating a super-resolution reconstruction network model of a micro-fossil CT image based on deep learning according to an embodiment of the present invention.
Fig. 3 is a flowchart of creating a denoising network model of a micro-fossil CT image based on deep learning according to an embodiment of the present invention.
Fig. 4 is a block diagram of a micro-fossil CT image preprocessing apparatus according to an embodiment of the present invention.
Detailed Description
The invention will be described in further detail with the aim of making the objects, solutions and advantages of the invention more clear, taking experiments performed on a real dataset as an example. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
The invention relates to a preprocessing method of a micro-fossil CT image, which comprises the following steps: acquiring a CT image of the micro-fossils, performing super-resolution reconstruction of the CT image of the micro-fossils based on deep learning, and denoising the CT image of the micro-fossils based on the deep learning.
Specifically, as shown in fig. 1, the present invention has the steps of:
step 1: CT images of the microvesicle fossils were acquired.
According to the invention, the micro-CT image is acquired by using the micro-CT according to the micro-CT image acquisition standard. The format of the micro-fossils CT image can be a DICOM-format micro-fossils CT image, or a non-DICOM common image BMP/TIF-format micro-fossils CT image.
In one example, the micro-fossils were derived from early vertebrate micro-fossils from the mid-juveniles to the clay pot of Yunnan China. After the treatment of the microsilica with diluted (3% -7%) acetic acid solution, CT images of the microsilica were acquired using GE v|tome| x m300&180 microscopy CT. Wherein, the parameters of GE v|tome| x m300&180 micro-industrial CT are set as follows: voltage: 90kV; current flow: 90uA; exposure time: 2000ms; superimposed frame number: 1 frame; number of projection images: 2500 sheets; voxel size: 3um.
Step 2: and obtaining a super-resolution image of the CT image based on the CT image super-resolution reconstruction model.
The CT image is directly input into a trained deep learning-based micro-fossil CT image super-resolution reconstruction WSRGAN network model, namely, the trained generation network is used for obtaining the micro-fossil CT image with high resolution, realizing the super-resolution reconstruction of the micro-fossil CT image and outputting the micro-fossil CT image super-resolution image.
The super-resolution reconstruction WSRGAN network model uses a weight-normalized generation type countermeasure network. The generation network part is used for generating the super-resolution micro-body fossil CT image, and the countermeasure network part judges the fidelity degree of the generation network generated image and feeds the fidelity degree back to the generation network. The generation network and the countermeasure network part are added with weight normalization layers, and the weight normalization layers WN are defined as follows:
wherein the vector iiv is the euclidean norm of V, W, V being the K-dimensional vector. g is a scalar quantity, and g is a scalar quantity, g= |w|. W is the minor state of V. The introduction of WN can obtain better training and testing accuracy.
The super-resolution reconstruction WSRGAN network model adopts a dual attention mechanism, namely a channel attention mechanism and a space attention mechanism are combined. The channel attention mechanism can adaptively adjust the weight of the channel, and effectively improve the super-resolution reconstruction index. The spatial attention mechanism is an effective supplement to the channel attention mechanism, and the introduction of the spatial attention mechanism can lead the feature extraction to be concentrated on the detail part of the high-frequency region, inhibit the weight of the low-frequency region, and be beneficial to reconstructing the high-frequency detail in the image, in particular to structures such as textures, medullary cavity holes, ridges and the like in the micro-fossils.
In one example, when the super-resolution reconstruction network model is trained, an experimental dataset is constructed by inputting a plurality of high-definition (2048 x 2048) -low-definition (512 x 512) CT image pairs, and then the super-resolution reconstruction WSRGAN network model of the micro-body fossil CT image based on deep learning is trained, verified and tested. Loading an original weight file of a pre-trained WSRGAN network model, performing countermeasure learning through a high-resolution micro-fossil CT image and a low-resolution micro-fossil CT image, continuously regressing, training a deep learning-based micro-fossil CT image super-resolution reconstruction WSRGAN network model, adjusting the reconstruction precision of the WSRGAN network model, and storing the parameter weights of the optimized WSRGAN network model. In the loss function part, in order to reduce adverse effects caused by feature matching errors, the invention adds total variation loss on the basis of the context loss function, so that gradients in the image can be changed in a smaller area, the image can generate a certain sharpness, the detail texture of the image can be improved to a certain extent, and the reconstruction of structures such as textures, medullary cavity holes and ridges in the micro-fossils is facilitated. And judging the relative reality of the micro-fossil CT image by adopting a relative discriminator, and optimizing the super-resolution effect of the micro-fossil CT image.
As shown in fig. 2, the embodiment of the invention provides a super-resolution reconstruction WSRGAN network model creation flow based on deep learning micro-fossil CT images. The low resolution 512 x 512 micro-fossils CT images in the experimental dataset were input into a codec-based generator. The generator has 6 modules, each comprising a residual block, a convolution layer, a model weight normalization layer, a ReLU activation layer, a channel attention block, a spatial attention block, and an upsampling function, etc. Each module may extract feature vectors from the low resolution image and then generate an output vector for input to the next module. The feature vectors represent the main features of the input low resolution image and the decoder is responsible for upsampling the feature vectors. The last decoding module can reconstruct the micro-fossil CT image with the resolution of 2048 x 2048.
And loading the high-resolution 2048 x 2048 micro-fossil CT image in the experimental data set and the super-resolution reconstruction result of the micro-fossil CT image in the VGG19 network model-based discriminator pre-trained in the ImageNet data set, and judging the relative authenticity of the micro-fossil CT image. The arbiter comprises a VGG19 network model, a LeakyReLU activation function, a convolution layer, a model weight normalization layer and other structures.
In the training process, the method is used for reconstructing the micro-fossil CT image in super-resolution in order to save the optimal weight of the WSRGAN network model. The invention adopts an LRS (learning rate decay strategy) of an optimization weight algorithm. The LRS is a strategy capable of restarting the optimization weight in each epoch stage, so that the convergence rate of the network model is further improved. In order to make the network model easier to converge to local minima, the present invention updates the weights of each epoch stage with a variable step-size α and random gradient descent (SGD) algorithm, while dividing all images in the training set into B batches. The weight value W of the WSRGAN network model updated by the SGD is as follows:
wherein,gradient value, X for the ith generation of training j For the original high resolution image set, Y j A set of high resolution images is generated for the generation network.
Restarting the optimization weight by using the LRS strategy, and minimizing the loss function of the WSRGAN network model. The variable step size α is defined as follows:
wherein,for the jth generation i batch, < ->The value range of (a) is [ alpha ] minmax ],b i For the number of micro-fostered CT images per batch, c is the cycle length of the cos function, S is the number of iterations of the restart step.
Training deep learning-based micro fossil CTSuper-resolution reconstruction WSRGAN network model of image, iteration number set to 500, optimizer as Adam SGD function, batch size set to 16, initial learning rate set to [1×10 ] -5 ,1×10 -3 ]. The loss function of the countermeasure network model is a non-convex loss function, and the super-resolution reconstruction WSRGAN network model of the optimal deep learning-based micro-fossil CT image is obtained. The loss function includes four parts, namely, perceived loss, counterloss, content loss and total variation loss. Loss of perceptionThe method is based on VGG architecture establishment, and the VGG19 network is used for respectively extracting the original high-resolution image and the reconstructed super-resolution image feature vector, calculating the Euclidean distance between the original high-resolution image and the reconstructed super-resolution image feature vector, and focusing on the perception information of the image. Countering losses->And using a discriminator to discriminate the original high-resolution image and the reconstructed super-resolution image, and restricting a generator so as to improve the visual effect of the reconstructed image. Total variation loss->The gradient in the image can be changed in a smaller area, a certain sharpness is generated, and the detail texture of the image is improved to a certain extent. Content loss->In the image super-resolution reconstruction process, the pixel-level distance between the reconstructed super-resolution image and the original high-resolution image is calculated by using the mean square error loss, so that the training of the generator is constrained. The definition formula of the loss function L of the final model training is as follows:
wherein,for the perception of lost parts, ->To combat lost parts, let go of>In order to achieve the total variation loss,is the content loss part.
Step 3: and obtaining a CT image with high signal-to-noise ratio corresponding to the super-resolution image based on the denoising network model.
According to the invention, the micro-fossils CT image to be Noise-reduced is input into a trained deep learning-based micro-fossils CT image denoising Noise to Noise network model, so that a high signal-to-Noise ratio micro-fossils CT image is obtained, the Noise reduction treatment of the micro-fossils CT image is realized, and the high signal-to-Noise ratio micro-fossils CT image is output.
The improved Noise to Noise network model for processing the de-noising problem of the micro-fossil CT image mainly comprises the following three parts: downsampling to introduce residual modules, skip connections to introduce attention mechanism modules, and upsampling to introduce deconvolution modules. In the modified Noise to Noise network model, the feature map size and channel number of the same phase of the downsampling and upsampling portions are the same. When the feature map is input to the next stage of the downsampling section, the size is one half of the original size, and the number of channels is 2 times of the original size. Meanwhile, in the image denoising method based on deep learning, which is provided for the micro-fossil CT image, a residual error module is introduced into a sampling part of an original Noise to Noise network model. The residual error module solves the problems of gradient disappearance, network degradation and the like caused by deepening the layer number of the traditional convolutional neural network, and the network model with excellent performance is easier to obtain. Attention mechanism modules are introduced in the original Noise to Noise network model hopping connection section. The attention mechanism module can fuse different layers of multi-scale feature images extracted by the convolutional neural network, and is helpful for removing different types of noise in the micro-fossil CT image, such as hardening artifact, spiced salt noise, gaussian noise and the like. The network model is trained, verified and tested by inputting the micro-fossil CT images with different levels of Gaussian random noise with zero mean value as input data and label data respectively. Since the loss function loss does not gradually drop with iteration, but the Noise to Noise network model still converges, the invention monitors the network training process by the signal-to-Noise ratio (SNR) of the denoised image. The method comprises the steps of loading a pre-trained Noise to Noise network model original weight file, comparing micro-fossil CT images added with different levels of random Noise, continuously regressing, training a Noise to Noise network model based on deep learning of the micro-fossil CT images, adjusting and improving the signal to Noise ratio of the micro-fossil CT images after Noise removal, and storing the optimized parameter weight of the improved Noise to Noise network model.
In one example, the improved Noise to Noise network model uses a ResNet34 network model as a backbone network for downsampling, and the downsampling portion includes four phases, each phase is composed of a plurality of residual modules based on the ResNet34 network model, the first phase includes 3 residual modules, the second phase includes 4 residual modules, the third phase includes 6 residual modules, and the fourth phase includes 3 residual modules. The jump connection part adopts a feature pyramid and attention mechanism combined structure and is connected with the feature images with the same scale extracted in the same stage of downsampling encoding and upsampling decoding, so that the channel number of the feature images in the upsampling stage can be doubled to 128, 256, 512 and 1024 respectively. Each stage of up-sampling includes a deconvolution operation with deconvolution kernel size of 2 x 2, feature map concatenation, and two convolution operations with convolution kernel size of 3 x 3, with the 1 x 1 convolution layer outputting a high signal-to-noise ratio micro-fossilized CT image.
As shown in fig. 3, the embodiment of the invention provides a deep learning-based micro-fossil CT image denoising Noise to Noise network model creation flow. And (3) adding random Noise micromanipulation CT images of different grades into an input experimental data set to improve a downsampling stage of a Noise to Noise network model, wherein the stage consists of a plurality of residual modules based on a pre-trained ResNet34 network model. Loading in ImageNet datasetThe method comprises the steps of setting the initial weight file of a pre-trained ResNet34 network model, setting the iteration number to 100, setting an optimizer to an Adam function, setting the batch size to 16, setting the initial learning rate to 1e-3, training a deep learning-based micro-fossil CT image denoising Noise to Noise network model, and obtaining an optimal deep learning-based micro-fossil CT image denoising network model. Improving the loss function of a Noise to Noise network model to be L noise Function, loss function L noise The definition formula of (2) is as follows:
wherein y is i Represents noisy micro-body fossil CT data, theta represents network parameters,another noisy distribution representing noisy micro-fossil CT data.
In summary, aiming at the ancient biological specialists, the invention further observes the super-microscopic structure of the micro-foster and requires expensive hardware equipment, and acquires the micro-foster CT images with high resolution and low resolution through micro-CT to establish experimental data sets for training, verifying and testing the super-resolution reconstruction method of the micro-foster CT images based on the deep learning, which is provided by the invention; aiming at ancient tissue details of the micro-fossils further observed by ancient biological specialists, a CT image with high signal to noise ratio is needed. The deep learning-based micro-fossil CT image preprocessing method and device developed by the invention are expected to indicate potential directions for determining geologic ages, dividing and comparing strata and searching mineral resources.
Compared with the existing micro-fossil CT image pretreatment method, experiments show that:
in the process of super-resolution reconstruction of the micro-fossil CT image, compared with the traditional bilinear interpolation method, bicubic interpolation method, and EDSR, WDSR and SRGAN network models based on deep learning, as shown in table 1, the WSRGAN network model developed by using the end-to-end architecture improves the peak signal-to-noise ratio PSNR (the larger the numerical value is, the higher the image quality) of the similarity measurement peak and the LPIPS (the smaller the numerical value is, the higher the image quality is) of the image similarity measurement standard.
Method PSNR LPIPS
Bilinear 32.6754 0.4289
Bicubic 33.1636 0.4166
EDSR 33.1733 0.2281
WDSR 32.1440 0.2640
SRGAN 34.0559 0.1815
The invention is that 35.0615 0.0757
TABLE 1
In the process of denoising the micro-fossil CT image, compared with the traditional image denoising method of non-local mean value (NLM) image denoising method, as shown in table 2, the improved Noise to Noise network model adopted by the invention obviously improves the signal-to-Noise ratio SNR (the larger the numerical value is, the higher the image quality is).
TABLE 2
Referring to fig. 4, a block diagram of a micro-fossil CT image preprocessing apparatus according to an embodiment of the present invention is shown. The device can be a computer device or can be arranged in the computer device. As shown in fig. 4, the apparatus includes the following modules: an image acquisition module 410, an image reconstruction module 420, an image denoising module 430, and an image output module 440.
An image acquisition module 410 for acquiring CT images of the micro-fossils;
an image reconstruction module 420, configured to reconstruct the CT image to obtain a CT image super-resolution image;
the image denoising module 430 is configured to denoise the CT image super-resolution image to obtain a CT image with a high signal-to-noise ratio;
an image output module 440 for outputting a high resolution, high signal-to-noise ratio CT image;
for details of the specific implementation process, beneficial effects, etc. of the device module, please refer to the description of the above method embodiment, and the details are not repeated here.
In an exemplary embodiment, there is also provided a computer device including a memory and a processor, the memory storing a computer program loaded and executed by the processor to implement the above-described deep learning-based micro-fossils CT image preprocessing method.
In an exemplary embodiment, a computer readable storage medium is also provided, on which a computer program is stored, which when being executed by a processor implements a deep learning based micro-fossilized CT image preprocessing method as described above.
In an exemplary embodiment, a computer program product is also provided, which, when run on a computer device, causes the computer device to perform a deep learning based micro-fossilized CT image preprocessing method as described above.
The foregoing description of the embodiments of the invention is not intended to be limiting, but rather is intended to cover all modifications, equivalents, alternatives, and improvements that fall within the spirit and scope of the invention.

Claims (10)

1. A deep learning-based micro-fossil CT image preprocessing method, the method comprising:
acquiring CT images of the micro fossil;
based on a CT image super-resolution reconstruction model, obtaining a super-resolution image of the CT image; the CT image super-resolution reconstruction model is obtained by calculating perception loss, antagonism loss, total variation loss and content loss by using a high-definition-low-definition original CT image pair, wherein the perception loss is obtained based on a feature vector distance between a high-definition original CT image and a corresponding super-resolution image, the antagonism loss is obtained by discriminating a high-definition original CT image and a reconstruction result of the super-resolution network model, the total variation loss is obtained based on a sharpness difference between image gradients of the high-definition original CT image and the corresponding super-resolution image due to regional variation, and the content loss is obtained based on a reconstruction result of the super-resolution network model and a pixel level distance between the high-definition original CT image;
and obtaining the CT image with high signal-to-noise ratio corresponding to the super-resolution image based on the denoising network model.
2. The method of claim 1, wherein the backbone network of the CT image super-resolution reconstruction model is a WSRGAN network, the WSRGAN network comprising: a generation network and a countermeasure network, the generation network being comprised of an encoding structure and a decoding structure, the encoding structure comprising a plurality of encoding modules, each encoding module comprising: residual block, convolution layer, model weight normalization layer, reLU activation layer, channel attention block, spatial attention block and upsampling function; the decoding structure comprises a plurality of decoding modules, and the countermeasure network establishes a structure comprising a VGG19 network model, a LeakyReLU activation function, a convolution layer and a model weight normalization layer based on a VGG architecture.
3. The method of claim 2, wherein training the CT image super-resolution reconstruction model comprises:
constructing an experimental data set based on a plurality of high-definition-low-definition original CT image pairs, and dividing all images in the experimental data set into a plurality of batches;
loading a pre-trained WSRGAN network original weight file;
inputting a low-definition original CT image to a generating network so that the encoding module extracts a feature vector from the low-definition original CT image, and generates an output vector to be input to a next encoding module, wherein the encoding module extracts the feature vector from the output vector; the feature vector output by the encoding module of the next stage obtains higher-dimensional image feature information than the feature vector output by the encoding module of the previous stage;
inputting the final feature vector extracted by the coding structure into a decoding structure, and generating a super-resolution image corresponding to the original CT image by a last decoding module, and obtaining perception loss based on the Euclidean distance of the feature vector between the high-definition original CT image and the corresponding super-resolution image;
inputting the low-definition original CT image into a generating network to obtain a reconstruction result of a super-resolution network model, inputting the reconstruction result and the high-definition original CT image into an countermeasure network together, and distinguishing the reconstruction image of the high-definition original CT image and the super-resolution network model based on a discriminator to obtain countermeasure loss;
using variable step sizesAnd updating the weight W of each batch by a random gradient descent algorithm i+1 Restarting the optimization weight by using the LRS strategy to obtain an optimal weight file of the WSRGAN network model; wherein i represents a batch, j represents a training algebra;
the total variation loss is obtained by minimizing the total variation of gradient operators in the horizontal direction and the vertical direction of the pixel level of the super-resolution reconstructed image and the high-definition original CT image;
calculating a pixel-level distance between a super-resolution reconstructed image result and a high-definition original CT image based on the mean square error loss to obtain content loss;
and training and optimizing the WSRGAN network according to the perception loss, the antagonism loss, the total variation loss and the content loss, so as to obtain a CT image super-resolution reconstruction model.
4. The method of claim 3, wherein the variable step size Wherein alpha is max Representing a variable step +.>Maximum value range of alpha min Representing a variable step +.>C is the cycle length of the cos function, S is the number of iterations of the restart step, b i The number of micro-fostered CT images per batch.
5. The method of claim 3, wherein the weightsWherein (1)>Gradient value, X for the ith generation of training j For the high-definition original CT image set, Y j For a high resolution image set generated based on the low definition raw CT image set, B represents the total batch of the ith generation of training.
6. The method of claim 1, wherein obtaining a high signal-to-noise ratio CT image corresponding to the super-resolution image based on a denoising network model comprises:
constructing a denoising network model, wherein a backbone network of the denoising network model is a modified Noise to Noise network, and the modified Noise to Noise network comprises: a downsampling portion, a skip connecting portion, and an upsampling portion; the feature images of the same stage of the downsampling part and the upsampling part have the same size and the same channel number, the downsampling part solves gradient disappearance and network degradation by introducing a residual error module, and the jump connection part fuses different layers of multi-scale feature images extracted by the convolutional neural network by introducing an attention mechanism;
training the improved Noise to Noise network by inputting micro-fossil CT images with different levels of Gaussian random Noise with zero mean as input data and tag data respectively, and monitoring the training process of the improved Noise to Noise network by the signal-to-Noise ratio of the denoised images to obtain denoising loss L noise
According to denoising loss L noise Performing the improved Noise to Noise network training and optimization to obtain a denoising network model;
and inputting the super-resolution image into a denoising network model to obtain a CT image with high signal-to-noise ratio.
7. The method of claim 1, wherein the denoising penaltyWherein y is t Represents the t-th noisy micro-body fossil CT data, θ represents network parameters, +.>Another noisy distribution of the t-th noisy micro-body CT data is represented, and N represents the total number of noisy micro-body CT data.
8. A deep learning-based micro-fossil CT image preprocessing device, the device comprising:
the acquisition module is used for acquiring CT images of the micro fossil;
the generation module is used for obtaining a super-resolution image of the CT image based on the CT image super-resolution reconstruction model; the CT image super-resolution reconstruction model is obtained by calculating perception loss, antagonism loss, total variation loss and content loss by using a high-definition-low-definition original CT image pair, wherein the perception loss is obtained based on a feature vector distance between a high-definition original CT image and a corresponding super-resolution image, the antagonism loss is obtained by identifying a reconstruction result of the high-definition original CT image and the super-resolution network model, the total variation loss is obtained based on a sharpness difference generated by image gradients of the high-definition original CT image and the corresponding super-resolution image due to region change, and the content loss is obtained based on a reconstruction result of the super-resolution network model and a pixel level distance between the high-definition original CT image;
and the denoising module is used for obtaining the CT image with high signal-to-noise ratio corresponding to the super-resolution image based on the denoising network model.
9. A computer device, the computer device comprising: a processor and a memory storing computer program instructions; the processor, when executing the computer program instructions, implements the deep learning-based micro-fostering CT image preprocessing method according to any one of claims 1 to 7.
10. A computer readable storage medium, wherein computer program instructions are stored on the computer readable storage medium, which when executed by a processor, implement the deep learning based micro-fossilized CT image preprocessing method according to any one of claims 1-7.
CN202310863239.0A 2023-07-14 2023-07-14 Deep learning-based micro-fossil CT image preprocessing method and device Pending CN117078780A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310863239.0A CN117078780A (en) 2023-07-14 2023-07-14 Deep learning-based micro-fossil CT image preprocessing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310863239.0A CN117078780A (en) 2023-07-14 2023-07-14 Deep learning-based micro-fossil CT image preprocessing method and device

Publications (1)

Publication Number Publication Date
CN117078780A true CN117078780A (en) 2023-11-17

Family

ID=88708730

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310863239.0A Pending CN117078780A (en) 2023-07-14 2023-07-14 Deep learning-based micro-fossil CT image preprocessing method and device

Country Status (1)

Country Link
CN (1) CN117078780A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118195903A (en) * 2024-05-20 2024-06-14 四川新视创伟超高清科技有限公司 Super-resolution image restoration method and restoration system based on deep learning

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118195903A (en) * 2024-05-20 2024-06-14 四川新视创伟超高清科技有限公司 Super-resolution image restoration method and restoration system based on deep learning

Similar Documents

Publication Publication Date Title
CN110443768B (en) Single-frame image super-resolution reconstruction method based on multiple consistency constraints
Trinh et al. Novel example-based method for super-resolution and denoising of medical images
CN112132959B (en) Digital rock core image processing method and device, computer equipment and storage medium
CN109214989B (en) Single image super resolution ratio reconstruction method based on Orientation Features prediction priori
CN111583285A (en) Liver image semantic segmentation method based on edge attention strategy
Min et al. Blind deblurring via a novel recursive deep CNN improved by wavelet transform
CN111583262A (en) Blood vessel segmentation method and system
CN113538616A (en) Magnetic resonance image reconstruction method combining PUGAN and improved U-net
Mansour et al. Image-to-image MLP-mixer for image reconstruction
CN116739899A (en) Image super-resolution reconstruction method based on SAUGAN network
CN115018728A (en) Image fusion method and system based on multi-scale transformation and convolution sparse representation
CN117274059A (en) Low-resolution image reconstruction method and system based on image coding-decoding
CN114913262B (en) Nuclear magnetic resonance imaging method and system with combined optimization of sampling mode and reconstruction algorithm
Rashid et al. Single MR image super-resolution using generative adversarial network
CN117078780A (en) Deep learning-based micro-fossil CT image preprocessing method and device
CN116258632A (en) Text image super-resolution reconstruction method based on text assistance
CN111986102B (en) Digital pathological image deblurring method
CN111311531A (en) Image enhancement method and device, console equipment and medical imaging system
Won et al. Low-dose CT denoising using octave convolution with high and low frequency bands
CN116778016A (en) MRI image reconstruction method, system and medium
WO2023164300A1 (en) Machine learning enabled restoration of low resolution images
Zhang et al. Deep residual network based medical image reconstruction
Song et al. Super resolution reconstruction of medical image based on adaptive quad-tree decomposition
Zhu et al. Denoising of Magnetic Resonance Images with Deep Neural Regularizer Driven by Image Prior
CN114862739B (en) Intelligent medical image enhancement method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination