[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Next Article in Journal
Meteorological Satellite Observations Reveal Diurnal Exceedance of Water Quality Guideline Thresholds in the Coastal Great Barrier Reef
Previous Article in Journal
Passive 3D Imaging Method Based on Photonics Integrated Interference Computational Imaging System
Previous Article in Special Issue
Wavelet Integrated Convolutional Neural Network for Thin Cloud Removal in Remote Sensing Images
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Infrared Cirrus Detection Using Non-Convex Rank Surrogates for Spatial-Temporal Tensor

1
School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
2
Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 611731, China
3
Xi’an Institute of Electromechanical Information Technology, Xi’an 710065, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2023, 15(9), 2334; https://doi.org/10.3390/rs15092334
Submission received: 27 February 2023 / Revised: 22 April 2023 / Accepted: 27 April 2023 / Published: 28 April 2023
(This article belongs to the Special Issue Pattern Recognition and Image Processing for Remote Sensing II)
Figure 1
<p>Comparison of the contribution of the <math display="inline"><semantics> <mrow> <msub> <mi>l</mi> <mn>0</mn> </msub> <mo>−</mo> <mi>n</mi> <mi>o</mi> <mi>r</mi> <mi>m</mi> </mrow> </semantics></math>, <math display="inline"><semantics> <mrow> <msub> <mi>l</mi> <mn>1</mn> </msub> <mo>−</mo> <mi>n</mi> <mi>o</mi> <mi>r</mi> <mi>m</mi> </mrow> </semantics></math>, and the Laplace function to the rank.</p> ">
Figure 2
<p>Spatial-temporal tensor construction.</p> ">
Figure 3
<p>Spatial patch-tensor construction.</p> ">
Figure 4
<p>The i-th singular values of unfolding matrices for infrared tensor <math display="inline"><semantics> <mrow> <msup> <mi mathvariant="script">D</mi> <mrow> <mi>s</mi> <mi>t</mi> </mrow> </msup> </mrow> </semantics></math>. (<b>a</b>) Cirrus image. (<b>b</b>–<b>d</b>) Singular values of mode-1, mode-2, and mode-3 unfolding matrices.</p> ">
Figure 5
<p>Cirrus image and corresponding mask image. (<b>a</b>–<b>c</b>) represents three classical clouds. (<b>d</b>–<b>f</b>) are the corresponding masks after morphological processing.</p> ">
Figure 6
<p>The whole process of the proposed algorithm.</p> ">
Figure 7
<p>(<b>a</b>–<b>f</b>) represent 6 original infrared images of different scenes.</p> ">
Figure 8
<p>ROC curves of different patch sizes in the 6 sequence images. (<b>a</b>) Sequence 1; (<b>b</b>) Sequence 2; (<b>c</b>) Sequence 3; (<b>d</b>) Sequence 4; (<b>e</b>) Sequence 5; (<b>f</b>) Sequence 6.</p> ">
Figure 9
<p>PR curves of different patch sizes in the 6-sequence images. (<b>a</b>) Sequence 1; (<b>b</b>) Sequence 2; (<b>c</b>) Sequence 3; (<b>d</b>) Sequence 4; (<b>e</b>) Sequence 5; (<b>f</b>) Sequence 6.</p> ">
Figure 10
<p>ROC curves of different regularization parameters in the 6 sequence images. (<b>a</b>) Sequence 1; (<b>b</b>) Sequence 2; (<b>c</b>) Sequence 3; (<b>d</b>) Sequence 4; (<b>e</b>) Sequence 5; (<b>f</b>) Sequence 6.</p> ">
Figure 11
<p>PR curves of different regularization parameters in the 6-sequence images. (<b>a</b>) Sequence 1; (<b>b</b>) Sequence 2; (<b>c</b>) Sequence 3; (<b>d</b>) Sequence 4; (<b>e</b>) Sequence 5; (<b>f</b>) Sequence 6.</p> ">
Figure 12
<p>ROC curves of different regularization parameters in the 6-sequence images. (<b>a</b>) Sequence 1; (<b>b</b>) Sequence 2; (<b>c</b>) Sequence 3; (<b>d</b>) Sequence 4; (<b>e</b>) Sequence 5; (<b>f</b>) Sequence 6.</p> ">
Figure 13
<p>PR curves of different methods in the 6-sequence images. (<b>a</b>) Sequence 1; (<b>b</b>) Sequence 2; (<b>c</b>) Sequence 3; (<b>d</b>) Sequence 4; (<b>e</b>) Sequence 5; (<b>f</b>) Sequence 6.</p> ">
Review Reports Versions Notes

Abstract

:
Infrared small target detection (ISTD) plays a significant role in earth observation infrared systems. However, some high reflection areas have a grayscale similar to the target, which will cause a false alarm in the earth observation infrared system. For the sake of raising the detection accuracy, we proposed a cirrus detection measure based on low-rank sparse decomposition as a supplementary method. To better detect cirrus that may be sparsely insufficient in a single frame image, the method treats the cirrus sequence image with time continuity as a tensor, then uses the visual saliency of the image to divide the image into a cirrus region and a cirrus-free region. Considering that the classical tensor rank surrogate cannot approximate the tensor rank very well, we used a non-convex tensor rank surrogate based on the Laplace function for the spatial-temporal tensor (Lap-NRSSTT) to surrogate the tensor rank. In an effort to compute the proposed model, we used a high-efficiency optimization approach on the basis of alternating the direction method of multipliers (ADMM). Finally, final detection results were obtained by the reconstructed cirrus images with a set threshold segmentation. Results indicate that the proposed scheme achieves better detection capabilities and higher accuracy than other measures based on optimization in some complex scenarios.

1. Introduction

The earth observation infrared system is a significant component of the remote sensing application. It significantly influences infrared guidance, remote sensing, missile warning, etc. [1,2,3,4]. The infrared detector has the advantages of strong adaptability, good portability, small size, and ease of concealment [5]. As the infrared imaging detection system has been significantly improving, some scholars have developed new target detection recognition algorithms [6,7,8,9,10,11,12,13]. However, some areas or scenes in nature also generate high levels of radiation, which means that they may show similar characteristics to a real small target. This will lead to early warning system false positives, which will interfere with small target detection. Because of the sun’s radiation, cirrus will produce a large amount of radiation, which will be imaged in the infrared image together with the real small target. To reduce the false alarm rate, it is essential to research cirrus features and detection methods.
The traditional methods of cloud detection are separated into three classes: pixel-level threshold, texture analysis and the statistical-based method [14,15,16,17]. These methods use space, time and frequency domain information, or calculate one or more appropriate thresholds based on a physical model, wavelength difference, etc. According to the obtained thresholds, the cirrus is distinguished from other parts of the image [18,19,20,21]. Because of the infrared radiation characteristics of cirrus, almost all of these traditional methods use infrared band images to detect the cirrus.
Besides using spectral or physical properties, since remote sensing technology, computer vision and artificial intelligence are undergoing a spurt of progress, some scholars have developed many new detection methods for cirrus detection [22,23,24,25,26,27]. Machine learning has improved the detection accuracy within a certain range; however, since the real cirrus scene lacks sufficient data to train the model, the method based on machine learning cannot achieve great detection performance in real scenes. Moreover, some scholars also introduced robust principal component analysis (RPCA) to cirrus cloud detection, but due to the design of the models, these studies have not yet achieved satisfactory results, especially in some complex scenarios [28,29].
Recently, optimization-based methods for target detection such as sparse representation (SR) have gradually been favored by scholars and successfully used in infrared imagery [30,31,32,33,34,35]. Low-rank sparse decomposition of a matrix or tensor involves regarding the infrared image as a linear superposition of two different parts: background component and target component. This type of method focuses more on the infrared image feature and utilizes the properties of the background and target. The background component has non-local self-similarity, which can be treated as a low-rank component. The proportion of the target component is small and can be treated as a sparse component. Using the above theory, the detection problem is converted into a matrix decomposition problem. For infrared images containing cirrus, the cirrus are still sparse compared to the background component. The background component is low-rank, so a cirrus detection problem can be converted into a matrix decomposition problem.
A traditional small target detection measure using RPCA and SR is often used for single frame image detection. Due to the small dimension and weak pixel levels of infrared small targets, the sparseness of infrared small targets is high in single frame images. However, due to the different imaging forms of cirrus, there are large cirrus. In a single frame image, these forms of cirrus may have poor sparsity. Since the background information of the infrared image in the same region at different times does not change much, and the cirrus may have morphological changes due to time process and displacement due to motion, adding time information can effectively improve the discrimination between low-rank components and sparse components in an established measure.
Considering an introduction of sequence images for detection, the matrix-based low-rank sparse method cannot meet the processing method of multiple images. Therefore, the theory of tensor recovery is considered.
The tensor, as a high-dimensional form of the matrix, is able to be directly utilized in high-dimensional data. Scholars have successfully used tensors to process color images, video and hyperspectral images [36,37,38,39,40,41,42]. Tensors are faster and more efficient at solving optimization problems when compared to matrices. By introducing a tensor into RPCA, scholars have proposed a tensor-robust principal component analysis model (TRPCA) [43]. Sequence images or videos with a temporal order can be treated as a three-dimensional tensor, so the TRPCA model can be applied in infrared imagery.
One of the difficulties of TRPCA is the representation of the tensor rank. Considering the non-local self-similarity in an infrared image background, the tensor composed of the background component has a low rank [44]. Tensor rank is the most direct means of measuring low-rank characteristics. However, the definition of matrix rank cannot be directly extended to tensors, and no direct definition can be used for the tensor rank.
Another difficulty is that most of the tensor rank calculation problems are NP-hard. For example: CP decomposition cannot be directly calculated. Therefore, scholars consider using tensor-rank convex relaxation or nonconvex tensor-rank substitution to represent the rank of tensors. Huang [45] used the sum of nuclear norms (SNN) in tensor rank but could not achieve a great effect in a complex background. Lu used tensor nuclear norm (TNN) in tensor rank [46], but each singular value is endowed with the same weight. However, big singular values include the main information of the image, while small singular values are caused by noise. Both SNN and TNN are convex relaxation-of-rank, which limits their performance. Recently, tensor rank non-convex surrogates (NRSs) based on the Laplace function have been proposed and successfully used [47]. This non-convex surrogate based on the Laplace function can help every singular value obtain a proper weight according to its values, which can better represent the rank of the tensor. As shown in Figure 1, Laplace functions can better approximate the l 0 norm than the l 1 norm. Guan [48] applied the non-convex surrogate based on the Laplace function to the infrared target detection. Inspired by his strategy, we propose a measure for cirrus detection based on visual saliency and non-convex spatial-temporal tensor rank surrogate (Lap-NRSSTT). The proposed method can use information embedded in a spatial-temporal structure and multiple regularization parameters to obtain better performance. The primary contributions in the paper could be summarized as bellow:
  • Considering the infrared imaging characteristics of cirrus scenes, a spatial-temporal tensor (STT) model was built, so that a low-rank sparse decomposition method could be effectively used in an infrared cirrus detection scheme;
  • To obtain an easy-to-calculate tensor rank, the NRSs using the Laplace function are applied to the STT (Lap-NRSSTT) completion for infrared imagery; It preserves the details of the cirrus and suppresses noise with smaller singular values;
  • To reduce the time complexity, a mask based on visual saliency is constructed, so that the optimization-based scheme can quickly reach the convergence stop condition with great detection performance.
The rest of this paper is organized as follows: In Section 2, details of the proposed model are proposed, the optimization of the model is designed, and the whole detection process for infrared sequence images is given. Section 3 proves the validity of the proposed measure through some single variable measurement and comparative experiments. In Section 4 and Section 5, we summarize the paper and discuss our future work.

2. Materials and Methods

In this section, we presented the Lap-NRSSTT model for infrared cirrus detection in infrared imagery.

2.1. Construction of STT Model

An infrared cirrus image could be divided into these parts: the infrared image matrix D , the background matrix A , the cirrus matrix S and the noise image matrix N . The definition of the model is as bellow:
D = A + S + N
To introduce TRPCA into cirrus detection, it was necessary to construct an STT model for detection. Under the tensor model, the infrared cirrus image model could be expressed as bellow:
D = A + S + N
where D , A , S , N R m × n × k are the input tensor, background tensor, cirrus tensor and random noise tensor, respectively. The variables m, n and k represent the size of the tensor.
When constructing the tensor model, the infrared small target algorithm would traverse the whole image using a window of size m × n . The obtained image patches would be arranged to form a tensor of size m × n × k . To better explain the subsequent steps, we explained some of the patch-tensor model construction steps: Considering that the experimental image we used is often preprocessed into an image of equal length and width, a sliding window of m × m was used, and to make sure that the obtained image patches did not have overlapping parts, the sliding step was also set to m. The whole image was traversed by a sliding window. By using this method, we could reduce redundant information to a certain extent. In this way, each frame of the image would obtain N u m image patches.
When used for cirrus detection, due to the different imaging forms of cirrus, there were large cirruses. The sparsity may be poor in a single sliding window. Because the background information of the infrared sequence image did not change much, the sparsity could be improved according to the change of the cirrus in the same position at a different time. Therefore, we considered constructing a tensor model with spatial-temporal characteristics.
To fully utilize the relevance of moving targets in the time domain, the order of image patches would be carefully considered. Here we consider three schemes: i and j were defined as follows: i was the frame index and j was the image patch index. The first scheme was to arrange all the image patches of one frame from small to large according to the patch index j , then continue the arrangement of the image patches of the next frame. This measure did not fully utilize the similarity of image patches at the same position between different frames. The second method combined the j -th image patch of each frame into a tensor of size m × m × i in the order of frames, and then arranged N u m tensors to obtain the final tensor N u m of size m × m × a ( a = i × N u m ) according to the order of the image patch index j from small to large. However, the displacement result from cirrus motion is often reflected on the surrounding patches of the patches with the same image patch index j in adjacent frames. If the tensor was constructed according to the above method, there would be a lot of image patches between D i j and D i + 1 j + 1 , which destroyed the local correlation of each frame in infrared images. Combining the advantages and disadvantages of the above measures, the STT was constructed by the method shown in Figure 2.
Firstly, the spatial patch tensor was constructed with the image patch D i j and its surrounding patches. The blue image patches represented the image patches of the current frame D i . The corresponding construction process is shown in Figure 3.
Then, the time patch tensor was constructed according to the following scheme: After selecting 2 × s adjacent frames D i + y of the current frame, where y [ s , 1 ] [ 1 , s ] , every frame D i + y was used to build a group of tensors according to Figure 3, labeled as D i + y s t . The temporal patch tensor was constructed by D i + y s t and arranged according to the frame index.
STT D s t could be shown as:
D s t = A s t + S s t + N s t
Thus, we established a new STT model, and fully utilized the spatial-temporal information. We would then analyze the properties of the divided parts in infrared imagery.
Background patch tensor A s t : For a three-dimensional tensor, we could obtain the expansion matrix of its various modes and calculated all singular values of the expansion matrix. When singular values rapidly descend to near zero rapidly, it means that the expansion matrix has the characteristics of low rank. We unfolded STT D s t and calculated the singular values of three mode, as shown in Figure 4. There was no doubt that the curve in the image would rapidly decrease to zero, which meant that our proposed tensor model has low rank characteristics in its multiple modes. Therefore, we could make a hypothesis on the background as follows:
rank ( A s t ) a
where a was a low-rank constant that constrains the background tensor. Generally, a was larger in a complex background.
Cirrus patch tensor S s t : The distribution range of the cirrus on the image was relatively small compared with the background; in the infrared video frame, the position shift or the morphological change of the cirrus caused by the movement of the cirrus made the cirrus more sparsely distributed in the tensor model we constituted and had significant sparse characteristics. Thus, the cirrus patch may satisfy a condition such as the following:
S s t 0 s
where s was a small positive number. It could be determined by the size of the cirrus and the frequency of the cirrus appearing in the spatial-temporal tensor.
Noise patch tensor N s t : The random noise in the infrared video was usually Gaussian white noise. According to (3), we could obtain that:
D s t A s t S s t F n
Removing noise, the low-rank component and sparse component could be separated by solving the following optimization problem:
min A s t , S s t r a n k ( A s t ) + λ S s t 0
s . t . D s t = A s t + S s t
According to the STT model, we could analyze the behavior of the cirrus patch image and background patch image. However, in the cirrus image, due to the different shapes of the cirrus, some shapes of the cirrus, which were similar to the background in the divided image patches, would show low rank. This would cause difficulty in distinguishing the cirrus. It was necessary to process the tensor model so that the cirrus component and the background component could be better distinguished. Therefore, we would use visual saliency to enhance the image.

2.2. Visual Saliency Mask

According to (7), λ was an important regularization parameter, which will maintain equilibrium between the sparse component and low rank component. The variable λ could be expressed as λ = L / min ( n 1 , n 2 ) × n 3 , where n 1 × n 2 × n 3 is the dimension of a tensor. It could separate the components of the cirrus and background to some extent. However, the shape of the cirrus was different from small targets. The cirrus had a large volume and different shapes. This made it difficult to select an appropriate λ .
When λ got a smaller value, the convergence rate of the tensor model would be very low, and it would take multiple iterations to obtain the result, which was slow. Secondly, many components belonging to the background would also be decomposed into sparse components, so that the low-rank components obtained in the background were too smooth. If we increased λ to generate a low-rank component with a higher rank, some cirrus would also enter the low-rank component, resulting in missed detection. Through experiments, it was found that for a single λ , it was impossible to determine a specific value so that the background and cirrus could be well separated.
Inspired by the patch sparse RPCA for salient motion detection in video [49], pixels could be differentiated according to whether they were cirrus pixels, and different regularization parameters were adopted for different regions. Through the pre-processing, we analyzed the image to find the possible areas of the cirrus and took different regularization parameters for the possible areas of the clouds and the cloud-free areas. For the possible areas that the cirrus may exit, a smaller regularization parameter was adopted to ensure that the sparse component did not enter the background, while for the background area, a larger regularization parameter was adopted so that the low-rank component would not enter the cirrus area. The modified TRPCA model for low-rank tensors and sparse tensors was as follows:
min A s t , S s t r a n k ( A s t ) + λ S Ω s t 1 + β S Ω s t 1
s . t . D s t = A s t + S s t , S s t = S Ω s t + S Ω s t
where S Ω s t was the detected region with cirrus and S Ω s t was the detected region without cirrus.
Due to the use of infrared sequence images and tensors for experiments, the amount of data to be processed in the experiment was large, and the use of additional RPCA would greatly increase the operation time. Therefore, the use of visual saliency to process the cirrus area was considered.
Frequency-tuned (FT) saliency was first proposed by Achanta et al. [50], which was simple and efficient. The steps to solve the frequency modulation restriction were as follows: First, Gaussian filtering was performed on the image to remove noise and texture details. After that, the l 2 n o r m of the difference between the processed image and its image mean was calculated as the final saliency map. It was defined as:
S = I G ( x , y ) I μ 2
where S is the frequency modulated saliency map, I G is the Gaussian filtered image, and I μ is the image mean. Through the frequency-tuned saliency feature, we obtained the possible area of the cirrus, and after that we performed threshold segmentation and morphological processing on the region to obtain the mask of the multi-cloud region we needed, as shown in Figure 5. Other visual saliency methods such as spectral residual or phase of Fourier transform cannot achieve good detection results when the gray level of the cirrus in the image is low and the background has a high brightness area. Considering the complexity of the experimental scene, other saliency methods cannot meet the complexity of the experiment. Combined with the STT model proposed above, the Lap-NRSSTT model we needed was obtained. Next, we discussed the NRS used to describe the low-rank of the background.

2.3. Non-Convex Surrogate of Tensor Rank

Since the rank function and l 0 n o r m were non-convex, it was still NP-hard to solve (7) directly. The l 1 n o r m was often used to replace the l 0 n o r m , but the approximation of tensor rank was a considerable difficulty. Figuring out how to deal with the rank was the key to solve the tensor recovery problem. The TNN was a convex relaxation-of-tensor rank, which had been widely used in medical image processing, tensor completion, video denoising, etc. However, it assigned the same weight to each singular value. In infrared images, different singular values correspond to different image information. Large singular values correspond to low-frequency components, and small singular values correspond to high-frequency components. In low-rank sparse tensor decomposition, we expected that the image information corresponding to large singular values was less penalized, so the TNN could not achieve better results. Many improved methods need prior knowledge to estimate some parameters. If the estimation was not correct, it was difficult to obtain better results. Considering the above problems, this paper used a non-convex tensor rank surrogate based on the Laplace function (Lap-NRS) to represent the rank of the tensor.
Here, Lap-NRS was defined as:
X ε = k = 1 n 3 i = 1 min ( n 1 , n 2 ) φ ( σ i ( X ¯ ( k ) ) )
where n 1 × n 2 × n 3 is the dimension of tensor X and φ ( x ) = 1 e x / ε represents a Laplace function. X ¯ ( k ) represents the Fourier transform result in the third dimension of the k -th frontal slice of tensor X and σ i represents the i -th singular value of the corresponding slice. ε represents a positive constant.
For the solution of Lap-NRS, we should consider a sub-problem as below:
arg min ε + γ 2 X F 2
where X R n 1 × n 2 × n 3 , γ was a positive constant and the t-SVD of X was X = U S V T . (11) can be solved with (12) and an extended weight singular value thresholding (WSVT) operator [51], as shown in (12) and (13):
D ¯ φ α ( k ) = ( S ¯ ( k ) φ ( σ i ( ¯ ( k ) ) ) α ) +
¯ ( k ) = U ¯ ( k ) * D ¯ φ α ( k ) * V ¯ ( k ) H
where α = γ n 3 and φ ( σ i ( ¯ ( k ) ) ) represents the derivative of the Laplace function at the singular value of the i -th positive slice in the Fourier domain of tensor . Algorithm 1 describes each iteration solution in (11).
Algorithm 1 Specific steps of the ADMM framework
Input: X , ;
Process:
1: Computer X ¯ = f f t ( X , [ ] , 3 ) ;
2: for k = 1 , [ ( n 3 + 1 ) / 2 ] do
     [ U ¯ ( k ) , S ¯ ( k ) , V ¯ ( k ) ] = S V D ( X ¯ ( k ) ) ;
3: Update D ¯ φ α ( k ) via (12);
4: Update ¯ ( k ) via (13);
5: end for;
6: for k = [ ( n 3 + 1 ) / 2 ] + 1 , , n 3 do
     ¯ ( k ) = c o n j ( ¯ ) n 3 k + 2 ;
7: end for;
8: Compute ¯ ( k + 1 ) = i f f t ( ¯ ( k ) , [ ] , 3 ) .

2.4. Solution of Lap-NRSSTT Mode

We used the alternating direction method of multipliers (ADMM) framework to solve this optimization problem, and the corresponding augmented Lagrangian function was as follows:
= A s t ε + λ S Ω s t 1 + β S Ω s t 1 + Y , A s t + S s t D s t + μ 2 A s t + S s t D s t F 2
where Y and μ > 0 denoted the Lagrange multiplier and penalty factor, and λ and β were regularization parameters, which were used to coordinate the balance of each component.
ADMM decomposed the minimization problem into several sub-problems, and the required A s t and S s t could be obtained through continuous iteration and updating. By solving the following subproblems, A s t and S s t were updated as follows:
( A s t ) k + 1 = arg min A s t ε + μ k 2 A s t + ( S s t ) k D s t + Y k μ k F 2
( S Ω s t ) k + 1 = arg min λ S Ω s t 1 + μ k 2 ( A s t ) k + 1 + S Ω s t D s t + Y k μ k F 2
( S Ω s t ) k + 1 = arg min β S Ω s t 1 + μ k 2 ( A s t ) k + 1 + S Ω s t D s t + Y k μ k F 2
( S s t ) k + 1 = ( S Ω s t ) k + 1 + ( S Ω s t ) k + 1
By introducing the relevant parameters, (15) could be solved by Algorithm 1. After updating A s t , S s t could be solved by combining (16), (17), (18) and the soft threshold operator. The corresponding results are as follows:
( S s t ) k + 1 = ( S Ω λ μ 1 + S Ω β μ 1 ) ( D s t ( A s t ) k + 1 Y k μ k )
where S Ω ( . ) is the soft threshold operator for cloudy regions:
S Ω ε ( x ) = s i g n ( x ) × max ( | x | ε , 0 )
The updates of Y and μ are as follows:
Y k + 1 = Y k + μ k ( ( A s t ) k + 1 + ( S s t ) k + 1 D s t )
μ k + 1 = ρ μ k
Finally, the specific process of solving the model by ADMM was shown in Algorithm 2.
Algorithm 2 ADMM for solving the proposed model.
Input: D s t , λ , β ;
Initialize: ( S s t ) 0 = ( A s t ) 0 = Y 0 = 0 , μ 0 = 0.0002 0, ρ = 1.05 , k = 0 t o l = 1 e 7 ;
While not converge do
1: Update ( A s t ) k + 1 by Algorithm 1;
2: Update ( S s t ) k + 1 by (19);
3: Update Y k + 1 by (21);
4: Update μ k + 1 by (22);
5: Check the convergence conditions
( A s t ) k + 1 + ( S s t ) k + 1 D s t F D s t F < t o l or ( S s t ) k 0 = ( S s t ) k + 1 0 ;
6: Update k: k = k + 1 ;
7: Output: ( A s t ) k , ( S s t ) k .

2.5. Steps of the Method

Figure 6 showed the entire course of infrared cirrus detection scheme based on proposed measure, which was described as follows:
  • Inputting the image. Given the current frame f i R n 1 × n 2 and its adjacent 2 × s frames f i + y R n 1 × n 2 ( y ( s , 1 ) ( 1 , s ) ) . Each frame of the image traversed the whole image through a sliding window of size m × m to obtain an N u m image patch;
  • Construction of the STT model. For each image of f i R n 1 × n 2 , an STT D s t R m × m × h was constructed according to the proposed model, where h = 9 × ( 2 × s + 1 ) ;
  • Using visual saliency to separate cloudy areas and cloudless areas. For each frontal slice of the input STT D s t , the visual saliency was calculated respectively, and STT S Ω s t R m × m × h and S Ω s t R m × m × h containing the prior information of the cirrus is obtained;
  • Separate background and cirrus. Taking STT D s t as the input tensor, D s t was decomposed into background patch tensor A s t and cirrus patch tensor S s t by Algorithm 2;
  • Image reconstruction and cirrus detection. The obtained background patch tensor A s t and cirrus patch tensor S s t were reconstructed to obtain the background image f a R n 1 × n 2 and the cirrus image f s R n 1 × n 2 . Then the detection result was obtained by one or more set threshold segmentation.

3. Experimental Results and Analysis

3.1. Experimental Preparations

This paper tests six representative cirrus infrared image sequences, as shown in Figure 7.
The experimental data are derived from the near-infrared band of the Landsat8 dataset [52]. From the diagram, Sequence a included some cirrus with a large imaging area and dense distribution. Sequence b is dotted, densely distributed in the upper right, and sparsely distributed in the lower left. Sequence c is a sparsely distributed massive cirrus image. Sequence d is a punctate cirrus concentrated in the lower right. Sequence e is a cirrus showing a silky state. Sequence f is a cirrus image with dense distribution of the whole image. These six sequences contain the form and distribution of most cirrus, making the experimental results more universal.
To receive appraisal for the proposed method objectively, it should be compared with the false alarm source detection method based on other optimization-based methods. Objective evaluation methods include receiver operating characteristic (ROC) curve, Precision-Recall (PR) curve, and F-measure.

3.2. Evaluation Metrics

The ROC curve and the PR curve are supervised evaluations, and real cirrus images need to be manually marked. The corresponding concepts of TP, FP, FN and TN are illustrated by the Table 1.
TP represents the number of pixels that are considered to be cirrus in the detection results and marked as cirrus in the ground truth. FP represents the number of pixels that are considered to not be cirrus in the detection results but marked as cirrus in the ground truth. FN represents the number of pixels that are considered to not be cirrus in the detection results but marked as cirrus in the ground truth. TN represents the number of pixels that are considered to not be cirrus in the detection results and not marked as cirrus in the ground truth.
The above four indicators cannot accurately represent the performance of the detection method. A detection measure with a large TP value may also have large FN and FP values. Therefore, it is necessary to consider combining multiple indicators to evaluate the performance of detection. TPR and FPR are two commonly used evaluation indexes, and the corresponding definitions are as below:
TPR = TP TP + FN
FPR = FP FP + TN
The ROC curve sets the abscissa as FPR and the ordinate as TPR. When a ROC curve obtained by a certain detection measure is close to the upper left corner, the area under the curve (AUC) is large, indicating that the measure has excellent detection performance. However, if the detection result image is fully marked as the cirrus, the TPR and FPR will be set to 1, so the evaluation effect of the ROC image is still good. In this case, we need another curve to evaluate the detection measure.
PR curves set the abscissa as recall and ordinate as precision. The corresponding definitions are as below:
precision = TP TP + FP
recall = TP TP + FN
When the PR curve is close to the upper right corner, the area under the curve is large, indicating that the method has excellent detection performance.
Although there is no necessary relationship between recall and precision, the two indicators are mutually restrictive. A useful measure to evaluate the conflict between them is F-measure (also known as F-score). F-measure is the weighted harmonic mean of precision and recall and can be described as follows:
F - measure = ( α 2 + 1 ) ( precision × recall ) α 2 × precision + recall
when α 2 is set to 1, the F-measure is also called F1-score.

3.3. Parameter Analysis

The proposed model contains some key parameters, such as image patch size, and regularization parameters λ and β . These parameters will affect the detection performance of the proposed model for different forms of cirrus and the robustness to various scenarios. Therefore, To achieve better performance in different scenarios, appropriate parameter settings should be selected. We selected the appropriate parameter by single variable measures and comparative experiments, and then analyzed the result according to the ROC curve and PR curve. In the experiment, when studying the optimization of a certain parameter, other parameters are fixed, and one of the parameters is adjusted.

3.3.1. Patch Size

The sliding window size represents the positive slice size of the STT, so the image patch size not only has a significant influence on the accuracy of the cirrus detection, but also affects the calculation time. On the one hand, we want to use larger image patches to ensure that the cirrus is sparse enough in a forward slice, but larger image patches will increase the computational complexity of singular value decomposition, thus increasing the computational time. On the other hand, a smaller image patch will reduce the processing complexity of the model, but it will also reduce the sparsity of the volume cloud, resulting in the need for more iterations to reach the iterative stopping condition in the process of optimization. To study the influence of image patch size, we changed the patch size from 40 to 80. The ROC curve and PR curve of the 6 test sequences are shown in Figure 8 and Figure 9. Considering the ROC curve, PR curve and running time, the size of the image patch is set to 60.

3.3.2. Regularization Parameter

In the proposed model, the regularization parameters λ and β control the balance between low-rank components and sparse components as the weights of cloudy and cloud-free regions. According to the empirical value, We set β = k × λ . A larger k means there are more small clouds in the cloud-free area. However, the mask in our study contains mostly cloudy areas, and only a few clouds may be in the cloud-free area, so the k is set to 25.
For the regularization parameter λ in the cloudy region, similar to the proposed tensor model for infrared detection, λ is set to λ = L / min ( n 1 , n 2 ) × n 3 . Change L from 0.02 to 0.1. The ROC curve and PR curve of the 6 test sequences are shown in Figure 10 and Figure 11. For a large-volume cirrus, changing the regularization parameter has little effect on the detection ability, but the corresponding operation time will increase due to the increase in optimization times. For a smaller cirrus, when λ continues to increase, the components belonging to the cloud will be separated from the low-rank components, resulting in a decrease in detection performance. Considering the ROC curve, PR curve and running time, when λ = 0.02 , the best performances can be obtained.

3.4. Method Comparison

It is difficult to qualitatively analyze the results of cirrus detection with different methods. (Some methods may perform better in local cirrus detection and perform poorly in other locations of the image. In this case, qualitative analysis is difficult to compare the advantages and disadvantages of various methods). Therefore, in this section, the quantitative analysis method will be used to compare the methods. In the proposed method, patch size is set to 60, regularization parameter λ = 0.02 , and we use 5 frames (2 frames before the current frame, the current frame, and 2 frames after the current frame) to build a tensor. The comparison indicators are the ROC curves, PR curves and F-measure values of various methods. The following methods are compared: IPI [44], LOGTFNN [30], PSTNN [6], TMESNN [29], KSVD fractal [28] and DivisorstepTP [53].
Figure 12 and Figure 13 shows the ROC curve and PR curve of all methods. As discussed above, the traditional optimization-based method uses a single regularization parameter, and the detection effect is poor for a larger cirrus. The method proposed in this paper can not only detect a large cirrus, but also has better detection performance than other methods based on optimization for a small cirrus with different distributions.
Table 2 shows the F-measure the above methods in 6 test image sequences. The bold values indicate the maximum value. It can be seen that the proposed method maintains a good detection effect.

4. Discussion

For cirrus detection, researchers have proposed methods based on physical model classes, machine learning and low-rank sparse decomposition. However, the use of physical model methods requires atmospheric radiation, geophysics and other related knowledge, while machine learning methods require a certain amount of data and corresponding labels as training samples, which is costly. The low-rank sparse decomposition measure fully utilizes the attributes of background and cirrus, which is closer to the essence of the image. In particular, the derived TRPCA model transferred the infrared image into a third-order tensor, and then used the low-rank nature of the infrared background and sparsity of the cirrus for detection, which results in a great detection accuracy.
The traditional IPT model is used to detect small targets. Compared with small targets, cirrus are larger and have a certain shape. Therefore, it is not feasible to directly apply the IPT model to cirrus detection. To ensure that the infrared tensor model can be successfully used for cirrus detection, we propose a cirrus detection method based on non-convex tensor rank surrogate. Firstly, to enhance the low rank of the cirrus tensor, time information is introduced to construct an STT model, which enhances the sparsity of the cirrus in a certain range. Secondly, after the experiment, it is found that a single balance coefficient cannot achieve the optimization solution. After that, the visual saliency is introduced as a priori to divide the cloudy and cloudless regions, and different balance coefficients are given. To solve the measure, an optimization method based on ADMM is designed. Finally, the detection results are segmented by one or more adaptive thresholds to obtain the final detection results.
This method is quantitatively compared with IPI, LOGTNN, TMESNN, PSTNN, KSVD fractal and DivisorstepTP. In Figure 12 and Figure 13 and Table 2, the ROC curve for the proposed method is closer to the upper left corner, and the PR curve of this method is closer to the upper right corner. In Table 2, the F-measure shows that the proposed method can achieve the best performance in all test sequences. In summary, the NTS measure has a great accuracy for cirrus sources.

5. Conclusions

We proposed an NTS-based cirrus detection measure, which focused on the accurate representation of background rank of infrared image and sparse enhancement of cirrus. For the purpose of representing the tensor rank, we used t-SVD decomposition and extended the non-convex surrogate based on Laplace function. To enhance the sparsity of the cirrus, by introducing spatial-temporal patches, an STT model conforming to the characteristics of infrared images was obtained through experimental comparison. By using the visual saliency of cirrus, a mask based on cirrus was generated, so that the improved model could be used for cirrus detection, which laid a foundation for the detection of false alarms in a similar large volume. To solve the model, an optimization method based on ADMM was designed. By combining the optimization function with ADMM, the problem was solved and its iteration was optimized. The experimental results showed that the measure could detect different forms of cirrus in different scenarios, and its quality indicators such as ROC curve, PR curve and F-measure also showed better performance than other optimization-based algorithms.
However, our proposed method is based on sequence images, and the construction of a spatial-temporal tensor is complex. In the process of solving the model, multiple singular value decomposition and optimization iterations are needed. Therefore, there is a lack of real-time performance. Additionally, our method is designed for small sample data and lacks scene applicability compared to deep learning methods. We can improve its performance by combining traditional features with deep learning in the future.

Author Contributions

All authors have contributed substantially to, and are in agreement with the content of, the manuscript. Conception/design, provision of study materials, and the collection and/or assembly of data: Conceptualization, S.X. and Z.P.; methodology, S.X. and F.L. and Z.P.; data analysis and interpretation: S.X. and F.L.; manuscript preparation: S.X. and Z.P.; final approval of the manuscript: S.X., F.L. and Z.P. The guarantor of the paper takes responsibility for the integrity of the work as a whole, from its inception to publication. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Natural Science Foundation of Sichuan Province of China (Grant No.2022NSFSC40574) and partially supported by National Natural Science Foundation of China (Grant No.61775030, Grant No.61571096).

Data Availability Statement

All data included in this study are available upon request by contact with the corresponding author.

Acknowledgments

The authors would thank Geospatial Data Cloud Website for providing the experimental data, and appreciate the support from the Laboratory of Imaging Detection and Intelligent Perception.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Huang, S.; Liu, Y.; He, Y.; Zhang, T.; Peng, Z. Structure-Adaptive Clutter Suppression for Infrared Small Target Detection: Chain-Growth Filtering. Remote Sens. 2020, 12, 47. [Google Scholar] [CrossRef]
  2. Wang, L.; Zhang, R.; Lin, Y.; Xu, S. Application in the military of the IR detection technology. Infrared Laser Eng. 2008, 37, 570–574. [Google Scholar]
  3. Liu, X.; Chen, Y.; Peng, Z.; Wu, J.; Wang, Z. Infrared image super-resolution reconstruction based on quaternion fractional order total variation with Lp quasinorm. Appl. Sci. 2018, 8, 1864. [Google Scholar] [CrossRef]
  4. Huang, S.; Peng, Z.; Wang, Z.; Wang, X.; Li, M. Infrared Small Target Detection by Density Peaks Searching and Maximum-Gray Region Growing. IEEE Geosci. Remote Sens. Lett. 2019, 16, 1919–1923. [Google Scholar] [CrossRef]
  5. Hou, W.; Sun, X.; Shang, Y.; Yu, Q. Present State and Perspectives of Small Infrared Targets Detection Technology. Infrared Technol. 2015, 37, 1–10. [Google Scholar]
  6. Zhang, L.; Peng, Z. Infrared Small Target Detection Based on Partial Sum of the Tensor Nuclear Norm. Remote Sens. 2019, 11, 382. [Google Scholar] [CrossRef]
  7. Zhang, T.; Wu, H.; Liu, Y.; Peng, L.; Yang, C.; Peng, Z. Infrared Small Target Detection Based on Non-Convex Optimization with Lp-Norm Constraint. Remote Sens. 2019, 11, 559. [Google Scholar] [CrossRef]
  8. Wang, X.; Peng, Z.; Kong, D.; He, Y. Infrared Dim and Small Target Detection Based on Stable Multisubspace Learning in Heterogeneous Scene. IEEE Trans. Geosci. Remote Sens. 2017, 55, 5481–5493. [Google Scholar] [CrossRef]
  9. Wang, X.; Peng, Z.; Zhang, P.; He, Y. Infrared Small Target Detection via Nonnegativity-Constrained Variational Mode Decomposition. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1700–1704. [Google Scholar] [CrossRef]
  10. Wang, X.; Peng, Z.; Kong, D.; Zhang, P.; He, Y. Infrared dim target detection based on total variation regularization and principal component pursuit. Image Vis. Comput. 2017, 63, 1–9. [Google Scholar] [CrossRef]
  11. Zhang, L.; Peng, L.; Zhang, T.; Cao, S.; Peng, Z. Infrared Small Target Detection via Non-Convex Rank Approximation Minimization Joint l2,1 Norm. Remote Sens. 2018, 10, 1821. [Google Scholar] [CrossRef]
  12. Zhang, T.; Li, L.; Cao, S.; Pu, T.; Peng, Z. Attention-Guided Pyramid Context Networks for Detecting Infrared Small Target Under Complex Background. IEEE Trans. Aerosp. Electron. Syst. 2023, 1–13. [Google Scholar] [CrossRef]
  13. Han, Y.; Liao, J.; Lu, T.; Pu, T.; Peng, Z. KCPNet: Knowledge-Driven Context Perception Networks for Ship Detection in Infrared Imagery. IEEE Trans. Geosci. Remote Sens. 2023, 61, 1–19. [Google Scholar] [CrossRef]
  14. Allen, R.C.; Durkee, P.A.; Wash, C.H. Snow/Cloud Discrimination with Multispectral Satellite Measurements. J. Appl. Meteorol. Climatol. 1990, 29, 994–1004. [Google Scholar] [CrossRef]
  15. Inoue, T. A cloud type classification with NOAA 7 split-window measurements. J. Geophys. Res. Atmos. 1987, 92, 3991–4000. [Google Scholar] [CrossRef]
  16. Smith, W.L.; Revercomb, H.E.; Howell, H.B.; Lin, M.X. Multi-Spectral Window Radiance Observations of Cirrus from Satellite and Aircraft, November 2, 1986 Project FIRE; NASA: Washington, DC, USA, 1990. [Google Scholar]
  17. Wind, G.; Platnick, S.; King, M.D.; Hubanks, P.A.; Pavolonis, M.J.; Heidinger, A.K.; Yang, P.; Baum, B.A. Multilayer Cloud Detection with the MODIS Near-Infrared Water Vapor Absorption Band. J. Appl. Meteorol. Climatol. 2010, 49, 2315–2333. [Google Scholar] [CrossRef]
  18. Rossow, W.B.; Mosher, F.; Kinsella, E.; Arking, A.; Desbois, M.; Harrison, E.; Minnis, P.; Ruprecht, E.; Seze, G.; Simmer, C.; et al. ISCCP Cloud Algorithm Intercomparison. J. Appl. Meteorol. Climatol. 1985, 24, 877–903. [Google Scholar] [CrossRef]
  19. Goodman, A.H.; Henderson-Sellers, A. Cloud detection and analysis: A review of recent progress. Atmos. Res. 1988, 21, 203–228. [Google Scholar] [CrossRef]
  20. Rossow, W.B. Measuring Cloud Properties from Space: A Review. J. Clim. 1989, 2, 201–213. [Google Scholar] [CrossRef]
  21. Liu, X.; Ping, S.J.; Huang, Y.J. Cloud automatic detection in high-resolution satellite images based on morphological features. In Proceedings of the Eleventh International Conference on Graphics and Image Processing, Hangzhou, China, 3–6 July 2020; p. 113730. [Google Scholar]
  22. Hollstein, A.; Segl, K.; Guanter, L.; Brell, M.; Enesco, M. Ready-to-Use Methods for the Detection of Clouds, Cirrus, Snow, Shadow, Water and Clear Sky Pixels in Sentinel-2 MSI Images. Remote Sens. 2016, 8, 666. [Google Scholar] [CrossRef]
  23. Kang, X.; Gao, G.; Hao, Q.; Li, S. A Coarse-to-Fine Method for Cloud Detection in Remote Sensing Images. IEEE Geosci. Remote Sens. Lett. 2019, 16, 110–114. [Google Scholar] [CrossRef]
  24. Gao, B.C.; Li, R.R. Removal of Thin Cirrus Scattering Effects in Landsat 8 OLI Images Using the Cirrus Detecting Channel. Remote Sens. 2017, 9, 834. [Google Scholar] [CrossRef]
  25. Zheng, Y.; Ling, W.; Shifei, T. A Lightweight Network for Remote Sensing Image Cloud Detection. In Proceedings of the 2022 IEEE 4th International Conference on Power, Intelligent Computing and Systems (ICPICS), Shenyang, China, 29–31 July 2022; pp. 644–649. [Google Scholar]
  26. Tang, X.; Yao, J.; Chen, J.; Li, G.; Zhang, W. Multimodel Fusion Method for Cloud Detection in Satellite Laser Footprint Images. IEEE Geosci. Remote Sens. Lett. 2022, 19, 1–5. [Google Scholar] [CrossRef]
  27. Luo, C.; Feng, S.; Yang, X.; Ye, Y.; Li, X.; Zhang, B.; Chen, Z.; Quan, Y. LWCDnet: A Lightweight Network for Efficient Cloud Detection in Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–16. [Google Scholar] [CrossRef]
  28. Lyu, Y.; Peng, L.; Pu, T.; Yang, C.; Wang, J.; Peng, Z. Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery. Remote Sens. 2020, 12, 142. [Google Scholar] [CrossRef]
  29. Yang, C.; Kong, X.; Cao, Z.; Peng, Z. Cirrus Detection Based on Tensor Multi-Mode Expansion Sum Nuclear Norm in Infrared Imagery. IEEE Access 2020, 8, 149963–149983. [Google Scholar] [CrossRef]
  30. Kong, X.; Yang, C.; Cao, S.; Li, C.; Peng, Z. Infrared Small Target Detection via Non-Convex Tensor Fibered Rank Approximation. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–21. [Google Scholar] [CrossRef]
  31. Wang, G.; Tao, B.; Kong, X.; Peng, Z. Infrared Small Target Detection Using Non-Overlapping Patch Spatial–Temporal Tensor Factorization With Capped Nuclear Norm Regularization. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–17. [Google Scholar] [CrossRef]
  32. Yi, H.; Yang, C.; Qie, R.; Liao, J.; Wu, F.; Pu, T.; Peng, Z. Spatial-Temporal Tensor Ring Norm Regularization for Infrared Small Target Detection. IEEE Geosci. Remote Sens. Lett. 2023, 20, 1–5. [Google Scholar] [CrossRef]
  33. Liu, H.K.; Zhang, L.; Huang, H. Small Target Detection in Infrared Videos Based on Spatio-Temporal Tensor Model. IEEE Geosci. Remote Sens. Lett. 2020, 58, 8689–8700. [Google Scholar] [CrossRef]
  34. Cao, Z.; Kong, X.; Zhu, Q.; Cao, S.; Peng, Z. Infrared dim target detection via mode-k1k2 extension tensor tubal rank under complex ocean environment. ISPRS J. Photogramm. Remote Sens. 2021, 181, 167–190. [Google Scholar] [CrossRef]
  35. Zhang, T.; Peng, Z.; Wu, H.; He, Y.; Li, C.; Yang, C. Infrared small target detection via self-regularized weighted sparse model. Neurocomputing 2021, 420, 124–148. [Google Scholar] [CrossRef]
  36. Yan, H.; Paynabar, K.; Shi, J. Image-Based Process Monitoring Using Low-Rank Tensor Decomposition. IEEE Trans. Autom. Sci. Eng. 2015, 12, 216–227. [Google Scholar] [CrossRef]
  37. Hassanzadeh, S.; Karami, A.; Heylen, R.; Scheunders, P. Compression of hyperspectral images using block coordinate descent search and compressed sensing. In Proceedings of the 2016 8th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS), Los Angeles, CA, USA, 21–24 August 2016; pp. 1–4. [Google Scholar]
  38. Gao, X.; Yang, Y.; Tao, D.; Li, X. Discriminative optical flow tensor for video semantic analysis. Comput. Vis. Image Underst. 2009, 113, 372–383. [Google Scholar] [CrossRef]
  39. Kong, X.; Zhao, Y.; Xue, J.; Chan, J.C.-W. Hyperspectral Image Denoising Using Global Weighted Tensor Norm Minimum and Nonlocal Low-Rank Approximation. Remote Sens. 2019, 11, 2281. [Google Scholar] [CrossRef]
  40. Kong, X.; Zhao, Y.; Xue, J.; Chan, C.W.; Kong, S.G. Global and Local Tensor Sparse Approximation Models for Hyperspectral Image Destriping. Remote Sens. 2020, 12, 704. [Google Scholar] [CrossRef]
  41. Xue, J.; Zhao, Y.; Liao, W.; Chan, J.C.W. Nonlocal Low-Rank Regularized Tensor Decomposition for Hyperspectral Image Denoising. IEEE Trans. Geosci. Remote Sens. 2019, 57, 5174–5189. [Google Scholar] [CrossRef]
  42. Xue, J.; Zhao, Y.; Liao, W.; Cheung-Wai Chan, J. Nonconvex tensor rank minimization and its applications to tensor recovery. Inf. Sci. 2019, 503, 109–128. [Google Scholar] [CrossRef]
  43. Goldfarb, D.; Qin, Z. Robust Low-Rank Tensor Recovery: Models and Algorithms. SIAM J. Matrix Anal. Appl. 2014, 35, 225–253. [Google Scholar] [CrossRef]
  44. Gao, C.; Meng, D.; Yang, Y.; Wang, Y.; Zhou, X.; Hauptmann, A.G. Infrared Patch-Image Model for Small Target Detection in a Single Image. IEEE Trans. Image Process. 2013, 22, 4996–5009. [Google Scholar] [CrossRef]
  45. Huang, B.; Mu, C.; Goldfarb, D.; Wright, J. Provable Models for Robust Low-Rank Tensor Completion. Pac. J. Optim. 2015, 11, 339–364. [Google Scholar]
  46. Lu, C.; Feng, J.; Chen, Y.; Liu, W.; Lin, Z.; Yan, S. Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 5249–5257. [Google Scholar]
  47. Xu, W.-H.; Zhao, X.-L.; Ji, T.-Y.; Miao, J.-Q.; Ma, T.-H.; Wang, S.; Huang, T.-Z. Laplace function based nonconvex surrogate for low-rank tensor completion. Signal Process. Image Commun. 2019, 73, 62–69. [Google Scholar] [CrossRef]
  48. Guan, X.; Zhang, L.; Huang, S.; Peng, Z. Infrared Small Target Detection via Nonconvex Tensor Rank Surrogate Joint Local Contrast Energy. Remote Sens. 2020, 12, 1520. [Google Scholar] [CrossRef]
  49. Liu, X.; Zhao, G.; Yao, J.; Qi, C. Background Subtraction Based on Low-Rank and Structured Sparse Decomposition. IEEE Trans. Image Process. 2015, 24, 2502–2514. [Google Scholar] [CrossRef] [PubMed]
  50. Achanta, R.; Hemami, S.; Estrada, F.; Süsstrunk, S. Frequency-tuned Salient Region Detection. In Proceedings of the IEEE Conference on Computer Vision Pattern Recognition (CVPR), Miami, FL, USA, 20–25 June 2009. [Google Scholar] [CrossRef]
  51. Chen, Y.; Guo, Y.; Wang, Y.; Wang, D.; Peng, C.; He, G. Denoising of Hyperspectral Images Using Nonconvex Low Rank Matrix Approximation. IEEE Geosci. Remote Sens. Lett. 2017, 55, 5366–5380. [Google Scholar] [CrossRef]
  52. Available online: https://www.gscloud.cn/home (accessed on 5 May 2022).
  53. Ju, W.; Lam, N.S.N. An improved algorithm for computing local fractal dimension using the triangular prism method. Comput. Geosci. 2009, 35, 1224–1233. [Google Scholar] [CrossRef]
Figure 1. Comparison of the contribution of the l 0 n o r m , l 1 n o r m , and the Laplace function to the rank.
Figure 1. Comparison of the contribution of the l 0 n o r m , l 1 n o r m , and the Laplace function to the rank.
Remotesensing 15 02334 g001
Figure 2. Spatial-temporal tensor construction.
Figure 2. Spatial-temporal tensor construction.
Remotesensing 15 02334 g002
Figure 3. Spatial patch-tensor construction.
Figure 3. Spatial patch-tensor construction.
Remotesensing 15 02334 g003
Figure 4. The i-th singular values of unfolding matrices for infrared tensor D s t . (a) Cirrus image. (bd) Singular values of mode-1, mode-2, and mode-3 unfolding matrices.
Figure 4. The i-th singular values of unfolding matrices for infrared tensor D s t . (a) Cirrus image. (bd) Singular values of mode-1, mode-2, and mode-3 unfolding matrices.
Remotesensing 15 02334 g004
Figure 5. Cirrus image and corresponding mask image. (ac) represents three classical clouds. (df) are the corresponding masks after morphological processing.
Figure 5. Cirrus image and corresponding mask image. (ac) represents three classical clouds. (df) are the corresponding masks after morphological processing.
Remotesensing 15 02334 g005
Figure 6. The whole process of the proposed algorithm.
Figure 6. The whole process of the proposed algorithm.
Remotesensing 15 02334 g006
Figure 7. (af) represent 6 original infrared images of different scenes.
Figure 7. (af) represent 6 original infrared images of different scenes.
Remotesensing 15 02334 g007
Figure 8. ROC curves of different patch sizes in the 6 sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Figure 8. ROC curves of different patch sizes in the 6 sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Remotesensing 15 02334 g008
Figure 9. PR curves of different patch sizes in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Figure 9. PR curves of different patch sizes in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Remotesensing 15 02334 g009
Figure 10. ROC curves of different regularization parameters in the 6 sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Figure 10. ROC curves of different regularization parameters in the 6 sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Remotesensing 15 02334 g010
Figure 11. PR curves of different regularization parameters in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Figure 11. PR curves of different regularization parameters in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Remotesensing 15 02334 g011
Figure 12. ROC curves of different regularization parameters in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Figure 12. ROC curves of different regularization parameters in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Remotesensing 15 02334 g012
Figure 13. PR curves of different methods in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Figure 13. PR curves of different methods in the 6-sequence images. (a) Sequence 1; (b) Sequence 2; (c) Sequence 3; (d) Sequence 4; (e) Sequence 5; (f) Sequence 6.
Remotesensing 15 02334 g013
Table 1. Concepts of TP, FP, FN and TN.
Table 1. Concepts of TP, FP, FN and TN.
Actual PositiveActual Negative
predicted positiveTPFP
predicted negativeFNTN
Table 2. F-measure of six methods.
Table 2. F-measure of six methods.
.IPILOGTFNNPSTNNKSVD FractalTMESNNDivisorstepTPProposed
Seq10.12570.33240.00590.63030.14300.29860.8374
Seq20.70080.44950.57110.74780.68320.34200.7612
Seq30.27860.37060.25540.55630.17770.13870.8129
Seq40.76450.67850.76450.79600.67400.58420.8443
Seq50.68630.55870.39420.75480.61290.54050.8874
Seq60.63030.67690.73950.82050.64630.58490.8488
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xiao, S.; Peng, Z.; Li, F. Infrared Cirrus Detection Using Non-Convex Rank Surrogates for Spatial-Temporal Tensor. Remote Sens. 2023, 15, 2334. https://doi.org/10.3390/rs15092334

AMA Style

Xiao S, Peng Z, Li F. Infrared Cirrus Detection Using Non-Convex Rank Surrogates for Spatial-Temporal Tensor. Remote Sensing. 2023; 15(9):2334. https://doi.org/10.3390/rs15092334

Chicago/Turabian Style

Xiao, Shengyuan, Zhenming Peng, and Fusong Li. 2023. "Infrared Cirrus Detection Using Non-Convex Rank Surrogates for Spatial-Temporal Tensor" Remote Sensing 15, no. 9: 2334. https://doi.org/10.3390/rs15092334

APA Style

Xiao, S., Peng, Z., & Li, F. (2023). Infrared Cirrus Detection Using Non-Convex Rank Surrogates for Spatial-Temporal Tensor. Remote Sensing, 15(9), 2334. https://doi.org/10.3390/rs15092334

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop