Article

New Combined Metric for Full-Reference Image Quality Assessment

Department of Data Science and Engineering, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
*
Author to whom correspondence should be addressed.
Symmetry 2024, 16(12), 1622; https://doi.org/10.3390/sym16121622
Submission received: 14 November 2024 / Revised: 2 December 2024 / Accepted: 5 December 2024 / Published: 7 December 2024
(This article belongs to the Section Computer)

Abstract

In recent years, many new metrics highly correlated with the Mean Opinion Score (MOS) have been proposed for assessing image quality through Full-Reference Image Quality Assessment (FR-IQA) methods, such as MDSI, HPSI, and GMSD. Eight of these selected metrics, which compare reference and distorted images in a symmetrical manner, are briefly described in this article, and their performance is evaluated using correlation criteria (PLCC, SROCC, and KROCC), as well as RMSE. The aim of this paper is to develop a new, efficient quality index based on a combination of several high-performance metrics already utilized in the field of Image Quality Assessment (IQA). The study was conducted on four benchmark image databases (TID2008, TID2013, KADID-10k, and PIPAL) and identified the three best-performing metrics for each database. The paper introduces a New Combined Metric (NCM), which is a weighted sum of three component metrics, and demonstrates its superiority over each of its component metrics across all the examined databases. An optimization method for determining the weights of the NCM is also presented. Additionally, an alternative version of the combined metric, based on the fastest metrics and employing symmetric calculations for pairs of compared images, is discussed. This version also demonstrates strong performance.

1. Introduction

Every day, a huge number of digital cameras generate a vast stream of images. Due to the multitude of applications of imaging devices in areas such as vision-based quality control of components in manufacturing processes, security monitoring, object detection systems in automotive applications, and analysis of diagnostic images in medicine, there has been a strong increase in the demand for Image Quality Assessment (IQA) methods.
Image quality can be assessed either subjectively or objectively. Subjective methods rely on the perceptual evaluation of image quality by human observers, which means that conducting these assessments incurs significant financial costs and requires a large number of participants. In contrast, objective methods utilize mathematical models to determine the values of various metrics related to image quality. Among Image Quality Assessment (IQA) methods, the most advanced are those that perform a symmetrical comparison between distorted images and their originals, referred to as Full-Reference IQA (FR-IQA) techniques. The scores generated by each IQA metric can be evaluated against subjective assessments, such as the Mean Opinion Score (MOS) or the Difference Mean Opinion Score (DMOS), derived from human viewers. For FR-IQA methods, the correlation coefficients obtained from comparisons with MOS indicate the effectiveness of the metric: the higher the coefficient, the more closely the metric aligns with human perception. For many years, efforts have been made in the field of FR-IQA to improve and refine existing quality metrics. Significant importance is attached to attempts to combine one metric with other quality measures to increase the correlation of the resulting quality index with MOS. Meanwhile, the rapid development of machine-learning and deep-learning techniques provides new methods for image quality assessment. The FR-IQA problem can be understood as a challenge in developing mathematical models that can perceptually assess the image quality in alignment with human judgment.
Over the years, numerous metrics for FR-IQA have been proposed that take various aspects of the Human Visual System (HVS) into account. Recently, attempts have been made to enhance the effectiveness of FR-IQA by combining existing metrics to create a “super” index. Theoretical foundations for such metric fusion can be found in Liu’s work [1], where it is applied to classical metrics such as PSNR, VSNR, SSIM, and VIF. In the paper by Okarma [2], the properties of three FR-IQA metrics (MS-SSIM, VIF, and R-SVD) were analyzed, and a combined quality metric based on their product was proposed. It is named the Combined Quality Metric (CQM), and all three of its correlation coefficients with MOS are higher than those of the individual factors of the product.
Later, this concept was further developed using optimization or regression techniques to determine the optimal weights or exponents in the products of existing FR-IQA indices [3,4]. The Combined Image Similarity Index (CISI) proposed by Okarma in [3] employs metrics similar to those used in the CQM. However, instead of the R-SVD metric, it utilizes the FSIMc metric. The CISI index demonstrates a higher correlation with Mean Opinion Score (MOS) compared to the CQM index. In the 2013 article by Okarma [4], a new EHIS metric is introduced, which is based on the product of four multipliers: two familiar from CQM (MS-SSIM and VIF) and two new ones (WFSIMc and RFSIM). This approach improves the correlation with MOS over the combined CISI metric.
Another metrics fusion strategy was proposed by the author of [5], who presented several versions of a linear combination (weighted sum) based on metrics selected from a dozen FR-IQA metrics. He referred to these new combined metrics as Linearly Combined Similarity Measures (LCSIM). The use of linear combination metrics requires determining the weighting factors for the FR-IQA metrics, which is achieved by solving the RMSE error minimization task using a genetic algorithm.
Another line of work in creating methods based on the fusion of metrics utilizes machine-learning techniques. One example can be seen in [6], where the results of six traditional FR-IQA metrics (FSIMc, PSNR-HMA, PSNR-HVS, SFF, SR-SIM, and VIF) were used as a feature vector for training and testing a four-layer neural network. The output results produced by the neural network demonstrate a significant improvement over those achieved by the input metrics. Currently, deep neural networks, particularly CNNs, can learn the best combinations of metrics to optimize image quality assessment [7,8].
Combining metrics leads to increased computational complexity, which can be an issue in the context of real-time applications. Nevertheless, the fusion of quality metrics for FR-IQA is a promising research direction that could significantly improve the correlation of objective quality assessments with MOS.
In this paper, we consider a new combined metric for FR-IQA, based on metrics that are highly correlated with MOS and well-known from the literature. The structure of this paper is as follows: following the introduction, Section 2 provides an overview of eight relatively new and promising FR-IQA metrics. Section 3 presents their linear combination, along with the experimental results. Finally, Section 4 concludes the paper.

2. Overview of Highly Correlated FR-IQA Metrics

2.1. Feature SIMilarity Index FSIMc (Color Version)

In [9], the Feature SIMilarity Index (FSIM) was introduced as a quality assessment metric for grayscale images, along with its color version, FSIMc. The local quality of the assessed image, $f_2$, when symmetrically compared to the reference image, $f_1$, is represented by two low-level similarity maps derived from phase congruences ($PC_1$, $PC_2$) [10]:
$$S_{PC}(x,y) = \frac{2\,PC_1(x,y)\cdot PC_2(x,y) + T_1}{PC_1^2(x,y) + PC_2^2(x,y) + T_1},$$
and gradient magnitudes ($G_1$, $G_2$):
$$S_G(x,y) = \frac{2\,G_1(x,y)\cdot G_2(x,y) + T_2}{G_1^2(x,y) + G_2^2(x,y) + T_2},$$
where $G_1$ and $G_2$ are the gradient magnitudes of the two images, computed with the Scharr operator, and $T_1$ and $T_2$ are positive constants introduced to enhance the stability of the formulas. Phase Congruence (PC) quantifies the presence and intensity of local features, including edges, corners, and textures. It is derived from the analysis of phase information in the frequency domain of the image (specifically, the phase of its Fourier transform). As an additional weighting factor for the similarity maps, the following $PC_m$ is used:
$$PC_m(x,y) = \max\bigl(PC_1(x,y),\, PC_2(x,y)\bigr).$$
The final expression for the proposed image quality index is given by:
$$FSIM = \frac{\sum_{(x,y)\in\Omega} S_{PC}(x,y)\cdot S_G(x,y)\cdot PC_m(x,y)}{\sum_{(x,y)\in\Omega} PC_m(x,y)},$$
where $\Omega$ is the spatial domain of the image. FSIMc, the color extension of FSIM, incorporates the chrominance components I and Q. The calculation of the index values starts by decomposing the compared images, $f_1$ and $f_2$, into their YIQ color components, where Y represents luminance and I and Q represent chrominance. Similar to the earlier defined similarity maps, additional similarity maps for the I and Q components were introduced:
$$S_I(x,y) = \frac{2\,I_1(x,y)\cdot I_2(x,y) + T_3}{I_1^2(x,y) + I_2^2(x,y) + T_3},$$
$$S_Q(x,y) = \frac{2\,Q_1(x,y)\cdot Q_2(x,y) + T_4}{Q_1^2(x,y) + Q_2^2(x,y) + T_4},$$
where $T_3$ and $T_4$ are positive constants. The overall chrominance similarity map is given by:
$$S_C(x,y) = S_I(x,y)\cdot S_Q(x,y).$$
The inclusion of chromatic components in the FSIM index results in the following version of the formula for color images:
$$FSIM_c = \frac{\sum_{(x,y)\in\Omega} S_{PC}(x,y)\cdot S_G(x,y)\cdot [S_C(x,y)]^{\lambda}\cdot PC_m(x,y)}{\sum_{(x,y)\in\Omega} PC_m(x,y)},$$
where the positive value of the exponent $\lambda$ controls the significance of chrominance in the color image quality assessment process. For subsequent studies utilizing FSIMc, the following parameter values were employed, as specified in [9]: $T_1 = 0.85$, $T_2 = 160$, $T_3 = T_4 = 200$, and $\lambda = 0.03$.
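The pooling structure of the grayscale FSIM formula can be sketched in a few lines of Python. This is only a minimal illustration, not the reference implementation from [9]: it assumes that the phase congruency and gradient magnitude maps have already been computed and are passed in as flat lists of per-pixel values, and the function names `similarity` and `fsim` are chosen here for illustration. The default constants are the $T_1$ and $T_2$ values quoted above.

```python
def similarity(a, b, t):
    """Symmetric similarity of two feature values, stabilised by a constant t > 0."""
    return (2.0 * a * b + t) / (a * a + b * b + t)


def fsim(pc1, pc2, g1, g2, t1=0.85, t2=160.0):
    """Pool the S_PC and S_G similarity maps, weighted by PC_m = max(PC_1, PC_2).

    pc1, pc2: phase congruency values of the two images (flat lists).
    g1, g2:   gradient magnitudes of the two images (flat lists).
    Assumes at least one pixel has non-zero phase congruency.
    """
    numerator = 0.0
    denominator = 0.0
    for p1, p2, m1, m2 in zip(pc1, pc2, g1, g2):
        pc_m = max(p1, p2)             # weighting factor PC_m
        s_pc = similarity(p1, p2, t1)  # phase congruency similarity S_PC
        s_g = similarity(m1, m2, t2)   # gradient similarity S_G
        numerator += s_pc * s_g * pc_m
        denominator += pc_m
    return numerator / denominator
```

Because both similarity terms equal 1 when the two inputs coincide, comparing an image with itself yields a score of 1, and any difference in the feature maps pushes the score below 1.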

2.2. Mean Deviation Similarity Index MDSI

Many IQA metrics work as follows: they determine local distortions in the images, build similarity maps, and pool these maps using the mean, a weighted mean, the standard deviation, or a similar statistic. An example of this approach to IQA index modeling is the Mean Deviation Similarity Index (MDSI), described in [11]. The calculation of MDSI starts with converting the RGB color space components of the input images to a luminance component:
$$L = 0.2989\,R + 0.5870\,G + 0.1140\,B$$
and two chromaticity components:
$$\begin{bmatrix} H \\ M \end{bmatrix} = \begin{bmatrix} 0.30 & 0.04 & -0.35 \\ 0.34 & -0.6 & 0.17 \end{bmatrix} \begin{bmatrix} R \\ G \\ B \end{bmatrix}.$$
This index is derived from the calculation of gradient similarity ($GS$) for structural distortions and chromaticity similarity ($CS$) for color distortions. The local structural similarity map is typically computed using gradient values. Traditionally, structural similarity maps are obtained by calculating the gradient values separately for the original and distorted images. MDSI refines this traditional approach by additionally using the gradient map of a fused image that averages the luminance channels of both images:
$$f = 0.5\,(L_r + L_d),$$
where $f$ represents the fused luminance image and $L_r$ and $L_d$ are the luminance channels of the reference image $r$ and the distorted image $d$, respectively. The formulas for the proposed structural similarity are given below:
$$GS_{rf}(x) = \frac{2\,G_r(x)\,G_f(x) + C_2}{G_r^2(x) + G_f^2(x) + C_2},$$
$$GS_{df}(x) = \frac{2\,G_d(x)\,G_f(x) + C_2}{G_d^2(x) + G_f^2(x) + C_2},$$
$$\widehat{GS}(x) = GS(x) + \bigl[GS_{df}(x) - GS_{rf}(x)\bigr],$$
where $GS(x)$ denotes the conventional gradient similarity computed directly between $G_r$ and $G_d$. The gradient magnitude is calculated using the simple Prewitt operator. Additionally, the authors of the MDSI index have adjusted the method for evaluating local chromaticity similarity. Earlier IQA metrics, such as FSIM or VSI, assess chromaticity separately for the two chrominance components and then combine the results. In the case of MDSI, it was suggested to calculate the color similarity for both chrominance components simultaneously, using the following formula:
$$\widehat{CS}(x) = \frac{2\,\bigl(H_r(x)\,H_d(x) + M_r(x)\,M_d(x)\bigr) + C_3}{H_r^2(x) + H_d^2(x) + M_r^2(x) + M_d^2(x) + C_3},$$
where $C_3$ is a constant introduced for numerical stability. The joint color similarity map, $\widehat{CS}(x)$, is then combined with the $\widehat{GS}(x)$ map using a weighted mean:
$$\widehat{GCS}(x) = \alpha\,\widehat{GS}(x) + (1 - \alpha)\,\widehat{CS}(x),$$
where $\alpha$ determines the relative importance of the similarity maps $\widehat{GS}(x)$ and $\widehat{CS}(x)$. The final step converts the resulting $\widehat{GCS}$ map into an MDSI score through deviation pooling, i.e., a mean absolute deviation computed on the fourth roots of the map values:
$$MDSI = \left[\frac{1}{N}\sum_{i=1}^{N}\left|\widehat{GCS}_i^{1/4} - \left(\frac{1}{N}\sum_{i=1}^{N}\widehat{GCS}_i^{1/4}\right)\right|\right]^{1/4}.$$
The original article [11] provides suggestions for selecting various parameters that influence the performance of the MDSI index.
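Two steps of the MDSI pipeline, the color conversion and the final deviation pooling, can be illustrated with the short Python sketch below. It is a simplified illustration rather than the implementation from [11]: the function names are hypothetical, the similarity map is passed in as a flat list, and the map values are assumed to be non-negative real numbers so that the fourth root can be taken directly.

```python
def rgb_to_lhm(r, g, b):
    """Convert one RGB pixel to luminance L and chromaticity components H and M."""
    lum = 0.2989 * r + 0.5870 * g + 0.1140 * b
    h = 0.30 * r + 0.04 * g - 0.35 * b
    m = 0.34 * r - 0.60 * g + 0.17 * b
    return lum, h, m


def deviation_pooling(gcs):
    """Fourth root of the mean absolute deviation of the fourth roots of the map.

    gcs: flat list of combined similarity values (assumed non-negative here).
    """
    roots = [v ** 0.25 for v in gcs]
    mean_root = sum(roots) / len(roots)
    mad = sum(abs(q - mean_root) for q in roots) / len(roots)
    return mad ** 0.25
```

A perfectly uniform similarity map has zero deviation and therefore a score of 0, so with this pooling a larger value indicates a stronger, more unevenly distributed distortion.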

2.3. Haar Wavelet Perceptual Similarity Index HPSI

The Haar Wavelet-based Perceptual Similarity Index (HPSI) [12] is a relatively novel and computationally efficient similarity metric for FR-IQA. HPSI uses coefficients obtained from Haar wavelet decomposition for assessing local similarities between two images. The one-dimensional Haar filters are given by:
$$h_1^{1D} = \frac{1}{\sqrt{2}}\cdot[1,\ 1],$$
$$g_1^{1D} = \frac{1}{\sqrt{2}}\cdot[-1,\ 1],$$
where $h_1^{1D}$ represents the low-pass scaling filter and $g_1^{1D}$ refers to the corresponding high-pass wavelet filter. For any scale, $j \in \mathbb{N}$, two-dimensional Haar filters can be constructed as follows:
$$g_j^{(1)} = g_j^{1D} \otimes h_j^{1D},$$
$$g_j^{(2)} = h_j^{1D} \otimes g_j^{1D},$$
where the symbol $\otimes$ denotes the outer product, and the one-dimensional filters $h_j^{1D}$ and $g_j^{1D}$ for $j > 1$ are defined as:
$$g_j^{1D} = h_1^{1D} * \bigl(g_{j-1}^{1D}\bigr)_{\uparrow 2},$$
$$h_j^{1D} = h_1^{1D} * \bigl(h_{j-1}^{1D}\bigr)_{\uparrow 2},$$
where $\uparrow 2$ is the dyadic upsampling operator and $*$ denotes the one-dimensional convolution operator. To effectively predict the perceptual similarity perceived by human viewers, it may be beneficial to apply an additional nonlinear mapping to the local similarities derived from the high-frequency Haar wavelet filter responses. This nonlinearity is represented by a logistic function, defined with a parameter $\alpha > 0$, as follows:
$$l_\alpha(x) = \frac{1}{1 + e^{-\alpha x}}.$$
For two grayscale images, $f_1$ and $f_2$, the local similarity measure employed to calculate the HPSI is derived from the first two steps of the two-dimensional discrete Haar wavelet transform, as expressed by the following formula:
$$HS_{f_1,f_2}^{(k)}[x] = l_\alpha\!\left(\frac{1}{2}\sum_{j=1}^{2} S\Bigl(\bigl|(g_j^{(k)} * f_1)[x]\bigr|,\ \bigl|(g_j^{(k)} * f_2)[x]\bigr|,\ C\Bigr)\right),$$
where $C > 0$, $k \in \{1, 2\}$ selects either horizontal or vertical Haar wavelet filters, $S(a, b, C) = (2ab + C)/(a^2 + b^2 + C)$ denotes the similarity measure, and $*$ is the two-dimensional convolution operator. Similar to FSIMc, HPSI also applies a specific weighting map, which is derived here from the response of a single low-frequency Haar wavelet filter:
$$W_f^{(k)}[x] = \bigl|(g_3^{(k)} * f)[x]\bigr|,$$
where $k \in \{1, 2\}$ again differentiates between horizontal and vertical filters. The final expression for the HPSI for grayscale images $f_1$ and $f_2$ is provided as a weighted average of the local similarity map, $HS_{f_1,f_2}^{(k)}$:
$$HPSI_{f_1,f_2} = l_\alpha^{-1}\!\left(\frac{\sum_x \sum_{k=1}^{2} HS_{f_1,f_2}^{(k)}[x]\cdot W_{f_1,f_2}^{(k)}[x]}{\sum_x \sum_{k=1}^{2} W_{f_1,f_2}^{(k)}[x]}\right)^{2},$$
where:
$$W_{f_1,f_2}^{(k)}[x] = \max\bigl(W_{f_1}^{(k)}[x],\ W_{f_2}^{(k)}[x]\bigr).$$
HPSI can be extended for color images in the YIQ color space using a third local similarity map based on the chrominance components I and Q. This map, $HS_{f_1,f_2}^{(3)}$, is defined as:
$$HS_{f_1,f_2}^{(3)}[x] = l_\alpha\!\left(\frac{1}{2}\Bigl(S\bigl(|(m * f_1^I)[x]|,\ |(m * f_2^I)[x]|,\ C\bigr) + S\bigl(|(m * f_1^Q)[x]|,\ |(m * f_2^Q)[x]|,\ C\bigr)\Bigr)\right),$$
where $m$ is a $2 \times 2$ mean filter. The corresponding weight map is the average of the two luminance weight maps:
$$W_{f_1^Y,f_2^Y}^{(3)}[x] = \frac{1}{2}\Bigl(W_{f_1^Y,f_2^Y}^{(1)}[x] + W_{f_1^Y,f_2^Y}^{(2)}[x]\Bigr).$$
The final form of the HPSI for color images is defined as follows:
$$HPSIc_{f_1,f_2} = l_\alpha^{-1}\!\left(\frac{\sum_x \sum_{k=1}^{3} HS_{f_1,f_2}^{(k)}[x]\cdot W_{f_1^Y,f_2^Y}^{(k)}[x]}{\sum_x \sum_{k=1}^{3} W_{f_1^Y,f_2^Y}^{(k)}[x]}\right)^{2}.$$
The fast computation time of the HPSI may explain its high usefulness in various tasks.
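The recursive construction of the Haar filters and the logistic mapping can be sketched in Python as follows. This is a minimal illustration written for this overview and not the code accompanying [12]: the helper names are hypothetical, the dyadic upsampling is assumed to insert one zero between neighbouring filter taps, and no claim is made about which of the two returned 2-D filters corresponds to the horizontal direction.

```python
import math

H1 = [1 / math.sqrt(2), 1 / math.sqrt(2)]    # low-pass scaling filter
G1 = [-1 / math.sqrt(2), 1 / math.sqrt(2)]   # high-pass wavelet filter


def convolve(a, b):
    """Full one-dimensional convolution of two coefficient lists."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def upsample2(f):
    """Dyadic upsampling: insert one zero between neighbouring taps (assumption)."""
    out = []
    for v in f[:-1]:
        out += [v, 0.0]
    out.append(f[-1])
    return out


def haar_filters_1d(j):
    """Return (h_j, g_j), built recursively from the scale-1 filters."""
    h, g = H1, G1
    for _ in range(j - 1):
        h, g = convolve(H1, upsample2(h)), convolve(H1, upsample2(g))
    return h, g


def haar_filters_2d(j):
    """Return the two 2-D filters g_j^(1) and g_j^(2) as outer products."""
    h, g = haar_filters_1d(j)
    outer = lambda u, v: [[a * b for b in v] for a in u]
    return outer(g, h), outer(h, g)


def logistic(x, alpha):
    return 1.0 / (1.0 + math.exp(-alpha * x))


def logistic_inv(y, alpha):
    """Inverse of the logistic function, valid for 0 < y < 1."""
    return math.log(y / (1.0 - y)) / alpha
```

At scale 2 this construction gives a four-tap averaging filter and a four-tap step filter, and the filter length doubles with every further scale.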

2.4. Visual Saliency with Color Appearance and Gradient Similarity Index VCGS

The VCGS index [13] uses the CIELAB color space and combines three feature similarity maps: the visual saliency with color appearance similarity map, $S_{VC}$, the gradient similarity map, $S_G$, and the chrominance similarity map, $S_C$. The first of these maps is calculated using a formula based on visual saliency with color appearance (VC) for both images:
$$S_{VC} = \frac{2\,VC_1\cdot VC_2 + K_{VC}}{VC_1^2 + VC_2^2 + K_{VC}},$$
where $K_{VC}$ is a small constant that controls the numerical stability of the formula. The gradient similarity map, obtained with the Scharr operator applied to the L component, is calculated according to the formula:
$$S_G = \frac{2\,G_1\cdot G_2 + K_G}{G_1^2 + G_2^2 + K_G},$$
where $K_G$ is a small constant that controls the numerical stability of the formula. The third map measures the similarity of the $a^*$ and $b^*$ chrominance components in the CIELAB color space and is given by the formula:
$$S_C = \frac{2\,a_1\cdot a_2 + K_C}{a_1^2 + a_2^2 + K_C}\cdot\frac{2\,b_1\cdot b_2 + K_C}{b_1^2 + b_2^2 + K_C},$$
where $K_C$ is a small constant that controls the numerical stability of the formula. The final form of the VCGS metric is given by the following formula:
$$VCGS = \frac{\sum_{\Omega} S_{VC}\cdot S_G^{\alpha}\cdot S_C^{\lambda}\cdot VC_m}{\sum_{\Omega} VC_m},$$
where $\Omega$ is the spatial domain, $VC_m = \max(VC_1, VC_2)$ weights the contribution of each location to the overall similarity, and the exponents $\alpha$ and $\lambda$ control the relative importance of the gradient and chrominance similarity maps.

2.5. SuperPixel SIMilarity Index SPSIM

Superpixel-based SIMilarity (SPSIM) [14] utilizes superpixel segmentation for feature extraction. Superpixels are clusters of neighboring pixels that share similar characteristics, such as color, intensity, or structure. This pixel grouping results in a mosaic consisting of a significantly smaller number of superpixels, which facilitates faster subsequent processing. A key advantage of using superpixel-based segmentation over other oversegmentation algorithms is the ability to predefine the number of generated superpixels. Additionally, superpixel segmentation improves the distinction of perceptually significant regions in the image. Among the various superpixel generation algorithms, we can distinguish between graph-based, gradient-based, clustering-based, and watershed-based methods, among others [15]. The shape and size of superpixels depend on the applied algorithm, with each pixel belonging to exactly one superpixel. These algorithms control the number and properties of the superpixels, such as compactness and minimum size. One of the most popular and efficient algorithms for superpixel segmentation is the k-means-based Simple Linear Iterative Clustering (SLIC) algorithm [16]. This algorithm is notable for producing superpixels with a consistent shape and size. A key benefit of SLIC is that segmentation only requires specifying the desired number of superpixels in the output image. Consequently, the SLIC algorithm is used in the SPSIM quality index discussed in this paper. For each superpixel, the algorithm calculates the mean CIELAB color values and the Local Binary Pattern (LBP) features. Superpixels are initially generated on the reference image and then applied to both the reference and distorted images.
The SPSIM index calculation algorithm relies on pixel gradient similarity and luminance-chrominance superpixel similarity. The YUV color space, rather than RGB, is utilized for SPSIM computation, where Y represents luminance and U and V denote chrominance components. If $s_i$ is used to represent the superpixel containing pixel $i$, the following formulas can be written for the luminance $L_i$ and the luminance similarity $M_L(i)$:
$$L_i = \frac{1}{|s_i|}\sum_{j\in s_i} Y(j), \qquad M_L(i) = \frac{2\,L_r(i)\,L_d(i) + T_1}{L_r^2(i) + L_d^2(i) + T_1},$$
where $Y(j)$ represents the luminance of pixel $j$ and $L_r(i)$ and $L_d(i)$ denote the average luminance values for superpixel $s_i$ in the reference and distorted images, respectively. $T_1$ is a positive constant introduced to prevent instability in the equation. Similar expressions can be formulated for both the U and V chrominance components:
$$U_i = \frac{1}{|s_i|}\sum_{j\in s_i} U(j), \qquad M_U(i) = \frac{2\,U_r(i)\,U_d(i) + T_1}{U_r^2(i) + U_d^2(i) + T_1},$$
$$V_i = \frac{1}{|s_i|}\sum_{j\in s_i} V(j), \qquad M_V(i) = \frac{2\,V_r(i)\,V_d(i) + T_1}{V_r^2(i) + V_d^2(i) + T_1}.$$
The chrominance similarity, $M_C$, can then be calculated as shown below:
$$M_C(i) = M_U(i)\,M_V(i).$$
The gradient similarity, $M_G$, is described by the following formula:
$$M_G(i) = \frac{2\,G_r(i)\,G_d(i) + T_2}{G_r^2(i) + G_d^2(i) + T_2},$$
where the gradient magnitude, $G$, is composed of two components calculated using a simple Prewitt operator, and $T_1$ and $T_2$ are constants selected by the authors of the algorithm to account for contrast-related errors. Further information on the determination of $T_1$ and $T_2$ can be found in [14]. The formula for calculating the similarity at pixel $i$ in both images is as follows:
$$M(i) = M_G(i)\,\bigl[M_L(i)\bigr]^{\alpha}\,e^{\beta\,(M_C(i) - 1)},$$
where the parameters $\alpha$ and $\beta$ represent the weights for the luminance and chrominance components, respectively. Finally, the SPSIM index is calculated as a weighted average of $M(i)$ with weights $w(i)$, which are determined based on the texture complexity ($TC$), described by the standard deviation ($std$) and kurtosis ($Kurt$) of the superpixels:
$$TC_r(i) = \frac{std(S_r(i))}{Kurt(S_r(i)) + 3}, \qquad TC_d(i) = \frac{std(S_d(i))}{Kurt(S_d(i)) + 3},$$
$$w(i) = \exp\bigl(0.05\cdot abs(TC_d(i) - TC_r(i))\bigr),$$
$$SPSIM = \frac{\sum_{i=1}^{N} M(i)\,w(i)}{\sum_{i=1}^{N} w(i)},$$
where $S_r(i)$ and $S_d(i)$ are, respectively, the superpixels in the reference and distorted images that contain the $i$-th pixel.
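The way superpixel averages and per-pixel gradients are combined in SPSIM can be sketched in Python. The sketch is a simplified illustration and not the implementation from [14]: the superpixel label map, the gradient magnitudes, and the texture-complexity weights are assumed to be precomputed and supplied by the caller, every superpixel label is assumed to occur at least once, and all names and parameter values are hypothetical.

```python
import math


def superpixel_means(values, labels, n_superpixels):
    """Mean of `values` inside each superpixel; labels[j] is the superpixel of pixel j."""
    sums = [0.0] * n_superpixels
    counts = [0] * n_superpixels
    for v, s in zip(values, labels):
        sums[s] += v
        counts[s] += 1
    return [t / c for t, c in zip(sums, counts)]


def similarity(a, b, t):
    return (2.0 * a * b + t) / (a * a + b * b + t)


def spsim(ref, dist, labels, n_superpixels, weights, t1, t2, alpha, beta):
    """Weighted average of the per-pixel similarity M(i).

    ref, dist: dicts with flat per-pixel lists under the keys "Y", "U", "V"
               (color channels) and "G" (gradient magnitude).
    weights:   per-pixel weights w(i), assumed to be precomputed.
    """
    mean_r = {c: superpixel_means(ref[c], labels, n_superpixels) for c in "YUV"}
    mean_d = {c: superpixel_means(dist[c], labels, n_superpixels) for c in "YUV"}
    numerator = denominator = 0.0
    for i, s in enumerate(labels):
        m_l = similarity(mean_r["Y"][s], mean_d["Y"][s], t1)      # luminance
        m_c = (similarity(mean_r["U"][s], mean_d["U"][s], t1)
               * similarity(mean_r["V"][s], mean_d["V"][s], t1))  # chrominance
        m_g = similarity(ref["G"][i], dist["G"][i], t2)           # gradient
        m = m_g * (m_l ** alpha) * math.exp(beta * (m_c - 1.0))
        numerator += m * weights[i]
        denominator += weights[i]
    return numerator / denominator
```

Luminance and chrominance enter through superpixel means, so they are constant within a superpixel, while the gradient term varies from pixel to pixel.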

2.6. Local Global Variation Index LGV and Saliency Weighted Local Global Variation Index SWLGV

Varga [17] introduced new quality indices that utilize both gradients in the image and Grünwald–Letnikov fractional derivatives. While gradients capture local variations within the image, fractional derivatives describe global variations, represented by the following global similarity map:
$$S_G(x,y) = \frac{2\cdot D_{GL}^{\alpha}R(x,y)\cdot D_{GL}^{\alpha}D(x,y) + c_1}{\bigl(D_{GL}^{\alpha}R(x,y)\bigr)^2 + \bigl(D_{GL}^{\alpha}D(x,y)\bigr)^2 + c_1},$$
where $R(x,y)$ is the reference image, $D(x,y)$ is the distorted image, $D_{GL}^{\alpha}$ is the $\alpha$-order Grünwald–Letnikov fractional derivative, and $c_1$ is a constant that provides numerical stability. The order of the fractional derivative was set to $\alpha = 0.6$. The $3 \times 3$ Scharr operator was used to compute the gradients for the local gradient map, $S_L(x,y)$:
$$S_L(x,y) = \frac{2\cdot G_R(x,y)\cdot G_D(x,y) + c_2}{G_R^2(x,y) + G_D^2(x,y) + c_2},$$
where $c_2$ is a constant that ensures numerical stability. The similarity map between the two compared images was calculated using the two previously defined maps and the exponential coefficient $\lambda = 0.7$:
$$S(x,y) = \bigl(S_G(x,y)\bigr)^{\lambda}\cdot\bigl(S_L(x,y)\bigr)^{1-\lambda}.$$
Finally, the similarity map obtained is averaged over all pixels. This index is referred to as the Local Global Variation (LGV):
$$LGV = \frac{1}{M\cdot N}\sum_{x=1}^{M}\sum_{y=1}^{N} S(x,y),$$
where $M \cdot N$ is the resolution of the images.
The SWLGV index, in contrast to LGV, also incorporates the mechanism of visual saliency. It emphasizes the differences between the reference and distorted images in the most distinctive regions. Denoting the saliency maps as $SM_R(x,y)$ for the reference image and $SM_D(x,y)$ for the distorted image, a joint map for the image pair can be defined:
$$SM(x,y) = \max\bigl(SM_R(x,y),\ SM_D(x,y)\bigr),$$
where $SM_R(x,y)$ and $SM_D(x,y)$ are visual saliency maps built as proposed in [18]. The SWLGV index is defined as the weighted average of $S(x,y)$, where $SM(x,y)$ represents the weights:
$$SWLGV = \frac{\sum_{x=1}^{M}\sum_{y=1}^{N} SM(x,y)\cdot S(x,y)}{\sum_{x=1}^{M}\sum_{y=1}^{N} SM(x,y)}.$$
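The pooling stage of LGV and SWLGV, together with the standard weights of a Grünwald–Letnikov difference, can be sketched in Python as follows. This is an illustrative sketch only: how [17] applies the fractional derivative to a two-dimensional image is not described here, so only the textbook coefficient recurrence $w_0 = 1$, $w_k = w_{k-1}\,(1 - (\alpha + 1)/k)$ is shown. The similarity and saliency maps are assumed to be precomputed flat lists with non-negative values, and the function names are hypothetical.

```python
def gl_coefficients(alpha, n):
    """First n Grünwald–Letnikov weights w_k = (-1)^k * binom(alpha, k)."""
    w = [1.0]
    for k in range(1, n):
        w.append(w[-1] * (1.0 - (alpha + 1.0) / k))
    return w


def combined_map(s_global, s_local, lam=0.7):
    """Pixel-wise S = S_G^lam * S_L^(1 - lam); inputs assumed non-negative."""
    return [(g ** lam) * (l ** (1.0 - lam)) for g, l in zip(s_global, s_local)]


def lgv(s_global, s_local, lam=0.7):
    """Plain average of the combined similarity map."""
    s = combined_map(s_global, s_local, lam)
    return sum(s) / len(s)


def swlgv(s_global, s_local, sm_ref, sm_dist, lam=0.7):
    """Average of the combined map weighted by SM = max(SM_R, SM_D)."""
    s = combined_map(s_global, s_local, lam)
    sm = [max(a, b) for a, b in zip(sm_ref, sm_dist)]
    return sum(w * v for w, v in zip(sm, s)) / sum(sm)
```

With a uniform saliency map the two indices coincide, which makes the role of the saliency weights easy to check.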

2.7. Gradient Magnitude Similarity Deviation Index GMSD

GMSD [19] is a relatively simple metric, which is based on a gradient similarity map and uses a $3 \times 3$ Prewitt filter. The magnitudes of the gradients of images $r$ and $d$ at position $i$, denoted by $m_r(i)$ and $m_d(i)$, are calculated as follows:
$$m_r(i) = \sqrt{(r \otimes h_x)^2(i) + (r \otimes h_y)^2(i)},$$
$$m_d(i) = \sqrt{(d \otimes h_x)^2(i) + (d \otimes h_y)^2(i)},$$
where $h_x$ and $h_y$ are the horizontal and vertical Prewitt filters and $\otimes$ denotes a convolution operation. The gradient magnitude similarity map, $GMS(i)$, is then calculated as follows:
$$GMS(i) = \frac{2\,m_r(i)\,m_d(i) + c}{m_r^2(i) + m_d^2(i) + c},$$
where $c$ is a constant that provides numerical stability. The formulas above treat the reference and distorted images symmetrically. The mean of the $GMS(i)$ map is then determined as:
$$GMSM = \frac{1}{N}\sum_{i=1}^{N} GMS(i),$$
where $N$ is the number of pixels in the image. Finally, the GMSD index is defined as the standard deviation of the map:
$$GMSD = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\bigl(GMS(i) - GMSM\bigr)^2}.$$
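Since GMSD consists of only a few steps, the whole computation fits in a short Python sketch. This is a plain-Python illustration for small grayscale images stored as lists of rows, not the implementation from [19]. Several details are assumptions made here: the Prewitt kernels are scaled by 1/3, only interior pixels are evaluated, the default value of the constant `c` is a placeholder, and any preprocessing used in the original work is omitted.

```python
import math

# 3x3 Prewitt kernels (the 1/3 scaling is an assumption of this sketch).
PREWITT_X = [[1 / 3, 0.0, -1 / 3]] * 3
PREWITT_Y = [[1 / 3] * 3, [0.0] * 3, [-1 / 3] * 3]


def gradient_magnitude(img):
    """Gradient magnitudes at interior pixels, returned as a flat list.

    The kernels are applied without flipping; flipping only changes the sign
    of the responses, so the magnitude is the same as for a true convolution.
    """
    height, width = len(img), len(img[0])
    out = []
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    v = img[y + dy][x + dx]
                    gx += PREWITT_X[dy + 1][dx + 1] * v
                    gy += PREWITT_Y[dy + 1][dx + 1] * v
            out.append(math.sqrt(gx * gx + gy * gy))
    return out


def gmsd(ref, dist, c=170.0):
    """Standard deviation of the gradient magnitude similarity map."""
    m_r = gradient_magnitude(ref)
    m_d = gradient_magnitude(dist)
    gms = [(2.0 * a * b + c) / (a * a + b * b + c) for a, b in zip(m_r, m_d)]
    mean = sum(gms) / len(gms)
    return math.sqrt(sum((g - mean) ** 2 for g in gms) / len(gms))
```

Unlike the similarity indices above, GMSD is a distortion measure: identical images give 0, and larger values indicate stronger degradation.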

2.8. Evaluation Criteria for IQA

Individual IQA metrics are commonly compared with the subjective ratings of specific images. To assess the linearity, monotonicity, and accuracy of these predictions, four criteria are used: the Pearson Linear Correlation Coefficient (PLCC), the Spearman Rank Order Correlation Coefficient (SROCC), the Kendall Rank Order Correlation Coefficient (KROCC), and the Root Mean Squared Error (RMSE). The formulas for these comparisons are provided below:
$$PLCC = \frac{\sum_{i=1}^{N}(p_i - \bar{p})(s_i - \bar{s})}{\sqrt{\sum_{i=1}^{N}(p_i - \bar{p})^2\,\sum_{i=1}^{N}(s_i - \bar{s})^2}},$$
where $p_i$ and $s_i$ represent the raw values of the subjective and objective measures, respectively, and $\bar{p}$ and $\bar{s}$ are the mean values of the subjective and objective measures.
The Spearman Rank Order Correlation Coefficient (SROCC) is given by the formula:
$$SROCC = 1 - \frac{6\sum_{i=1}^{N} d_i^2}{N(N^2 - 1)},$$
where $d_i$ represents the difference between the ranks of both measures for the $i$-th observation and $N$ is the total number of observations.
The Kendall Rank Order Correlation Coefficient (KROCC) is given by the formula:
$$KROCC = \frac{N_c - N_d}{0.5\,(N - 1)\,N},$$
where $N_c$ and $N_d$ denote the counts of concordant and discordant pairs.
The Root Mean Squared Error (RMSE) is given by the following equation:
$$RMSE = \sqrt{\frac{1}{N}\sum_{i=1}^{N}(p_i - s_i)^2},$$
where $p_i$ and $s_i$ are defined as above.
The above correlation coefficients are useful tools for objectively assessing the agreement between IQA computational models and subjective MOS assessments. However, they capture specific aspects of this relationship, such as linearity in the case of PLCC or monotonicity in the cases of SROCC and KROCC. SROCC and KROCC are suitable for scenarios where the relationship between variables is nonlinear, with KROCC offering high robustness to small changes in the data. RMSE, on the other hand, is a measure of error primarily used to evaluate the accuracy of a model’s predictions. Unlike PLCC, SROCC, and KROCC, it is not a measure of correlation, and its role is fundamentally different. RMSE quantifies the average distance between predicted and actual values. It is sensitive to the magnitude of errors because the differences are squared before averaging, meaning that larger deviations have a disproportionate impact on the final value. High RMSE values indicate poor agreement between predicted and actual values, but RMSE does not provide information about the type of relationship (e.g., whether it is monotonic, linear, or otherwise).
As recommended in [20], a nonlinear mapping was applied before calculating PLCC and RMSE. This process involves the use of a fitting function, usually a logistic function with five parameters, $\beta_1, \beta_2, \beta_3, \beta_4, \beta_5$, to better represent the relationship between the metric score, $x$, and MOS:
$$p(x, \beta) = \beta_1\left(\frac{1}{2} - \frac{1}{1 + \exp\bigl(\beta_2\,(x - \beta_3)\bigr)}\right) + \beta_4\,x + \beta_5.$$
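The four criteria and the five-parameter logistic function translate directly into code. The Python sketch below follows the formulas of this section literally; it is meant as an illustration, and the function names are chosen here. The SROCC formula is used in its simple form, so the sketch assumes that neither list contains tied values, and fitting the $\beta$ parameters to data is left out.

```python
import math


def plcc(p, s):
    mp, ms = sum(p) / len(p), sum(s) / len(s)
    cov = sum((a - mp) * (b - ms) for a, b in zip(p, s))
    var_p = sum((a - mp) ** 2 for a in p)
    var_s = sum((b - ms) ** 2 for b in s)
    return cov / math.sqrt(var_p * var_s)


def ranks(v):
    """Ranks 1..N of the values in v (assumes no ties)."""
    order = sorted(range(len(v)), key=lambda i: v[i])
    r = [0] * len(v)
    for rank, i in enumerate(order, start=1):
        r[i] = rank
    return r


def srocc(p, s):
    n = len(p)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(p), ranks(s)))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


def krocc(p, s):
    n = len(p)
    concordant = discordant = 0
    for i in range(n):
        for j in range(i + 1, n):
            sign = (p[i] - p[j]) * (s[i] - s[j])
            if sign > 0:
                concordant += 1
            elif sign < 0:
                discordant += 1
    return (concordant - discordant) / (0.5 * (n - 1) * n)


def rmse(p, s):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, s)) / len(p))


def logistic5(x, b1, b2, b3, b4, b5):
    """Five-parameter logistic mapping applied to a metric score x."""
    return b1 * (0.5 - 1.0 / (1.0 + math.exp(b2 * (x - b3)))) + b4 * x + b5
```

For the toy lists `p = [1, 2, 3, 4, 5]` and `s = [1, 3, 2, 5, 4]`, two of the ten pairs are discordant, so `srocc(p, s)` gives 0.8 and `krocc(p, s)` gives 0.6.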

3. The New Combined Metric (NCM) and Its Experimental Research

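The New Combined Metric (NCM) is defined as a weighted sum of three component metrics:
$$NCM = \alpha\cdot M_1 + \beta\cdot M_2 + \gamma\cdot M_3, \tag{61}$$
where $M_1$, $M_2$, and $M_3$ are the FR-IQA metrics selected for a given dataset and $\alpha$, $\beta$, and $\gamma$ are the optimized weights.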

3.1. Selected IQA Databases

Four benchmark databases, TID2008 [21], TID2013 [22], KADID-10k [23], and PIPAL [24], were chosen for the research. These databases are distinguished by a large set of reference images, diverse distortion types, and varying levels of their presence in the images. For each image in the databases, Mean Opinion Scores (MOSs) are experimentally gathered by collecting assessments from multiple human observers.
The TID2008 image database consists of 1700 distorted images, generated using 17 different distortion types, each applied at four levels to 25 reference images (Figure 1). MOS was provided based on the work of 838 human observers and 256,428 comparisons. All images have a resolution of 512 × 384 pixels.
The TID2013 image database is an updated and expanded version of TID2008. It retains the same set of reference images (Figure 1), but the number of distortion types has been increased to 24, and the distortion levels have been raised to five. The database includes 3000 distorted digital images. Additionally, the size of the research group from which the average subjective ratings were derived has been enlarged. MOS ratings were collected from 524,340 comparisons made by 971 observers. The image resolution remains unchanged.
Online crowdsourcing for image assessment has enabled the creation of larger databases. One such large database, KADID-10k (Konstanz Artificially Distorted Image Quality Database) [23], contains 10,125 digital images with subjective quality scores (MOSs), which were collected from 2209 crowd workers. This database includes a limited selection of reference images (81) (Figure 2), a restricted number of artificial distortion types (25), and five levels for each distortion type. Recently, KADID-10k has become widely used for training and testing deep-learning models for image quality assessment [25]. The artificial distortions present in the KADID-10k database include spatial distortions, noise, blurs, and more. The image resolution is the same as in the TID databases.
PIPAL is a large IQA dataset, first introduced in 2020 [24]. It contains 250 reference images, which are 288 × 288 fragments of images from the DIV2K and Flickr2K high-resolution image collections (Figure 3), 40 distortion types, 29,000 distorted images, and 1,130,000 human ratings. In this image database, the Elo rating system was used to assign the Mean Opinion Scores (MOSs). Currently, the PIPAL dataset is used in many challenges as a benchmark for IQA algorithms.
The key information regarding the selected IQA benchmark databases is presented in Table 1.

3.2. Experimental Tests

The experimental study began with the determination of the PLCC, SROCC, and KROCC correlation coefficients, and RMSE values for eight selected highly correlated metrics. The results of these tests for the four study datasets are included in Table 2. The three highest correlation coefficients and the three lowest RMSE values are shown in different colors (the best results in red, the second results in green, and the third results in blue).
The three correlation coefficients and the RMSE value were aggregated into one score. A point scale with values from 1 to 8 was adopted, where the highest points were awarded to the highest correlation coefficients and the lowest RMSE values. This ranking is shown in Table 3, where the point values for each dataset are also summarized in the columns. The number of points determined the three component metrics for each dataset. The three highest scores are highlighted in bold.
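The point-based ranking described above can be sketched in Python. This is an assumed reading of the procedure rather than the authors' code: for each criterion the best metric receives the maximum number of points and the worst receives 1, lower RMSE counts as better, ties are not handled, and the function names are hypothetical. The numbers in the usage note are made up for illustration and are not results from the tables.

```python
def rank_points(values, higher_is_better=True):
    """Assign 1..n points to the values; the best value receives n points."""
    order = sorted(range(len(values)), key=lambda i: values[i],
                   reverse=not higher_is_better)
    points = [0] * len(values)
    for pts, i in enumerate(order, start=1):
        points[i] = pts
    return points


def total_points(scores):
    """Sum the points over PLCC, SROCC, KROCC (higher is better) and RMSE (lower is better).

    scores: dict mapping a metric name to a (plcc, srocc, krocc, rmse) tuple.
    """
    names = list(scores)
    totals = {name: 0 for name in names}
    for column, higher in enumerate((True, True, True, False)):
        column_values = [scores[name][column] for name in names]
        for name, pts in zip(names, rank_points(column_values, higher)):
            totals[name] += pts
    return totals


def top_metrics(scores, k=3):
    """Names of the k metrics with the highest point totals."""
    totals = total_points(scores)
    return sorted(totals, key=totals.get, reverse=True)[:k]
```

With the made-up input `{"A": (0.90, 0.88, 0.70, 0.60), "B": (0.85, 0.89, 0.68, 0.65), "C": (0.80, 0.80, 0.60, 0.70)}`, metric A collects 11 points, B collects 9, and C collects 4.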
The three metrics selected from the table served as the components $M_1$, $M_2$, and $M_3$ of the linear combination that determines the New Combined Metric, as defined in Formula (61).
Determining the NCM value requires calculating the weights $\alpha$, $\beta$, and $\gamma$ present in this formula. These weights are optimized in the MATLAB environment using the fmincon function. In the optimization task, the PLCC linear correlation coefficient is maximized. The obtained values of the weights for each dataset are given in Table 4. Based on these weights and component metrics, the values of the combined NCM metric were determined. The results are shown in Table 5. The results for the three component metrics are shown in red, while green is used for the best score achieved by the combined NCM metric in each case.
To illustrate the quality of the proposed NCM index, scatter plots of the eight selected metrics and the NCM for the tested databases are shown in Figures 4–7. The scatter plots and their fitted curves show that the proposed combined NCM metric closely matches the MOS values for each of the databases.
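The weight search can be sketched without MATLAB. In the example below a coarse grid search over the weights stands in for fmincon; it is a different, much simpler technique and not the authors' optimizer. The constraints (non-negative weights that sum to one) are an assumption suggested by the values in Table 4, the data are synthetic, and all three components are assumed to increase with quality.

```python
import math
import random


def pearson(x, y):
    """Pearson linear correlation coefficient (PLCC)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def combine(weights, components):
    """Weighted sum of the component scores, one value per image."""
    n_images = len(components[0])
    return [sum(w * comp[i] for w, comp in zip(weights, components))
            for i in range(n_images)]


def best_weights(components, mos, steps=20):
    """Grid search over (alpha, beta, gamma) >= 0 with alpha + beta + gamma = 1."""
    best_w, best_plcc = None, -2.0
    for i in range(steps + 1):
        for j in range(steps + 1 - i):
            k = steps - i - j
            weights = (i / steps, j / steps, k / steps)
            plcc = pearson(combine(weights, components), mos)
            if plcc > best_plcc:
                best_w, best_plcc = weights, plcc
    return best_w, best_plcc


# Synthetic stand-in data: three noisy predictors of a hypothetical MOS vector.
rng = random.Random(0)
mos = [rng.uniform(1.0, 5.0) for _ in range(200)]
components = [[m + rng.gauss(0.0, sigma) for m in mos] for sigma in (0.5, 0.7, 0.9)]

weights, plcc = best_weights(components, mos)
print(weights, plcc)
```

Because the grid contains the three corner points (1, 0, 0), (0, 1, 0), and (0, 0, 1), the combined PLCC found this way can never be lower than the PLCC of any single component on the same data.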
A study of computation times for the considered FR-IQA metrics was also conducted. The average computation times for each of the databases are shown in Table 6. The three fastest metrics are highlighted in bold. The NCM computation time, which is not included in Table 6, is approximately equal to the sum of the computation times of its component metrics.
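A per-image average time of this kind can be measured as sketched below. The stand-in metric `mse`, the synthetic pixel lists, and the helper name `average_time` are illustrative assumptions; the measurements reported in Table 6 were obtained in MATLAB, not with this code.

```python
import time


def average_time(metric, image_pairs):
    """Mean wall-clock time in seconds of one metric call per image pair."""
    start = time.perf_counter()
    for reference, distorted in image_pairs:
        metric(reference, distorted)
    return (time.perf_counter() - start) / len(image_pairs)


def mse(reference, distorted):
    """Hypothetical stand-in metric operating on flat lists of pixel values."""
    return sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)


# Synthetic "images": 20 pairs of 10,000-pixel lists.
pairs = [([i % 256 for i in range(10_000)], [(i * 7) % 256 for i in range(10_000)])
         for _ in range(20)]

single_time = average_time(mse, pairs)
# The combined metric evaluates all three components, so its cost is roughly
# the sum of the component times (the weighted sum itself is negligible).
combined_estimate = sum(average_time(metric, pairs) for metric in (mse, mse, mse))
print(single_time, combined_estimate)
```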
For the three fastest FR-IQA metrics highlighted in red (see Table 7), a linear combination was formed by redetermining the optimal weights $\alpha$, $\beta$, and $\gamma$ (see Table 4) from the perspective of PLCC maximization. The resulting NCM metric using the three fastest metrics achieved the best performance, as marked in green in Table 7.
The study was conducted in the MATLAB R2024a programming environment on a computer with the specifications provided in Table 8.
The best results obtained using the NCM were additionally compared with those of other combined metrics presented in the literature [26,27]. The comparison is presented in Table 9, where bold is used for highlighting. The authors of [26] used combined metrics (MFMOGP3, MFMOGP4) based on an additive combination of 8 to 10 component metrics. In [27], combined metrics (OFIQA) in product form, involving between 4 and 17 factor metrics, were proposed. For Table 9, we selected the best results from both of these works. For the TID2008 database, the results are comparable, while for the TID2013 database the proposed NCM index achieves the highest correlation coefficients. The comparison was limited to the TID2008 and TID2013 databases because neither of the earlier works on combined metrics considered the newer databases (KADID-10k, PIPAL).

4. Conclusions

From the existing literature on FR-IQA metrics, it is evident that no single metric significantly outperforms the others. Therefore, the idea of creating a linear or nonlinear combination of several top metrics has emerged. The proposed approach uses an additive combination of the three metrics with the highest correlation coefficients and the lowest RMSE. The component metrics of the resulting combined NCM metric depend on the selected database. NCM achieved the best results among all tested metrics across all tested databases. In addition, a case was examined where the three fastest metrics, i.e., MDSI, HPSI, and GMSD, were selected as components. The combined metric obtained in this case also achieved the best results compared to all the tested metrics. Potential extensions of the proposed approach include, among others, replacing the linear combination of metrics with a nonlinear one and exploring alternative methods for optimizing the weights.

Author Contributions

Conceptualization, M.F. and H.P.; methodology, M.F.; software, Ł.M.; validation, M.F. and H.P.; investigation, M.F. and Ł.M.; resources, M.F.; data curation, M.F.; writing—original draft preparation, M.F.; writing—review and editing, M.F.; visualization, M.F.; supervision, H.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Polish Ministry for Science and Education under internal grant 02/070/BK_24/0055 for the Department of Data Science and Engineering, Silesian University of Technology, Gliwice, Poland.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
IQA: Image Quality Assessment
FR-IQA: Full-Reference Image Quality Assessment
MOS: Mean Opinion Score
FSIMc: Feature SIMilarity (color version)
MDSI: Mean Deviation Similarity Index
HPSI: Haar wavelet Perceptual Similarity Index
VCGS: Visual saliency with Color appearance and Gradient Similarity
SPSIM: SuperPixel SIMilarity
LGV: Local Global Variation
SWLGV: Saliency Weighted Local Global Variation
GMSD: Gradient Magnitude Similarity Deviation
NCM: New Combined Metric
PSNR: Peak Signal-to-Noise Ratio
VSNR: Visual Signal-to-Noise Ratio
VIF: Visual Information Fidelity index
MS-SSIM: Multi Scale Structural SIMilarity index
PLCC: Pearson Linear Correlation Coefficient
SROCC: Spearman Rank Order Correlation Coefficient
KROCC: Kendall Rank Order Correlation Coefficient
RMSE: Root Mean Squared Error
SVD: Singular Value Decomposition
HVS: Human Visual System
DMOS: Differential Mean Opinion Score
TID: Tampere Image Database
KADID-10k: Konstanz Artificially Distorted Image quality Database
PIPAL: Perceptual Image Processing ALgorithms database
WFSIMc: Weighted FSIM (color version) index
RFSIM: Riesz-transform-based Feature SIMilarity index
CQM: Combined Quality Metric
CISI: Combined Image Similarity Index
LCSIM: Linearly Combined Similarity Measures

References

  1. Liu, M.; Yang, X. A new image quality approach based on decision fusion. In Proceedings of the 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery, Jinan, China, 18–20 October 2008; Volume 4, pp. 10–14.
  2. Okarma, K. Combined full-reference image quality metric linearly correlated with subjective assessment. In Proceedings of the Artificial Intelligence and Soft Computing: 10th International Conference, ICAISC 2010, Zakopane, Poland, 13–17 June 2010; Part I 10. Springer: Berlin/Heidelberg, Germany, 2010; pp. 539–546.
  3. Okarma, K. Combined image similarity index. Opt. Rev. 2012, 19, 349–354.
  4. Okarma, K. Extended hybrid image similarity–combined full-reference image quality metric linearly correlated with subjective scores. Elektron. Ir Elektrotech. 2013, 19, 129–132.
  5. Oszust, M. Full-reference image quality assessment with linear combination of genetically selected quality measures. PLoS ONE 2016, 11, e0158333.
  6. Lukin, V.; Ponomarenko, N.; Ieremeiev, O.; Egiazarian, K.; Astola, J. Combining full-reference image visual quality metrics by neural network. In Proceedings of the Human Vision and Electronic Imaging XX, San Francisco, CA, USA, 9–12 February 2015; Volume 9394, pp. 172–183.
  7. Bosse, S.; Maniry, D.; Müller, K.R.; Wiegand, T.; Samek, W. Deep neural networks for no-reference and full-reference image quality assessment. IEEE Trans. Image Process. 2017, 27, 206–219.
  8. Varga, D. A combined full-reference image quality assessment method based on convolutional activation maps. Algorithms 2020, 13, 313.
  9. Zhang, L.; Zhang, L.; Mou, X.; Zhang, D. FSIM: A feature similarity index for image quality assessment. IEEE Trans. Image Process. 2011, 20, 2378–2386.
  10. Kovesi, P. Image features from phase congruency. Videre J. Comput. Vis. Res. 1999, 1, 1–26.
  11. Nafchi, H.Z.; Shahkolaei, A.; Hedjam, R.; Cheriet, M. Mean deviation similarity index: Efficient and reliable full-reference image quality evaluator. IEEE Access 2016, 4, 5579–5590.
  12. Reisenhofer, R.; Bosse, S.; Kutyniok, G.; Wiegand, T. A Haar wavelet-based perceptual similarity index for image quality assessment. Signal Process. Image Commun. 2018, 61, 33–43.
  13. Shi, C.; Lin, Y. Full Reference Image Quality Assessment Based on Visual Salience with Color Appearance and Gradient Similarity. IEEE Access 2020, 8, 97310–97320.
  14. Sun, W.; Liao, Q.; Xue, J.H.; Zhou, F. SPSIM: A superpixel-based similarity index for full-reference image quality assessment. IEEE Trans. Image Process. 2018, 27, 4232–4244.
  15. Stutz, D.; Hermans, A.; Leibe, B. Superpixels: An evaluation of the state-of-the-art. Comput. Vis. Image Underst. 2018, 166, 1–27.
  16. Achanta, R.; Shaji, A.; Smith, K.; Lucchi, A.; Fua, P.; Süsstrunk, S. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 2274–2282.
  17. Varga, D. Full-Reference Image Quality Assessment Based on Grünwald–Letnikov Derivative, Image Gradients, and Visual Saliency. Electronics 2022, 11, 559.
  18. Imamoglu, N.; Lin, W.; Fang, Y. A saliency detection model using low-level features based on wavelet transform. IEEE Trans. Multimed. 2012, 15, 96–105.
  19. Xue, W.; Zhang, L.; Mou, X.; Bovik, A.C. Gradient magnitude similarity deviation: A highly efficient perceptual image quality index. IEEE Trans. Image Process. 2013, 23, 684–695.
  20. Sheikh, H.R.; Sabir, M.F.; Bovik, A.C. A statistical evaluation of recent full reference image quality assessment algorithms. IEEE Trans. Image Process. 2006, 15, 3440–3451.
  21. Ponomarenko, N.; Lukin, V.; Zelensky, A.; Egiazarian, K.; Carli, M.; Battisti, F. TID2008 - A database for evaluation of full-reference visual quality assessment metrics. Adv. Mod. Radioelectron. 2009, 10, 30–45.
  22. Ponomarenko, N.; Jin, L.; Ieremeiev, O.; Lukin, V.; Egiazarian, K.; Astola, J.; Vozel, B.; Chehdi, K.; Carli, M.; Battisti, P.; et al. Image database TID2013: Peculiarities, results and perspectives. Signal Process. Image Commun. 2015, 30, 57–77.
  23. Lin, H.; Hosu, V.; Saupe, D. KADID-10k: A large-scale artificially distorted IQA database. In Proceedings of the 2019 Eleventh International Conference on Quality of Multimedia Experience (QoMEX), Berlin, Germany, 5–7 June 2019; pp. 1–3.
  24. Gu, J.; Cai, H.; Chen, H.; Ye, X.; Ren, J.; Dong, C. PIPAL: A large-scale image quality assessment dataset for perceptual image restoration. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2020; pp. 633–651.
  25. Varga, D. Composition-preserving deep approach to full-reference image quality assessment. Signal Image Video Process. 2020, 14, 1265–1272.
  26. Merzougui, N.; Djerou, L. Multi-measures fusion based on multi-objective genetic programming, for full-reference image quality assessment. arXiv 2017, arXiv:1801.06030.
  27. Varga, D. An optimization-based family of predictive, fusion-based models for full-reference image quality assessment. J. Imaging 2023, 9, 116.
Figure 1. Reference images of the TID2008 and TID2013 databases [21].
Figure 2. Reference images of the KADID-10k database [23].
Figure 3. Examples of reference images from the PIPAL database [24].
Figure 4. Scatter plots of subjective MOS against IQA metrics obtained from the TID2008 database.
Figure 5. Scatter plots of subjective MOS against IQA metrics obtained from the TID2013 database.
Figure 6. Scatter plots of subjective MOS against IQA metrics obtained from the KADID-10k database.
Figure 7. Scatter plots of subjective MOS against IQA metrics obtained from the PIPAL database.
Table 1. Comparison of the selected IQA databases.
| Database | Year | No. of Ref. Images | No. of Dist. Types | Environment | No. of Dist. Images |
|---|---|---|---|---|---|
| TID2008 | 2008 | 25 | 17 | lab | 1700 |
| TID2013 | 2013 | 25 | 24 | lab | 3000 |
| KADID-10k | 2019 | 81 | 25 | crowdsourcing | 10,125 |
| PIPAL | 2020 | 250 | 40 | crowdsourcing | 29,000 |
Table 2. Values of correlation coefficients and RMSE for FR-IQA metrics.
| Database | Metric | FSIMc [9] | MDSI [11] | HPSI [12] | VCGS [13] | SPSIM [14] | LGV [17] | SWLGV [17] | GMSD [19] |
|---|---|---|---|---|---|---|---|---|---|
| TID2008 | PLCC | 0.876 | 0.916 | 0.907 | 0.878 | 0.893 | 0.865 | 0.874 | 0.879 |
| | SROCC | 0.884 | 0.921 | 0.910 | 0.897 | 0.910 | 0.881 | 0.889 | 0.891 |
| | KROCC | 0.699 | 0.751 | 0.737 | 0.717 | 0.730 | 0.696 | 0.711 | 0.709 |
| | RMSE | 0.647 | 0.538 | 0.566 | 0.643 | 0.605 | 0.674 | 0.652 | 0.640 |
| TID2013 | PLCC | 0.877 | 0.909 | 0.893 | 0.900 | 0.909 | 0.778 | 0.797 | 0.855 |
| | SROCC | 0.851 | 0.890 | 0.873 | 0.893 | 0.904 | 0.807 | 0.807 | 0.804 |
| | KROCC | 0.667 | 0.712 | 0.692 | 0.717 | 0.725 | 0.638 | 0.641 | 0.634 |
| | RMSE | 0.596 | 0.518 | 0.557 | 0.541 | 0.517 | 0.779 | 0.749 | 0.642 |
| KADID-10k | PLCC | 0.851 | 0.864 | 0.885 | 0.868 | 0.874 | 0.815 | 0.835 | 0.805 |
| | SROCC | 0.854 | 0.885 | 0.885 | 0.871 | 0.874 | 0.820 | 0.840 | 0.847 |
| | KROCC | 0.665 | 0.702 | 0.699 | 0.683 | 0.687 | 0.630 | 0.655 | 0.664 |
| | RMSE | 0.568 | 0.544 | 0.505 | 0.538 | 0.525 | 0.627 | 0.595 | 0.643 |
| PIPAL | PLCC | 0.615 | 0.598 | 0.641 | 0.554 | 0.578 | 0.529 | 0.543 | 0.629 |
| | SROCC | 0.589 | 0.585 | 0.589 | 0.534 | 0.562 | 0.519 | 0.536 | 0.583 |
| | KROCC | 0.416 | 0.408 | 0.417 | 0.370 | 0.391 | 0.359 | 0.372 | 0.414 |
| | RMSE | 0.104 | 0.106 | 0.101 | 0.110 | 0.108 | 0.112 | 0.111 | 0.103 |
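Table 3. Ranking of FR-IQA metrics.
| Database | Metric | FSIMc [9] | MDSI [11] | HPSI [12] | VCGS [13] | SPSIM [14] | LGV [17] | SWLGV [17] | GMSD [19] |
|---|---|---|---|---|---|---|---|---|---|
| TID2008 | PLCC | 3 | 8 | 7 | 4 | 6 | 1 | 2 | 5 |
| | SROCC | 2 | 8 | 6 | 5 | 7 | 1 | 3 | 4 |
| | KROCC | 2 | 8 | 7 | 5 | 6 | 1 | 4 | 3 |
| | RMSE | 3 | 8 | 7 | 4 | 6 | 1 | 2 | 5 |
| | TOTAL | 10 | **32** | **27** | 18 | **25** | 4 | 11 | 17 |
| TID2013 | PLCC | 4 | 7 | 5 | 6 | 8 | 1 | 2 | 3 |
| | SROCC | 4 | 6 | 5 | 7 | 8 | 3 | 2 | 1 |
| | KROCC | 4 | 6 | 5 | 7 | 8 | 2 | 3 | 1 |
| | RMSE | 4 | 7 | 5 | 6 | 8 | 1 | 2 | 3 |
| | TOTAL | 16 | **26** | 20 | **26** | **32** | 7 | 9 | 8 |
| KADID-10k | PLCC | 4 | 5 | 8 | 6 | 7 | 2 | 3 | 1 |
| | SROCC | 4 | 8 | 7 | 5 | 6 | 1 | 2 | 3 |
| | KROCC | 4 | 8 | 7 | 5 | 6 | 1 | 2 | 3 |
| | RMSE | 4 | 5 | 8 | 6 | 7 | 2 | 3 | 1 |
| | TOTAL | 16 | **26** | **30** | 22 | **26** | 6 | 10 | 8 |
| PIPAL | PLCC | 6 | 5 | 8 | 3 | 4 | 1 | 2 | 7 |
| | SROCC | 8 | 6 | 7 | 2 | 4 | 1 | 3 | 5 |
| | KROCC | 7 | 5 | 8 | 2 | 4 | 1 | 3 | 6 |
| | RMSE | 6 | 5 | 8 | 3 | 4 | 1 | 2 | 7 |
| | TOTAL | **27** | 21 | **31** | 10 | 16 | 4 | 10 | **25** |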
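Table 4. Optimized weight values for the NCM metric.
| Database | Weight | Value (three best) | Metric (three best) | Value (three fastest) | Metric (three fastest) |
|---|---|---|---|---|---|
| TID2008 | $\alpha$ | 0.578 | MDSI [11] | 0.618 | MDSI [11] |
| | $\beta$ | 0.285 | HPSI [12] | 0.372 | HPSI [12] |
| | $\gamma$ | 0.136 | SPSIM [14] | 0.010 | GMSD [19] |
| TID2013 | $\alpha$ | 0.459 | MDSI [11] | 0.680 | MDSI [11] |
| | $\beta$ | 0.086 | VCGS [13] | 0.310 | HPSI [12] |
| | $\gamma$ | 0.455 | SPSIM [14] | 0.010 | GMSD [19] |
| KADID-10k | $\alpha$ | 0.386 | MDSI [11] | 0.342 | MDSI [11] |
| | $\beta$ | 0.404 | HPSI [12] | 0.486 | HPSI [12] |
| | $\gamma$ | 0.210 | SPSIM [14] | 0.172 | GMSD [19] |
| PIPAL | $\alpha$ | 0.320 | FSIMc [9] | 0.317 | MDSI [11] |
| | $\beta$ | 0.507 | HPSI [12] | 0.584 | HPSI [12] |
| | $\gamma$ | 0.174 | GMSD [19] | 0.099 | GMSD [19] |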
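Table 5. Results of NCM metric for the three best component metrics.
| Db. | Met. | FSIMc [9] | MDSI [11] | HPSI [12] | VCGS [13] | SPSIM [14] | LGV [17] | SWLGV [17] | GMSD [19] | NCM | Components $M_1$, $M_2$, $M_3$ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| TID2008 | PLCC | 0.876 | 0.916 | 0.907 | 0.878 | 0.893 | 0.865 | 0.874 | 0.879 | 0.924 | SPSIM [14], MDSI [11], HPSI [12] |
| | SROCC | 0.884 | 0.921 | 0.910 | 0.897 | 0.910 | 0.881 | 0.889 | 0.891 | 0.924 | |
| | KROCC | 0.699 | 0.751 | 0.737 | 0.717 | 0.730 | 0.696 | 0.711 | 0.709 | 0.759 | |
| | RMSE | 0.647 | 0.538 | 0.566 | 0.643 | 0.605 | 0.674 | 0.652 | 0.640 | 0.514 | |
| TID2013 | PLCC | 0.877 | 0.909 | 0.893 | 0.900 | 0.909 | 0.778 | 0.797 | 0.855 | 0.922 | SPSIM [14], MDSI [11], VCGS [13] |
| | SROCC | 0.851 | 0.890 | 0.873 | 0.893 | 0.904 | 0.807 | 0.807 | 0.804 | 0.906 | |
| | KROCC | 0.667 | 0.712 | 0.692 | 0.717 | 0.725 | 0.638 | 0.641 | 0.634 | 0.732 | |
| | RMSE | 0.596 | 0.518 | 0.557 | 0.541 | 0.517 | 0.779 | 0.749 | 0.642 | 0.481 | |
| KADID-10k | PLCC | 0.851 | 0.864 | 0.885 | 0.868 | 0.874 | 0.815 | 0.835 | 0.805 | 0.896 | SPSIM [14], MDSI [11], HPSI [12] |
| | SROCC | 0.854 | 0.885 | 0.885 | 0.871 | 0.874 | 0.820 | 0.840 | 0.847 | 0.897 | |
| | KROCC | 0.665 | 0.702 | 0.699 | 0.683 | 0.687 | 0.630 | 0.655 | 0.664 | 0.717 | |
| | RMSE | 0.568 | 0.544 | 0.505 | 0.538 | 0.525 | 0.627 | 0.595 | 0.643 | 0.480 | |
| PIPAL | PLCC | 0.615 | 0.598 | 0.641 | 0.554 | 0.577 | 0.529 | 0.543 | 0.629 | 0.653 | GMSD [19], FSIMc [9], HPSI [12] |
| | SROCC | 0.589 | 0.585 | 0.589 | 0.534 | 0.562 | 0.519 | 0.536 | 0.583 | 0.608 | |
| | KROCC | 0.416 | 0.408 | 0.417 | 0.370 | 0.391 | 0.359 | 0.372 | 0.414 | 0.432 | |
| | RMSE | 0.104 | 0.106 | 0.101 | 0.110 | 0.108 | 0.112 | 0.111 | 0.103 | 0.100 | |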
Table 6. Computation times (s) for IQA metrics.
| Database | FSIMc [9] | MDSI [11] | HPSI [12] | VCGS [13] | SPSIM [14] | LGV [17] | SWLGV [17] | GMSD [19] |
|---|---|---|---|---|---|---|---|---|
| TID2008 | 0.089 | **0.014** | **0.028** | 0.212 | 0.128 | 0.435 | 5.986 | **0.019** |
| TID2013 | 0.104 | **0.016** | **0.037** | 0.214 | 0.113 | 0.339 | 6.130 | **0.019** |
| KADID-10k | 0.136 | **0.026** | **0.046** | 0.299 | 0.143 | 0.451 | 5.890 | **0.028** |
| PIPAL | 0.172 | **0.011** | **0.012** | 0.091 | 0.176 | 0.619 | 2.628 | **0.009** |
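Table 7. Results of the NCM metric for the three fastest component metrics.
| Db. | Met. | FSIMc [9] | MDSI [11] | HPSI [12] | VCGS [13] | SPSIM [14] | LGV [17] | SWLGV [17] | GMSD [19] | NCM | Components $M_1$, $M_2$, $M_3$ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| TID2008 | PLCC | 0.876 | 0.916 | 0.907 | 0.878 | 0.893 | 0.865 | 0.874 | 0.879 | 0.922 | GMSD, MDSI, HPSI |
| | SROCC | 0.884 | 0.921 | 0.910 | 0.897 | 0.910 | 0.881 | 0.889 | 0.891 | 0.923 | |
| | KROCC | 0.699 | 0.751 | 0.737 | 0.717 | 0.730 | 0.696 | 0.711 | 0.709 | 0.757 | |
| | RMSE | 0.647 | 0.538 | 0.566 | 0.643 | 0.605 | 0.674 | 0.652 | 0.640 | 0.519 | |
| TID2013 | PLCC | 0.877 | 0.909 | 0.893 | 0.900 | 0.909 | 0.778 | 0.797 | 0.855 | 0.912 | GMSD, MDSI, HPSI |
| | SROCC | 0.851 | 0.890 | 0.873 | 0.893 | 0.904 | 0.807 | 0.807 | 0.804 | 0.892 | |
| | KROCC | 0.667 | 0.712 | 0.692 | 0.717 | 0.725 | 0.638 | 0.641 | 0.634 | 0.716 | |
| | RMSE | 0.596 | 0.518 | 0.557 | 0.541 | 0.517 | 0.779 | 0.749 | 0.642 | 0.508 | |
| KADID-10k | PLCC | 0.851 | 0.864 | 0.885 | 0.868 | 0.874 | 0.815 | 0.835 | 0.805 | 0.897 | GMSD, MDSI, HPSI |
| | SROCC | 0.854 | 0.885 | 0.885 | 0.871 | 0.874 | 0.820 | 0.840 | 0.847 | 0.897 | |
| | KROCC | 0.665 | 0.702 | 0.699 | 0.683 | 0.687 | 0.630 | 0.655 | 0.664 | 0.719 | |
| | RMSE | 0.568 | 0.544 | 0.505 | 0.538 | 0.525 | 0.627 | 0.595 | 0.643 | 0.479 | |
| PIPAL | PLCC | 0.615 | 0.598 | 0.641 | 0.554 | 0.577 | 0.529 | 0.543 | 0.629 | 0.655 | GMSD, MDSI, HPSI |
| | SROCC | 0.589 | 0.585 | 0.589 | 0.534 | 0.562 | 0.519 | 0.536 | 0.583 | 0.608 | |
| | KROCC | 0.416 | 0.408 | 0.417 | 0.370 | 0.391 | 0.359 | 0.372 | 0.414 | 0.432 | |
| | RMSE | 0.104 | 0.106 | 0.101 | 0.110 | 0.108 | 0.112 | 0.111 | 0.103 | 0.100 | |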
Table 8. Parameters of desktop computer used for experiments.
| Parameter | Value |
|---|---|
| Processor | Intel(R) Core(TM) i5-7400 CPU @ 3.00 GHz (4 cores) |
| RAM | 32 GB |
| OS | Windows 10 |
| Env. | Matlab 2024a |
Table 9. Comparison of the proposed NCM metric with other combined metrics.
| Database | Metric | MFMOGP3 [26] | MFMOGP4 [26] | OFIQA [27] | NCM |
|---|---|---|---|---|---|
| TID2008 | PLCC | 0.925 | 0.902 | 0.910 | 0.922 |
| | SROCC | 0.923 | 0.911 | 0.915 | 0.923 |
| | KROCC | 0.757 | 0.727 | 0.738 | 0.757 |
| | RMSE | 0.511 | 0.580 | 0.557 | 0.519 |
| TID2013 | PLCC | 0.883 | 0.914 | 0.906 | 0.922 |
| | SROCC | 0.868 | 0.902 | 0.890 | 0.923 |
| | KROCC | 0.688 | 0.725 | 0.713 | 0.757 |
| | RMSE | 0.581 | 0.503 | 0.526 | 0.519 |

Share and Cite

MDPI and ACS Style

Frackiewicz, M.; Machalica, Ł.; Palus, H. New Combined Metric for Full-Reference Image Quality Assessment. Symmetry 2024, 16, 1622. https://doi.org/10.3390/sym16121622
