Search Results (141)

Search Parameters:
Keywords = Gabor filtering

21 pages, 14388 KiB  
Article
Adaptive Matching of High-Frequency Infrared Sea Surface Images Using a Phase-Consistency Model
by Xiangyu Li, Jie Chen, Jianwei Li, Zhentao Yu and Yaxun Zhang
Sensors 2025, 25(5), 1607; https://doi.org/10.3390/s25051607 - 6 Mar 2025
Viewed by 193
Abstract
The sea surface displays dynamic characteristics, such as waves and various formations. As a result, images of the sea surface usually have few stable feature points, with a background that is often complex and variable. Moreover, the sea surface undergoes significant changes due to variations in wind speed, lighting conditions, weather, and other environmental factors, resulting in considerable discrepancies between images. These variations present challenges for identification using traditional methods. This paper introduces an algorithm based on the phase-consistency model. We utilize image data collected from a specific maritime area with a high-frame-rate surface array infrared camera. By accurately detecting images with identical names, we focus on the subtle texture information of the sea surface and its rotational invariance, enhancing the accuracy and robustness of the matching algorithm. We begin by constructing a nonlinear scale space using a nonlinear diffusion method. Maximum and minimum moments are generated using an odd symmetric Log–Gabor filter within the two-dimensional phase-consistency model. Next, we identify extremum points in the anisotropic weighted moment space. We use the phase-consistency feature values as image gradient features and develop feature descriptors based on the Log–Gabor filter that are insensitive to scale and rotation. Finally, we employ Euclidean distance as the similarity measure for initial matching, align the feature descriptors, and remove false matches using the fast sample consensus (FSC) algorithm. Our findings indicate that the proposed algorithm significantly improves upon traditional feature-matching methods in overall efficacy. Specifically, the average number of matching points for long-wave infrared images is 1147, while for mid-wave infrared images, it increases to 8241. Additionally, the root mean square error (RMSE) fluctuations for both image types remain stable, averaging 1.5. The proposed algorithm also enhances the rotation invariance of image matching, achieving satisfactory results even at significant rotation angles.
(This article belongs to the Section Remote Sensors)
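As a rough illustration of the frequency-domain Log-Gabor filtering this abstract relies on, the sketch below builds a single 2D Log-Gabor filter (log-Gaussian radial band-pass times a Gaussian angular window) and applies it to an image. The center frequency, bandwidth, and orientation values are illustrative defaults, not parameters taken from the paper.

```python
import numpy as np

def log_gabor_2d(rows, cols, f0=0.1, sigma_f=0.55, theta0=0.0, sigma_theta=np.pi / 8):
    """Frequency-domain 2D Log-Gabor filter: log-Gaussian radial band-pass
    multiplied by a Gaussian angular window centred on theta0."""
    fy = np.fft.fftshift(np.fft.fftfreq(rows))
    fx = np.fft.fftshift(np.fft.fftfreq(cols))
    FX, FY = np.meshgrid(fx, fy)
    radius = np.sqrt(FX**2 + FY**2)
    radius[rows // 2, cols // 2] = 1.0          # avoid log(0) at the DC term
    radial = np.exp(-(np.log(radius / f0) ** 2) / (2 * np.log(sigma_f) ** 2))
    radial[rows // 2, cols // 2] = 0.0          # zero response at DC
    angle = np.arctan2(FY, FX)
    d_theta = np.arctan2(np.sin(angle - theta0), np.cos(angle - theta0))
    angular = np.exp(-(d_theta ** 2) / (2 * sigma_theta ** 2))
    return radial * angular

def filter_image(img, **kwargs):
    """Apply the Log-Gabor filter in the Fourier domain; the magnitude of the
    complex response carries the local energy used by phase-congruency measures."""
    H = log_gabor_2d(*img.shape, **kwargs)
    F = np.fft.fftshift(np.fft.fft2(img))
    return np.fft.ifft2(np.fft.ifftshift(F * H))

response = filter_image(np.random.rand(256, 256), f0=0.1, theta0=np.pi / 4)
```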
Figures:
Figure 1. Workflow of the matching algorithm in this paper.
Figure 2. The anisotropic weighted moment map: (a) long-wave infrared image; (b) medium-wave infrared image.
Figure 3. Feature point detection results on the anisotropic weighted moment map: (a) long-wave infrared image; (b) medium-wave infrared image.
Figure 4. Feature point detection results on the original image: (a) long-wave infrared image; (b) medium-wave infrared image.
Figure 5. Descriptor generation flowchart.
Figure 6. Part of the remote sensing images.
Figure 7. Matching results of long-wave infrared images based on five methods: (a) SIFT; (b) SURF; (c) ORB; (d) HAPCG; (e) the proposed algorithm.
Figure 8. Matching results of medium-wave infrared images based on five methods: (a) SIFT; (b) SURF; (c) ORB; (d) HAPCG; (e) the proposed algorithm.
Figure 9. Matching results of long-wave images based on the proposed algorithm.
Figure 10. Matching results of medium-wave images based on the proposed algorithm.
Figure 11. Results of several indicators for long-wave images.
Figure 12. Results of several indicators for medium-wave images.
Figure 13. Matching results of the proposed algorithm under different rotation differences: (a) 30 degrees; (b) 60 degrees; (c) 90 degrees; (d) 120 degrees; (e) 150 degrees; (f) 180 degrees; (g) 210 degrees; (h) 240 degrees; (i) 270 degrees; (j) 300 degrees.
Figure 14. NCM results for the rotated images.
Figure 15. RMSE results for the rotated images.
22 pages, 6239 KiB  
Article
Fine-Grained Aircraft Recognition Based on Dynamic Feature Synthesis and Contrastive Learning
by Huiyao Wan, Pazlat Nurmamat, Jie Chen, Yice Cao, Shuai Wang, Yan Zhang and Zhixiang Huang
Remote Sens. 2025, 17(5), 768; https://doi.org/10.3390/rs17050768 - 23 Feb 2025
Viewed by 305
Abstract
With the rapid development of deep learning, significant progress has been made in remote sensing image target detection. However, methods based on deep learning are confronted with several challenges: (1) the inherent limitations of activation functions and downsampling operations in convolutional networks lead to frequency deviations and loss of local detail information, affecting fine-grained object recognition; (2) class imbalance and long-tail distributions further degrade the performance of minority categories; (3) large intra-class variations and small inter-class differences make it difficult for traditional deep learning methods to effectively extract fine-grained discriminative features. To address these issues, we propose a novel remote sensing aircraft recognition method. First, to mitigate the loss of local detail information, we introduce a learnable Gabor filter-based texture feature extractor, which enhances the discriminative feature representation of aircraft categories by capturing detailed texture information. Second, to tackle the long-tail distribution problem, we design a dynamic feature hallucination module that synthesizes diverse hallucinated samples, thereby improving the feature diversity of tail categories. Finally, to handle the challenge of large intra-class variations and small inter-class differences, we propose a contrastive learning module to enhance the spatial discriminative features of the targets. Extensive experiments on the large-scale fine-grained datasets FAIR1M and MAR20 demonstrate the effectiveness of our method, achieving detection accuracies of 53.56% and 89.72%, respectively, and surpassing state-of-the-art performance. The experimental results validate that our approach effectively addresses the key challenges in remote sensing aircraft recognition.
(This article belongs to the Special Issue Efficient Object Detection Based on Remote Sensing Images)
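The learnable Gabor texture extractor is only described at a high level here; the PyTorch-style sketch below shows one common way to make Gabor parameters trainable, by regenerating the kernels from nn.Parameter values on every forward pass so gradients flow into orientation, wavelength, and bandwidth. The layer sizes and parameterization are assumptions for illustration, not the paper's actual module.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class LearnableGabor2d(nn.Module):
    """Convolution whose kernels are Gabor functions with trainable
    orientation, wavelength, bandwidth, and phase (a sketch, not the paper's layer)."""
    def __init__(self, out_channels=16, kernel_size=11):
        super().__init__()
        self.kernel_size = kernel_size
        self.theta = nn.Parameter(torch.rand(out_channels) * math.pi)
        self.lambd = nn.Parameter(torch.full((out_channels,), kernel_size / 2))
        self.sigma = nn.Parameter(torch.full((out_channels,), kernel_size / 4))
        self.psi = nn.Parameter(torch.zeros(out_channels))

    def kernels(self):
        k = self.kernel_size // 2
        ys, xs = torch.meshgrid(
            torch.arange(-k, k + 1, dtype=torch.float32),
            torch.arange(-k, k + 1, dtype=torch.float32),
            indexing="ij",
        )
        xs, ys = xs[None], ys[None]                  # broadcast over output channels
        theta = self.theta[:, None, None]
        x_r = xs * torch.cos(theta) + ys * torch.sin(theta)
        y_r = -xs * torch.sin(theta) + ys * torch.cos(theta)
        sigma, lambd, psi = (p[:, None, None] for p in (self.sigma, self.lambd, self.psi))
        g = torch.exp(-(x_r ** 2 + y_r ** 2) / (2 * sigma ** 2)) \
            * torch.cos(2 * math.pi * x_r / lambd + psi)
        return g[:, None]                            # (out, 1, k, k): single input channel

    def forward(self, x):
        return F.conv2d(x, self.kernels(), padding=self.kernel_size // 2)

layer = LearnableGabor2d()
features = layer(torch.randn(1, 1, 64, 64))          # -> (1, 16, 64, 64)
```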
Figures:
Figure 1. (a) The number of aircraft in each class of the MAR20 dataset. (b) The number of aircraft in each class of the FAIR1M dataset. (c) The 10 types of aircraft in the FAIR1M dataset. The long-tail distribution and fine-grained categories make object detection a more challenging task.
Figure 2. Overall architecture of our proposed method.
Figure 3. The structure of the learnable Gabor filter. The learnable Gabor filter is implemented through deep learning frameworks, allowing the filter's parameters to be automatically adjusted during the training process to enhance the model's ability to extract fine-grained features.
Figure 4. Schematic diagram of the learnable histogram operator.
Figure 5. Schematic diagram of the feature aggregation module.
Figure 6. Detection visualization results of RTMDet, DIMA, and our method on the FAIR1M dataset.
Figure 7. Detection visualization results of RTMDet, ReDet, and our method on the MAR20 dataset.
Figure 8. Visualization of features from the four stages of FPN on the FAIR1M and MAR20 datasets.
Figure 9. Visualization of features extracted by the learnable Gabor filters.
Figure 10. Visualization of 128 learnable Gabor filter convolution kernels on the FAIR1M dataset.
Figure 11. Confusion matrices of Oriented R-CNN and our method on the FAIR1M and MAR20 test datasets.
21 pages, 3281 KiB  
Article
Multi-Space Feature Fusion and Entropy-Based Metrics for Underwater Image Quality Assessment
by Baozhen Du, Hongwei Ying, Jiahao Zhang and Qunxin Chen
Entropy 2025, 27(2), 173; https://doi.org/10.3390/e27020173 - 6 Feb 2025
Viewed by 499
Abstract
In marine remote sensing, underwater images play an indispensable role in ocean exploration, owing to their richness in information and intuitiveness. However, underwater images often encounter issues such as color shifts, loss of detail, and reduced clarity, leading to the decline of image quality. Therefore, it is critical to study precise and efficient methods for assessing underwater image quality. No-reference multi-space feature fusion and entropy-based metrics for underwater image quality assessment (MFEM-UIQA) are proposed in this paper. Considering the color shifts of underwater images, the chrominance difference map is created from the chrominance space and statistical features are extracted. Moreover, considering the information representation capability of entropy, entropy-based multi-channel mutual information features are extracted to further characterize chrominance features. For the luminance space features, contrast features from luminance images based on gamma correction and luminance uniformity features are extracted. In addition, logarithmic Gabor filtering is applied to the luminance space images for subband decomposition and entropy-based mutual information of subbands is captured. Furthermore, underwater image noise features, multi-channel dispersion information, and visibility features are extracted to jointly represent the perceptual features. The experiments demonstrate that the proposed MFEM-UIQA surpasses the state-of-the-art methods.
(This article belongs to the Collection Entropy in Image Analysis)
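A minimal NumPy sketch of histogram-based entropy and mutual information, the kind of quantities the abstract uses to characterize chrominance channels and subbands; the bin counts and the assumption that channel values are normalized to [0, 1] are illustrative choices, not the paper's settings.

```python
import numpy as np

def entropy(channel, bins=256):
    """Shannon entropy of one image channel, estimated from its histogram."""
    hist, _ = np.histogram(channel, bins=bins, range=(0.0, 1.0))
    p = hist / hist.sum()
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

def mutual_information(ch_a, ch_b, bins=64):
    """Mutual information between two channels via their joint histogram:
    I(A;B) = H(A) + H(B) - H(A,B)."""
    joint, _, _ = np.histogram2d(ch_a.ravel(), ch_b.ravel(),
                                 bins=bins, range=[[0, 1], [0, 1]])
    pxy = joint / joint.sum()
    px, py = pxy.sum(axis=1), pxy.sum(axis=0)
    h_x = -np.sum(px[px > 0] * np.log2(px[px > 0]))
    h_y = -np.sum(py[py > 0] * np.log2(py[py > 0]))
    h_xy = -np.sum(pxy[pxy > 0] * np.log2(pxy[pxy > 0]))
    return h_x + h_y - h_xy

rgb = np.random.rand(128, 128, 3)        # stand-in for a normalized underwater image
mi_rg = mutual_information(rgb[..., 0], rgb[..., 1])
```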
Figures:
Figure 1. The framework of MFEM-UIQA.
Figure 2. Underwater images and corresponding UCD maps. (a) Underwater images of different quality levels; (b) corresponding UCD maps.
Figure 3. Comparison of the statistical distribution of MSCN coefficients for original underwater images and the corresponding Ψ_D. (a) The statistical distribution of MSCN coefficients for the original underwater images; (b) the statistical distribution of MSCN coefficients for Ψ_D.
Figure 4. Underwater images of different quality levels and the corresponding fitting Rayleigh distribution shape parameter. (a) Underwater images of different quality levels; (b) fitting Rayleigh distribution shape parameters corresponding to three channel histograms of the OC space.
Figure 5. Non-uniform brightness image and its block map. (a) Non-uniform brightness underwater image; (b) block map of (a).
Figure 6. Underwater images with differing quality and corresponding K-L divergence distribution. (a) Underwater images of different quality levels; (b) the K-L divergence distribution of three channels in the OC space.
Figure 7. Different quality underwater images and corresponding visibility values.
30 pages, 24013 KiB  
Article
Non-Concentric Differential Model with Geographic Information-Driven Weights Allocation for Enhanced Infrared Small Target Detection
by Lingbing Peng, Zhi Lu, Tao Lei and Ping Jiang
Remote Sens. 2025, 17(1), 75; https://doi.org/10.3390/rs17010075 - 28 Dec 2024
Viewed by 427
Abstract
Infrared small target detection technology has received extensive attention due to its advantages in long-distance monitoring. However, there is much room for improvement in its performance due to complex backgrounds and the lack of distinct features in small targets. Many specific scenarios can lead to target loss, such as edge-adjacent targets, intersecting targets, low contrast caused by locally bright backgrounds, and false alarms induced by globally bright backgrounds. To address these issues, we have identified the positional correlation differences between the local background location and whether the target can be perceived by the human eye, thereby introducing geographic information weights to represent this correlation difference. We first constructed a non-concentric Gaussian difference structure to prevent the central target energy loss caused by traditional concentric filters. Based on this, we introduced Gabor filters, which have the capability of directional feature extraction and position correlation representation, into the non-concentric differential structure. By adjusting the relative position of the Gabor filter center and configuring frequency parameters based on geographic information, we optimized the filter weights to handle complex situations, such as targets being close to background clutter or other targets. Subsequently, an improved logarithmic function was applied to adjust the overall saliency of candidate targets, preventing the loss of low-contrast targets and the residual high-energy background clutter. Extensive experiments show that our method exhibits effective detection performance and robustness in four application scenes and three challenging image distribution scenes.
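The exact DoGaGb construction is not given in the abstract; the sketch below only illustrates the general idea of a non-concentric difference kernel, pairing a centred Gaussian with a Gabor surround whose centre is displaced (for instance towards nearby clutter). All sizes, offsets, and parameter values are placeholders, not the paper's weights.

```python
import numpy as np

def gaussian2d(size, sigma, cx=0.0, cy=0.0):
    """2D Gaussian on a (size x size) grid, optionally shifted off centre."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    g = np.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
    return g / g.sum()

def gabor2d(size, sigma, lambd, theta, cx=0.0, cy=0.0):
    """Real Gabor kernel whose centre can be displaced, so the surround
    lobe need not be concentric with the target lobe."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    xr = (x - cx) * np.cos(theta) + (y - cy) * np.sin(theta)
    yr = -(x - cx) * np.sin(theta) + (y - cy) * np.cos(theta)
    g = np.exp(-(xr ** 2 + yr ** 2) / (2 * sigma ** 2)) * np.cos(2 * np.pi * xr / lambd)
    return g / np.abs(g).sum()

# Non-concentric difference: a centred narrow Gaussian for the target minus a
# Gabor surround shifted by (dx, dy) towards the suspected clutter direction.
size, dx, dy = 21, 3.0, 0.0
kernel = gaussian2d(size, sigma=1.0) - gabor2d(size, sigma=2.0, lambd=8.0, theta=0.0, cx=dx, cy=dy)
```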
Figures:
Figure 1. Flowchart of the method proposed in this paper.
Figure 2. Schematic diagram of three differential structures in a plane. (a) DoG, σ1 = 1, σ2 = 2. (b) iDoGb, σ1 = 0.6, σ2 = 1.2, σ3 = 1, σ4 = 2, θ = 0. (c) DoGaGb, σ1 = 1, σ2 = 1.4, σ3 = 2, θ = 0.
Figure 3. The DoGaGb model under different orientations, with σ1 = 1, σ2 = 1.4, σ3 = 2. (a) θ = 0. (b) θ = π/4. (c) θ = 3π/4. (d) θ = π/2.
Figure 4. Energy distribution of concentric Gaussian differences.
Figure 5. 3D display of three concentric differential models. (a) DoG, σ1 = 1, σ2 = 2. (b) DoGb, σ1 = 0.6, σ2 = 1.2, σ3 = 1, σ4 = 2, θ = 0. (c) DoGaGb, σ1 = 1, σ2 = 1.4, σ3 = 2, θ = 0.
Figure 6. Schematic diagram of a small target against a homogeneous background and a small target located at the edge of bright clutter. (a) Homogeneous background; (b) the edge of bright clutter.
Figure 7. Energy distribution of concentric one-dimensional standard Gaussian differences under different background parameters μ2. (a) μ2 = 1; (b) μ2 = 2; (c) μ2 = 5. The shaded areas indicate the target energy (red) and background energy (green).
Figure 8. Diagrams of several natural logarithm functions.
Figure 9. Illustration of the effectiveness of feature normalization. (a) Original image; (b) experimental result without logarithmic function normalization; (c) experimental result with logarithmic function normalization.
Figure 10. Diagram of the target and its surrounding background.
Figure 11. Detection performance of different algorithms in sky scenes. Examples 1–5 correspond to different test cases. The arrow in the 3D plot shows the target position, the rectangle in the grayscale image marks the target location, and the red circle represents the ground truth target position in the detection results.
Figure 12. Detection performance of different algorithms in sea-sky scenes. Examples 1–5 correspond to different test cases; annotations as in Figure 11.
Figure 13. Detection performance of different algorithms in urban scenes. Examples 1–5 correspond to different test cases; annotations as in Figure 11.
Figure 14. Detection performance of different methods in suburban scenes. Examples 1–5 correspond to different test cases; annotations as in Figure 11.
Figure 15. Detection performance of different methods for targets at the edge of background clutter. Examples 1–4 correspond to different test cases. The rectangle in the grayscale image marks the target location, and the red circle represents the ground truth target position in the detection results.
Figure 16. Detection performance of different algorithms for targets at the edge of background clutter. Examples 5–8 correspond to different test cases; annotations as in Figure 15.
Figure 17. Detection performance of different algorithms for targets at the edge of background clutter. Examples 9–12 correspond to different test cases; annotations as in Figure 15.
Figure 18. The detection and segmentation results of the proposed method in single-target scenes. Numbers 1–6 represent different test cases, while (a–f) correspond to various steps in the detection process. The rectangle in the grayscale images (a,d) indicates the location of the target, and the red circle in the 3D plots (b,e) marks the ground truth target position in the detection results.
Figure 19. The detection and segmentation results of the proposed method in multi-target scenes. Numbers 7–12 represent different test cases; annotations as in Figure 18.
Figure 20. ROC curve of single-frame dataset 1.
Figure 21. ROC curves for 6 groups of test sequences. (a–f) correspond to Seq 1–Seq 6, respectively.
17 pages, 6702 KiB  
Article
A Variational Neural Network Based on Algorithm Unfolding for Image Blind Deblurring
by Shaoqing Gong, Yeran Wang, Guangyu Yang, Weibo Wei, Junli Zhao and Zhenkuan Pan
Appl. Sci. 2024, 14(24), 11742; https://doi.org/10.3390/app142411742 - 16 Dec 2024
Viewed by 688
Abstract
Image blind deblurring is an ill-posed inverse problem in image processing. While deep learning approaches have demonstrated effectiveness, they often lack interpretability and require extensive data. To address these limitations, we propose a novel variational neural network based on algorithm unfolding. The model is solved using the half quadratic splitting (HQS) method and proximal gradient descent. For blur kernel estimation, we introduce an L0 regularizer to constrain the gradient information and use the fast Fourier transform (FFT) to solve the iterative results, thereby improving accuracy. Image restoration is initiated with Gabor filters for the convolution kernel, and the activation function is approximated using a Gaussian radial basis function (RBF). Additionally, two attention mechanisms improve feature selection. The experimental results on various datasets demonstrate that our model outperforms state-of-the-art algorithm unfolding networks and other blind deblurring models. Our approach enhances interpretability and generalization while utilizing fewer data and parameters.
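As a sketch of initializing restoration convolutions with Gabor filters, as the abstract mentions, the snippet below fills a trainable PyTorch Conv2d with an OpenCV-generated Gabor bank; the bank size, kernel size, and wavelengths are illustrative assumptions, and the unfolding iterations themselves are omitted.

```python
import cv2
import numpy as np
import torch
import torch.nn as nn

def gabor_bank(n_orient=8, ksize=7, sigma=2.0, lambd=4.0, gamma=0.5):
    """Bank of real Gabor kernels at n_orient evenly spaced orientations."""
    thetas = np.arange(n_orient) * np.pi / n_orient
    return np.stack([
        cv2.getGaborKernel((ksize, ksize), sigma, t, lambd, gamma, psi=0.0, ktype=cv2.CV_32F)
        for t in thetas
    ])

def gabor_initialized_conv(in_ch=1, n_orient=8, ksize=7):
    """Conv2d whose weights start as a Gabor bank; they remain trainable,
    so the network can refine them during training."""
    conv = nn.Conv2d(in_ch, n_orient, ksize, padding=ksize // 2, bias=False)
    bank = gabor_bank(n_orient, ksize)                            # (n_orient, k, k)
    weight = torch.from_numpy(bank).unsqueeze(1).repeat(1, in_ch, 1, 1) / in_ch
    with torch.no_grad():
        conv.weight.copy_(weight)
    return conv

conv = gabor_initialized_conv()
out = conv(torch.randn(1, 1, 64, 64))                              # -> (1, 8, 64, 64)
```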
Figures:
Figure 1. The flowchart of the algorithm unfolding.
Figure 2. The structure of the variational neural network. FFT-1 and FFT-2 correspond to Step 4 and Step 7 in Algorithm 1, respectively.
Figure 3. Examples of different models removing linear motion blur. (a) Ground truth; (b) DUBLID [22]; (c) DeblurGAN [8]; (d) DeblurGAN-v2 [24]; (e) DeepDeblur [6]; (f) SRN [7]; (g) SFNet [25]; (h) ours.
Figure 4. Examples of different models for nonlinear motion blur. (a) Ground truth; (b) DUBLID [22]; (c) DeblurGAN [8]; (d) DeblurGAN-v2 [24]; (e) DeepDeblur [6]; (f) SRN [7]; (g) SFNet [25]; (h) ours.
Figure 5. Examples of different models on the GoPro dataset. (a) Ground truth; (b) DUBLID [22]; (c) DeblurGAN [8]; (d) DeblurGAN-v2 [24]; (e) DeepDeblur [6]; (f) SRN [7]; (g) SFNet [25]; (h) ours.
Figure 6. Examples of different models on the Lai dataset. (a) Blurred; (b) DUBLID [22]; (c) DeblurGAN [8]; (d) DeblurGAN-v2 [24]; (e) DeepDeblur [6]; (f) SRN [7]; (g) SFNet [25]; (h) ours.
Figure 7. Experimental results with and without the L0 regularizer.
Figure 8. Blur kernels obtained by different regularizers.
28 pages, 7535 KiB  
Article
A New Computer-Aided Diagnosis System for Breast Cancer Detection from Thermograms Using Metaheuristic Algorithms and Explainable AI
by Hanane Dihmani, Abdelmajid Bousselham and Omar Bouattane
Algorithms 2024, 17(10), 462; https://doi.org/10.3390/a17100462 - 18 Oct 2024
Viewed by 1661
Abstract
Advances in the early detection of breast cancer and treatment improvements have significantly increased survival rates. Traditional screening methods, including mammography, MRI, ultrasound, and biopsies, while effective, often come with high costs and risks. Recently, thermal imaging has gained attention due to its minimal risks compared to mammography, although it is not widely adopted as a primary detection tool since it depends on identifying skin temperature changes and lesions. The advent of machine learning (ML) and deep learning (DL) has enhanced the effectiveness of breast cancer detection and diagnosis using this technology. In this study, a novel interpretable computer-aided diagnosis (CAD) system for breast cancer detection is proposed, leveraging Explainable Artificial Intelligence (XAI) throughout its various phases. To achieve these goals, we proposed a new multi-objective optimization approach named the Hybrid Particle Swarm Optimization algorithm (HPSO) and Hybrid Spider Monkey Optimization algorithm (HSMO). These algorithms simultaneously combined the continuous and binary representations of PSO and SMO to effectively manage trade-offs between accuracy, feature selection, and hyperparameter tuning. We evaluated several CAD models and investigated the impact of handcrafted methods such as Local Binary Patterns (LBP), Histogram of Oriented Gradients (HOG), Gabor Filters, and Edge Detection. We further shed light on the effect of feature selection and optimization on feature attribution and model decision-making processes using the SHapley Additive exPlanations (SHAP) framework, with a particular emphasis on cancer classification using the DMR-IR dataset. The results of our experiments demonstrate in all trials that the performance of the model is improved. With HSMO, our models achieved an accuracy of 98.27% and F1-score of 98.15% while selecting only 25.78% of the HOG features. This approach not only boosts the performance of CAD models but also ensures comprehensive interpretability. This method emerges as a promising and transparent tool for early breast cancer diagnosis.
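A small sketch of handcrafted Gabor-bank features of the sort evaluated in the study, computing the mean and standard deviation of filter responses over a thermogram region of interest; the bank size, kernel size, and wavelengths are assumed values chosen for illustration.

```python
import cv2
import numpy as np

def gabor_features(gray, n_orient=4, n_scale=3):
    """Handcrafted texture descriptor: mean and standard deviation of the
    responses of a small Gabor filter bank (orientations x scales)."""
    feats = []
    for s in range(n_scale):
        lambd = 4.0 * (2 ** s)                       # wavelength doubles per scale
        for o in range(n_orient):
            theta = o * np.pi / n_orient
            kern = cv2.getGaborKernel((15, 15), lambd / 2, theta, lambd, 0.5,
                                      psi=0.0, ktype=cv2.CV_32F)
            resp = cv2.filter2D(gray.astype(np.float32), cv2.CV_32F, kern)
            feats += [resp.mean(), resp.std()]
    return np.asarray(feats)                         # length = 2 * n_orient * n_scale

roi = np.random.rand(120, 120).astype(np.float32)    # stand-in for a cropped thermogram ROI
vector = gabor_features(roi)                         # 24-dimensional feature vector
```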
Figures:
Figure 1. The evolution and role of XAI in decision making by stakeholders.
Figure 2. Pipeline representation.
Figure 3. Process of gathering IDs and division into train–test sets in the DMR-IR database.
Figure 4. Sample feature-extracted images and their distributions: (a) original cropped images (top left: healthy sample; bottom left: sick sample); (b) LBP; (c) Canny edges; (d) HOG; (e) Gabor filter.
Figure 5. (a) Original cropped image; (b) feature-extracted image (top: Canny edges; middle: LBP extractor; bottom: HOG); (c) selected features mapped back to the extracted features.
Figure 6. Example of feature selection; top: extracted features using the LBP extractor; bottom: selected features using HSMO.
Figure 7. Example of explaining a prediction using a SHAP plot: the left side after applying HSMO optimization; the right side without optimization, both using an HOG feature extractor. The X-axis represents the value of Feature 4, and the Y-axis shows its SHAP value, indicating its contribution to the model's prediction. The color gradient represents secondary features (Feature 21 in the left plot and Feature 25 in the right plot), showing their interaction with Feature 4. The optimized model (left) displays a clearer and more structured relationship between features.
Figure 8. Optimization process flow. Solid arrows represent the flow of the process between the stages of the CAD system, while dashed boxes represent the hybrid optimization scheme and the XAI framework, which operate in parallel with the process to optimize various parameters at its different stages. The objective function evaluates the model's classification accuracy.
Figure 9. The overall HSMO optimization process. The widening search space at the beginning of the iterations illustrates the diverse solutions explored by the population. Both the binary feature-selection region and the continuous parameter-optimization region demonstrate convergence as the algorithm iterates. This convergence signifies the collaborative nature of the HSMO algorithm, where local leaders and the global leader guide the population towards more optimal solutions.
Figure 10. Resulting feature histograms and optimized feature mapping for the LBP, HOG, edge, and Gabor extractors.
Figure 11. Top: without optimization (left: full features; middle: BSMO; right: BPSO). Bottom: with optimization (left: full features; middle: HSMO; right: HPSO).
Figure 12. Feature heatmaps using the HSMO optimizer: (a) Canny edge detector; (b) Gabor filters; (c) LBP; (d) HOG. Full and optimized feature heatmaps are shown on the right and left, respectively.
10 pages, 3009 KiB  
Article
Unsupervised Learning for the Automatic Counting of Grains in Nanocrystals and Image Segmentation at the Atomic Resolution
by Woonbae Sohn, Taekyung Kim, Cheon Woo Moon, Dongbin Shin, Yeji Park, Haneul Jin and Hionsuck Baik
Nanomaterials 2024, 14(20), 1614; https://doi.org/10.3390/nano14201614 - 10 Oct 2024
Viewed by 1070
Abstract
Identifying the grain distribution and grain boundaries of nanoparticles is important for predicting their properties. Experimental methods for identifying the crystallographic distribution, such as precession electron diffraction, are limited by their probe size. In this study, we developed an unsupervised learning method by applying a Gabor filter to HAADF-STEM images at the atomic level for image segmentation and automatic counting of grains in polycrystalline nanoparticles. The methodology comprises a Gabor filter for feature extraction, non-negative matrix factorization for dimension reduction, and K-means clustering. We set the threshold distance and angle between the clusters required for the number of clusters to converge so as to automatically determine the optimal number of grains. This approach can shed new light on the nature of polycrystalline nanoparticles and their structure–property relationships.
(This article belongs to the Special Issue Exploring Nanomaterials through Electron Microscopy and Spectroscopy)
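The abstract spells out the pipeline (Gabor filter bank, per-pixel feature matrix, NMF dimension reduction, K-means clustering), so a compact sketch with OpenCV and scikit-learn is given below. Filter-bank and cluster parameters are illustrative, and the automatic selection of the optimal number of grains via threshold distance and angle is omitted.

```python
import numpy as np
import cv2
from sklearn.decomposition import NMF
from sklearn.cluster import KMeans

def segment_grains(gray, n_clusters=5, n_orient=6, n_scale=3, n_components=8):
    """Per-pixel Gabor responses -> NMF dimension reduction -> K-means labels.
    A sketch of the Gabor/NMF/K-means pipeline; parameters are illustrative."""
    h, w = gray.shape
    responses = []
    for s in range(n_scale):
        lambd = 3.0 * (2 ** s)
        for o in range(n_orient):
            kern = cv2.getGaborKernel((21, 21), lambd / 2, o * np.pi / n_orient,
                                      lambd, 0.5, 0.0, ktype=cv2.CV_32F)
            resp = cv2.filter2D(gray.astype(np.float32), cv2.CV_32F, kern)
            responses.append(np.abs(resp).ravel())       # non-negative for NMF
    X = np.stack(responses, axis=1)                      # (h*w, n_scale*n_orient) feature matrix
    X_red = NMF(n_components=n_components, init="nndsvda", max_iter=400).fit_transform(X)
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit_predict(X_red)
    return labels.reshape(h, w)                          # class map rearranged into a 2D image

image = np.random.rand(96, 96).astype(np.float32)        # stand-in for an atomic-resolution HAADF-STEM image
segmentation = segment_grains(image)
```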
Figures:
Figure 1. Schematic of the Gabor-filter-based clustering for particle segmentation. (1) Application of multiple Gabor filters; (2) creation of a feature vector for each pixel to obtain a feature matrix; (3) dimension reduction using NMF followed by K-means clustering. The class vectors are rearranged into a 2D matrix, illustrating the segmented image.
Figure 2. Sequence of k values for the Au nanoparticle with a five-fold twin and colorized segmentation, compared with the ground truth. (a) HAADF-STEM image, showing the five-fold twins of the particle. Segmentation and colored classes for (b) k = 2; (c) k = 4; (d) k = 6; (e) k = 8. (f) Ground truth of the segmentation of (a). The different colors indicate the different classes after clustering.
Figure 3. Segmented images of the Au nanoparticles with various k values. (a) HAADF-STEM image, showing the five-fold twins of the particle. Segmentation and color maps for (b) k = 7; (c) k = 8; (d) k = 9; (e) k = 10. The different colors indicate the different classes after clustering.
Figure 4. Segmentation of PtNi intermetallic nanoparticles. (a) HAADF-STEM image of a PtNi intermetallic nanoparticle; (b) segmented image with k = 5. The different colors indicate the different classes after clustering.
Figure 5. Segmentation of the PtNi intermetallic nanoparticles. (a) HAADF-STEM image of the PtNi intermetallic nanoparticle; (b) segmented image with k = 6. The different colors indicate the different classes after clustering.
Figure 6. Automated segmentation by setting a threshold value with k = 10. (a–d) HAADF-STEM images of intermetallic nanoparticles for image segmentation. Starting from the same k value, these images are segmented with optimal k values of (e) 5, (f) 5, (g) 4, and (h) 9. The different colors indicate the different classes after clustering.
23 pages, 9520 KiB  
Article
Visual Feature-Guided Diamond Convolutional Network for Finger Vein Recognition
by Qiong Yao, Dan Song, Xiang Xu and Kun Zou
Sensors 2024, 24(18), 6097; https://doi.org/10.3390/s24186097 - 20 Sep 2024
Viewed by 783
Abstract
Finger vein (FV) biometrics have garnered considerable attention due to their inherent non-contact nature and high security, exhibiting tremendous potential in identity authentication and beyond. Nevertheless, challenges pertaining to the scarcity of training data and inconsistent image quality continue to impede the effectiveness of finger vein recognition (FVR) systems. To tackle these challenges, we introduce the visual feature-guided diamond convolutional network (dubbed ‘VF-DCN’), a uniquely configured multi-scale and multi-orientation convolutional neural network. The VF-DCN showcases three pivotal innovations: Firstly, it meticulously tunes the convolutional kernels through multi-scale Log-Gabor filters. Secondly, it implements a distinctive diamond-shaped convolutional kernel architecture inspired by human visual perception. This design intelligently allocates more orientational filters to medium scales, which inherently carry richer information. In contrast, at extreme scales, the use of orientational filters is minimized to simulate the natural blurring of objects at extreme focal lengths. Thirdly, the network boasts a deliberate three-layer configuration and fully unsupervised training process, prioritizing simplicity and optimal performance. Extensive experiments are conducted on four FV databases, including MMCBNU_6000, FV_USM, HKPU, and ZSC_FV. The experimental results reveal that VF-DCN achieves remarkable improvement with equal error rates (EERs) of 0.17%, 0.19%, 2.11%, and 0.65%, respectively, and Accuracy Rates (ACC) of 100%, 99.97%, 98.92%, and 99.36%, respectively. These results indicate that, compared with some existing FVR approaches, the proposed VF-DCN not only achieves notable recognition accuracy but also has fewer parameters and lower model complexity. Moreover, VF-DCN exhibits superior robustness across diverse FV databases.
(This article belongs to the Section Sensing and Imaging)
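Since the results are reported as equal error rates, the sketch below shows one straightforward way to estimate an EER from genuine and impostor similarity scores by sweeping a decision threshold; the score distributions here are synthetic placeholders, not data from the paper.

```python
import numpy as np

def equal_error_rate(genuine_scores, impostor_scores):
    """Equal error rate from matching scores (higher score = more similar):
    sweep a threshold and return the error rate where FAR and FRR cross."""
    genuine = np.asarray(genuine_scores)
    impostor = np.asarray(impostor_scores)
    thresholds = np.sort(np.concatenate([genuine, impostor]))
    best_gap, eer = 1.0, None
    for t in thresholds:
        frr = np.mean(genuine < t)           # genuine pairs wrongly rejected
        far = np.mean(impostor >= t)         # impostor pairs wrongly accepted
        if abs(far - frr) < best_gap:
            best_gap, eer = abs(far - frr), (far + frr) / 2
    return eer

rng = np.random.default_rng(0)
genuine = rng.normal(0.8, 0.08, 500)          # toy similarity scores for same-finger pairs
impostor = rng.normal(0.5, 0.10, 5000)        # toy scores for different-finger pairs
print(f"EER ~ {equal_error_rate(genuine, impostor):.4%}")
```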
Figures:
Figure 1. Radial filters under different values of σr.
Figure 2. Angular filters under different angular scaling factors T.
Figure 3. Bank of Log-Gabor filters. Each row in (c) contains filters computed at the same scale; for each scale, 10 orientations are sampled.
Figure 4. Illustration of the framework of VF-DCN.
Figure 5. Diamond convolutional structure of VF-DCN.
Figure 6. Adaptive orientational filter learning strategy for the convolutional kernels across different scales.
Figure 7. ROI images of the four FV databases. The ROIs in (a,b) are provided by the datasets themselves, while the ROIs in (c,d) are extracted by the 3σ criterion [1].
Figure 8. Trend of EER at varying parameters.
Figure 9. ROC curves of various diamond-shaped convolutional structures on the four finger vein databases.
25 pages, 13590 KiB  
Article
Fast and Nondestructive Proximate Analysis of Coal from Hyperspectral Images with Machine Learning and Combined Spectra-Texture Features
by Jihua Mao, Hengqian Zhao, Yu Xie, Mengmeng Wang, Pan Wang, Yaning Shi and Yusen Zhao
Appl. Sci. 2024, 14(17), 7920; https://doi.org/10.3390/app14177920 - 5 Sep 2024
Cited by 1 | Viewed by 1348
Abstract
Proximate analysis, including ash, volatile matter, moisture, fixed carbon, and calorific value, is a fundamental aspect of fuel testing and serves as the primary method for evaluating coal quality, which is critical for the processing and utilization of coal. The traditional analytical methods involve time-consuming and costly combustion processes, particularly when applied to large volumes of coal that need to be sampled in massive batches. Hyperspectral imaging is promising for the rapid and nondestructive determination of coal quality indices. In this study, a fast and nondestructive coal proximate analysis method with combined spectral-spatial features was developed using a hyperspectral imaging system in the 450–2500 nm range. The processed spectra were evaluated using PLSR, with the most effective MSC spectra selected. To reduce the spectral redundancy and improve the accuracy, the SPA, Boruta, iVISSA, and CARS algorithms were adopted to extract the characteristic wavelengths, and 16 prediction models were constructed and optimized based on the PLSR, RF, BPNN, and LSSVR algorithms within the Optuna framework for each quality indicator. For spatial information, the histogram statistics, gray-level covariance matrix, and Gabor filters were employed to extract the texture features within the characteristic wavelengths. The texture feature-based and combined spectral-texture feature-based prediction models were constructed by applying the spectral modeling strategy, respectively. Compared with the models based on spectral or texture features only, the LSSVR models with combined spectral-texture features achieved the highest prediction accuracy in all quality metrics, with Rp2 values of 0.993, 0.989, 0.979, 0.948, and 0.994 for Ash, VM, MC, FC, and CV, respectively. This study provides a technical reference for hyperspectral imaging technology as a new method for the rapid, nondestructive proximate analysis and quality assessment of coal.
(This article belongs to the Section Optics and Lasers)
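A sketch of gray-level co-occurrence matrix (GLCM) texture features, one of the spatial descriptors the study extracts at the characteristic wavelengths, using scikit-image (>= 0.19 for the graycomatrix/graycoprops names); the quantization level, offsets, and the random stand-in band are assumptions.

```python
import numpy as np
from skimage.feature import graycomatrix, graycoprops

def glcm_features(band, levels=32):
    """Gray-level co-occurrence features (contrast, correlation, energy,
    homogeneity) for one characteristic-wavelength band image."""
    # Quantize the reflectance band into `levels` gray levels.
    q = np.digitize(band, np.linspace(band.min(), band.max(), levels)) - 1
    glcm = graycomatrix(q.astype(np.uint8),
                        distances=[1], angles=[0, np.pi / 4, np.pi / 2, 3 * np.pi / 4],
                        levels=levels, symmetric=True, normed=True)
    props = ["contrast", "correlation", "energy", "homogeneity"]
    return np.concatenate([graycoprops(glcm, p).ravel() for p in props])   # 4 props x 4 angles

band = np.random.rand(80, 80)         # stand-in for a hyperspectral band at a selected wavelength
texture_vector = glcm_features(band)  # 16-dimensional texture descriptor
```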
Figures:
Figure 1. Research flow chart of the study.
Figure 2. Pseudo-color images (817 nm, 661 nm, and 549 nm) of four coal samples and the measured values of the quality indices. The samples are arranged from left to right by CV, from highest to lowest.
Figure 3. Reflectance spectra and characteristic wavelengths obtained by averaging pixels in the region of interest from hyperspectral images of 61 coal samples. Each colored curve represents one coal sample.
Figure 4. Evolution of the noise level with wavelength, evaluated from the spectra of all coal samples. Wavelengths with prominent noise spikes (>0.5%) have been excluded from further analysis.
Figure 5. Spectral curves of (a) raw and preprocessed reflectance of all coal samples using the (b) SG, (c) FD, and (d) MSC methods. The red curves indicate the mean spectra of all coal samples, and the gray shadows represent the range of spectral reflectance.
Figure 6. Results of characteristic wavelength extraction (marked by red squares). The characteristic wavelengths of Ash, VM, MC, and CV were extracted by CARS; the characteristic wavelengths of FC were extracted by Boruta.
Figure 7. Scatter plots of actual and predicted coal quality index values obtained using the optimal LSSVR model based on the combined spectra-texture features. (a) Ash; (b) VM; (c) MC; (d) FC; (e) CV.
Figure 8. Contributions of the top ten significant variables by SHAP value in the optimal prediction models for the coal quality indices.
Figure 9. Relative contribution of the coal quality indices based on mean absolute SHAP values.
Figure 10. Predictive distribution of the coal quality indices by combined spectra-texture features based on hyperspectral images. The four samples for each index correspond to the minimum, 25%, 75%, and maximum values in the dataset, from left to right.
18 pages, 16408 KiB  
Article
Enhanced Scratch Detection for Textured Materials Based on Optimized Photometric Stereo Vision and Fast Fourier Transform–Gabor Filtering
by Yaoshun Yue, Wenpeng Sang, Kaiwei Zhai and Maohai Lin
Appl. Sci. 2024, 14(17), 7812; https://doi.org/10.3390/app14177812 - 3 Sep 2024
Viewed by 1209
Abstract
In the process of scratch defect detection in textured materials, there are often problems of low efficiency in traditional manual detection, large errors in machine vision, and difficulty in distinguishing defective scratches from the background texture. In order to solve these problems, we developed an enhanced scratch defect detection system for textured materials based on optimized photometric stereo vision and FFT-Gabor filtering. We designed and optimized a novel hemispherical image acquisition device that allows for selective lighting angles. This device integrates images captured under multiple light sources to obtain richer surface gradient information for textured materials, overcoming issues caused by high reflections or dark shadows under a single light source angle. At the same time, for the textured material, scratches and a textured background are difficult to distinguish; therefore, we introduced a Gabor filter-based convolution kernel, leveraging the fast Fourier transform (FFT), to perform convolution operations and spatial domain phase subtraction. This process effectively enhances the defect information while suppressing the textured background. The effectiveness and superiority of the proposed method were validated through material applicability experiments and comparative method evaluations using a variety of textured material samples. The results demonstrated a stable scratch capture success rate of 100% and a recognition detection success rate of 98.43% ± 1.0%.
(This article belongs to the Section Applied Industrial Technologies)
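The paper's FFT-Gabor enhancement with spatial-domain phase subtraction is not fully specified in the abstract; the sketch below only shows the general pattern of FFT-based convolution with a zero-mean Gabor kernel followed by subtraction from the original image to suppress periodic texture. All parameters and the subtraction scheme are illustrative assumptions.

```python
import numpy as np
import cv2
from scipy.signal import fftconvolve

def enhance_scratches(gray, theta=0.0, lambd=8.0, sigma=4.0):
    """FFT-based convolution with a zero-mean Gabor kernel, then subtraction of
    the response magnitude from the original image, damping the periodic
    background texture while keeping scratch-like deviations."""
    kern = cv2.getGaborKernel((31, 31), sigma, theta, lambd, 0.5, 0.0, ktype=cv2.CV_32F)
    kern -= kern.mean()                                    # zero-mean: flat regions respond ~0
    response = fftconvolve(gray.astype(np.float32), kern, mode="same")
    enhanced = gray.astype(np.float32) - np.abs(response)
    return cv2.normalize(enhanced, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)

texture = np.random.rand(256, 256).astype(np.float32)      # stand-in for a textured surface image
result = enhance_scratches(texture)
```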
Figures:
Figure 1. (a) Due to the micro-geometry, some micro-planes are occluded and do not receive light (shadowing). (b) Light reflected from micro-planes that cannot be seen from the observation direction is also not visible (masking).
Figure 2. Enhanced scratch detection for textured materials based on optimized photometric stereo vision and FFT-Gabor filtering.
Figure 3. Light-source-selective image acquisition device based on photometric stereo vision: (A) the principle of photometric stereo vision; (B) the principle of its application.
Figure 4. The impact of different numbers of light sources on the evaluated image quality.
Figure 5. Photometric stereo vision based on input images from 8 light sources.
Figure 6. Photometric stereo vision image acquisition for different textured materials. (A) Coarse textured leather; (B) fine textured leather; (C) textile fabric; (D) textured kraft paper.
Figure 7. Framework for scratch defect detection on textured material surfaces based on the image enhancement algorithm.
Figure 8. The effect of image contrast enhancement after the Gabor-filter-based fast Fourier transform. Sample A is fine textured leather, sample B is coarse textured leather, and sample C is light textured leather.
Figure 9. Validation of the applicability of the detection method to textured materials. Sample A is coarse-textured kraft paper; sample B is fine-textured leather; sample C is coarse linen fabric; sample D is fine-textured kraft paper; sample E is a textured wood panel.
Figure 10. Validation of the method's superiority. (A) A fine-textured leather material with a deep and dense self-texture; (B) a coarse-textured leather material with a deep and irregular self-texture; (C) a fine-textured leather material with a shallow and relatively regular self-texture.
33 pages, 30114 KiB  
Article
Exploring the Influence of Object, Subject, and Context on Aesthetic Evaluation through Computational Aesthetics and Neuroaesthetics
by Fangfu Lin, Wanni Xu, Yan Li and Wu Song
Appl. Sci. 2024, 14(16), 7384; https://doi.org/10.3390/app14167384 - 21 Aug 2024
Cited by 1 | Viewed by 1423
Abstract
Background: In recent years, computational aesthetics and neuroaesthetics have provided novel insights into understanding beauty. Building upon the findings of traditional aesthetics, this study aims to combine these two research methods to explore an interdisciplinary approach to studying aesthetics. Method: Abstract artworks were used as experimental materials. Based on traditional aesthetics and in combination, features of composition, tone, and texture were selected. Computational aesthetic methods were then employed to correspond these features to physical quantities: blank space, gray histogram, Gray Level Co-occurrence Matrix (GLCM), Local Binary Pattern (LBP), and Gabor filters. An electroencephalogram (EEG) experiment was carried out, in which participants conducted aesthetic evaluations of the experimental materials in different contexts (genuine, fake), and their EEG data were recorded to analyze the impact of various feature classes in the aesthetic evaluation process. Finally, a Support Vector Machine (SVM) was utilized to model the feature data, Event-Related Potentials (ERPs), context data, and subjective aesthetic evaluation data. Result: Behavioral data revealed higher aesthetic ratings in the genuine context. ERP data indicated that genuine contexts elicited more negative deflections in the prefrontal lobes between 200 and 1000 ms. Class II compositions demonstrated more positive deflections in the parietal lobes at 50–120 ms, while Class I tones evoked more positive amplitudes in the occipital lobes at 200–300 ms. Gabor features showed significant variations in the parieto-occipital area at an early stage. Class II LBP elicited a prefrontal negative wave with a larger amplitude. The results of the SVM models indicated that the model incorporating aesthetic subject and context data (ACC = 0.76866) outperforms the model using only parameters of the aesthetic object (ACC = 0.68657). Conclusion: A positive context tends to provide participants with a more positive aesthetic experience, but abstract artworks may not respond to this positivity. During aesthetic evaluation, the ERP data activated by different features show a trend from global to local. The SVM model based on multimodal data fusion effectively predicts aesthetics, further demonstrating the feasibility of the combined research approach of computational aesthetics and neuroaesthetics.
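The SVM modelling step maps naturally onto a scikit-learn grid search over C and γ, the two quantities varied in Figure 14; the feature matrix below is a random placeholder standing in for the fused object features (blank space, gray histogram, GLCM, LBP, Gabor), ERP amplitudes, and context flags, so the labels and scores have no bearing on the paper's results.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline

# Toy fused feature matrix: columns could hold composition, tone, and texture
# statistics plus ERP amplitudes and a context flag (placeholder values).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 40))
y = rng.integers(0, 2, size=200)             # binary aesthetic evaluation (placeholder labels)

grid = GridSearchCV(
    make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    param_grid={"svc__C": [0.1, 1, 10, 100], "svc__gamma": [1e-3, 1e-2, 1e-1, 1]},
    scoring="accuracy", cv=5,
)
grid.fit(X, y)
print(grid.best_params_, grid.best_score_)    # best (C, gamma) pair and its cross-validated ACC
```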
Figures:
Figure 1. The calculation of blank space in Suprematist Composition: Airplane Flying (image processed by the authors as fair use from wikiart.org), https://www.wikiart.org/en/kazimir-malevich/aeroplane-flying-1915 (accessed on 4 March 2024).
Figure 2. Kernels of different wavelengths λ and angles θ.
Figure 3. Example images of features.
Figure 4. Illustration of the stimulus paradigm applied.
Figure 5. Grand-average event-related brain potentials and isopotential contour plot (200–1000 ms) for genuine and fake contexts. N = 12.
Figure 6. Grand-average event-related brain potentials and isopotential contour plot (50–120 ms) for context (genuine, fake) × composition (Class I, Class II). N = 12.
Figure 7. Grand-average event-related brain potentials and isopotential contour plot (200–300 ms) for context (genuine, fake) × tone (Class I, Class II). N = 12.
Figure 8. Grand-average event-related brain potentials and isopotential contour plot (70–130 ms) for context (genuine, fake) × Gabor mean (Class I, Class II). N = 12.
Figure 9. Grand-average event-related brain potentials and isopotential contour plot (70–130 ms) for context (genuine, fake) × Gabor variance (Class I, Class II). N = 12.
Figure 10. Grand-average event-related brain potentials and isopotential contour plot (70–130 ms and 200–300 ms) for context (genuine, fake) × Gabor energy (Class I, Class II). N = 12.
Figure 11. Grand-average event-related brain potentials and isopotential contour plot (500–1000 ms) for context (genuine, fake) × horizontal GLCM (Class I, Class II). N = 12.
Figure 12. Grand-average event-related brain potentials and isopotential contour plot (70–140 ms and 500–1000 ms) for context (genuine, fake) × diagonal GLCM (Class I, Class II). N = 12.
Figure 13. Grand-average event-related brain potentials and isopotential contour plot (300–1000 ms) for context (genuine, fake) × LBP (Class I, Class II). N = 12.
Figure 14. Performance of SVM models with varying C and γ values: (a) the ACC with different C and γ combinations; (b) the AUC with different C and γ combinations. (The closer the color is to red, the higher the value; the closer to blue, the lower the value.)
22 pages, 12904 KiB  
Article
Intelligent Classification and Segmentation of Sandstone Thin Section Image Using a Semi-Supervised Framework and GL-SLIC
by Yubo Han and Ye Liu
Minerals 2024, 14(8), 799; https://doi.org/10.3390/min14080799 - 5 Aug 2024
Cited by 1 | Viewed by 1210
Abstract
This study presents the development and validation of a robust semi-supervised learning framework specifically designed for the automated segmentation and classification of sandstone thin section images from the Yanchang Formation in the Ordos Basin. Traditional geological image analysis methods encounter significant challenges due to the labor-intensive and error-prone nature of manual labeling, compounded by the diversity and complexity of rock thin sections. Our approach addresses these challenges by integrating the GL-SLIC algorithm, which combines Gabor filters and Local Binary Patterns for effective superpixel segmentation, laying the groundwork for advanced component identification. The primary innovation of this research is the semi-supervised learning model that utilizes a limited set of manually labeled samples to generate high-confidence pseudo labels, thereby significantly expanding the training dataset. This methodology effectively tackles the critical challenge of insufficient labeled data in geological image analysis, enhancing the model’s generalization capability from minimal initial input. Our framework improves segmentation accuracy by closely aligning superpixels with the intricate boundaries of mineral grains and pores. Additionally, it achieves substantial improvements in classification accuracy across various rock types, reaching up to 96.3% in testing scenarios. This semi-supervised approach represents a significant advancement in computational geology, providing a scalable and efficient solution for detailed petrographic analysis. It not only enhances the accuracy and efficiency of geological interpretations but also supports broader hydrocarbon exploration efforts. Full article
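A minimal sketch of the general idea (not the authors' GL-SLIC implementation): standard SLIC superpixels with Gabor and LBP texture statistics pooled per segment, using scikit-image. The input filename, segment count, filter bank, and LBP settings are illustrative assumptions, and an RGB thin-section image is assumed.

```python
import numpy as np
from skimage import io, color
from skimage.segmentation import slic
from skimage.filters import gabor
from skimage.feature import local_binary_pattern

image = io.imread("thin_section.png")          # hypothetical RGB thin-section image
gray = color.rgb2gray(image)

# Plain SLIC superpixels; GL-SLIC additionally folds Gabor/LBP texture into the
# clustering distance, which is approximated here by per-segment pooling.
labels = slic(image, n_segments=400, compactness=10, start_label=0)

# Small texture bank: four Gabor response maps and a uniform LBP map.
gabor_maps = [gabor(gray, frequency=f, theta=t)[0]
              for f in (0.1, 0.3) for t in (0, np.pi / 2)]
lbp_map = local_binary_pattern((gray * 255).astype(np.uint8),
                               P=8, R=1, method="uniform")

# Per-superpixel descriptor: mean of each texture map inside the segment.
n_segments = labels.max() + 1
descriptors = np.zeros((n_segments, len(gabor_maps) + 1))
for k in range(n_segments):
    mask = labels == k
    descriptors[k, :-1] = [m[mask].mean() for m in gabor_maps]
    descriptors[k, -1] = lbp_map[mask].mean()

# descriptors can then be merged or passed on to the semi-supervised classifier.
```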
Show Figures

Figure 1: Thin section image of sandstone under plane-polarized light: the main components are quartz, kaolinite, matrix, pores and lithic fragments.
Figure 2: Workflow for recognizing minerals using GL-SLIC segmentation and semi-supervised training.
Figure 3: GL-BP feature extraction workflow integrating LBP operator and Gabor filters for sandstone thin section images.
Figure 4: Feature extraction visualization using Gabor filters at various scales and orientations for sandstone thin section images.
Figure 5: Mean feature comparison chart: (a) mean feature chart at scale 1; (b) mean feature chart at scale 2; (c) mean feature chart at scale 3; (d) mean feature chart at scale 4; (e) mean feature chart at scale 5; (f) mean feature chart at scale 6.
Figure 6: LBP Feature Extraction: (a) mean feature chart at scale 1; (b) mean feature chart at scale 2; (c) mean feature chart at scale 3; (d) mean feature chart at scale 4; (e) mean feature chart at scale 5; (f) mean feature chart at scale 6.
Figure 7: Semi-supervised self-training process.
Figure 8: Modified VGG16 Classifier Architecture.
Figure 9: Discriminator model architecture.
Figure 10: Comparison of superpixel segmentation algorithms on sandstone images: (a) original sandstone image; (b) FH; (c) QS; (d) SEEDS; (e) Watershed; (f) LSC; (g) SLIC; (h) GL-SLIC.
Figure 11: Comparison of segmentation results between SLIC and GL-SLIC algorithms: (a) pre-segmentation result by the SLIC algorithm; (b) pre-segmentation result using the GL-SLIC algorithm.
Figure 12: Detailed comparison between SLIC and GL-SLIC algorithms: (a1) detail area a1 from SLIC; (a2) detail area a2 from SLIC; (a3) detail area a3 from SLIC; (b1) detail area b1 from GL-SLIC; (b2) detail area b2 from GL-SLIC; (b3) detail area b3 from GL-SLIC.
Figure 13: Comparison of superpixel merging in medium-coarse-grained quartz sandstone: (a) medium-coarse-grained quartz sandstone image; (b) pre-segmentation result; (c) result after superpixel merging.
Figure 14: Iterative model training and data augmentation process using labeled and unlabeled rock data to mitigate overfitting and enhance classification accuracy: (a) primary model; (b) discriminator model.
Figure 15: Curves of training and testing accuracy variation with epochs for the primary model.
Figure 16: Classification accuracy analysis: (a) training set confusion matrix, (b) test set confusion matrix.
Figure 17: Improved model accuracy post dataset cleansing and enhancement.
Figure 18: Final confusion matrices for model evaluation: (a) training data confusion matrix, (b) testing data confusion matrix.
Figure 19: Component identification results: (a) original petrographic thin section images; (b) proposed method results; (c) UNet-based semantic segmentation results.
26 pages, 3348 KiB  
Article
Hybrid Feature Mammogram Analysis: Detecting and Localizing Microcalcifications Combining Gabor, Prewitt, GLCM Features, and Top Hat Filtering Enhanced with CNN Architecture
by Miguel Alejandro Hernández-Vázquez, Yazmín Mariela Hernández-Rodríguez, Fausto David Cortes-Rojas, Rafael Bayareh-Mancilla and Oscar Eduardo Cigarroa-Mayorga
Diagnostics 2024, 14(15), 1691; https://doi.org/10.3390/diagnostics14151691 - 5 Aug 2024
Cited by 3 | Viewed by 1792
Abstract
Breast cancer is a prevalent malignancy characterized by the uncontrolled growth of glandular epithelial cells, which can metastasize through the blood and lymphatic systems. Microcalcifications, small calcium deposits within breast tissue, are critical markers for early detection of breast cancer, especially in non-palpable carcinomas. These microcalcifications, appearing as small white spots on mammograms, are challenging to identify due to potential confusion with other tissues. This study hypothesizes that a hybrid feature extraction approach combined with Convolutional Neural Networks (CNNs) can significantly enhance the detection and localization of microcalcifications in mammograms. The proposed algorithm employs Gabor, Prewitt, and Gray Level Co-occurrence Matrix (GLCM) kernels for feature extraction. These features are input to a CNN architecture designed with maxpooling layers, Rectified Linear Unit (ReLU) activation functions, and a sigmoid response for binary classification. Additionally, the Top Hat filter is used for precise localization of microcalcifications. The preprocessing stage includes enhancing contrast using the Volume of Interest Look-Up Table (VOI LUT) technique and segmenting regions of interest. The CNN architecture comprises three convolutional layers, three ReLU layers, and three maxpooling layers. The training was conducted using a balanced dataset of digital mammograms, with the Adam optimizer and binary cross-entropy loss function. Our method achieved an accuracy of 89.56%, a sensitivity of 82.14%, and a specificity of 91.47%, outperforming related works, which typically report accuracies around 85–87% and sensitivities between 76 and 81%. These results underscore the potential of combining traditional feature extraction techniques with deep learning models to improve the detection and localization of microcalcifications. This system may serve as an auxiliary tool for radiologists, enhancing early detection capabilities and potentially reducing diagnostic errors in mass screening programs. Full article
(This article belongs to the Special Issue Quantitative and Intelligent Analysis of Medical Imaging, 2nd Edition)
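To make the two classical stages named in the abstract concrete, the OpenCV sketch below computes a Prewitt gradient magnitude, one orientation of a Gabor bank, and a white top-hat with an elliptical (disk-like) structuring element to emphasise small bright microcalcification candidates. The filename, kernel sizes, Gabor parameters, and thresholding rule are assumptions for illustration, not the authors' settings.

```python
import numpy as np
import cv2

mammo = cv2.imread("mammogram.png", cv2.IMREAD_GRAYSCALE)  # hypothetical input

# Prewitt edge responses (OpenCV has no built-in Prewitt, so use filter2D).
kx = np.array([[1, 0, -1], [1, 0, -1], [1, 0, -1]], dtype=np.float32)
mf = mammo.astype(np.float32)
prewitt_mag = cv2.magnitude(cv2.filter2D(mf, -1, kx),
                            cv2.filter2D(mf, -1, kx.T))

# One orientation of a Gabor bank; a full bank would sweep theta.
gk = cv2.getGaborKernel(ksize=(21, 21), sigma=4.0, theta=0.0,
                        lambd=10.0, gamma=0.5)
gabor_resp = cv2.filter2D(mf, -1, gk)

# White top-hat: original minus morphological opening with a disk-like element,
# which leaves small bright structures such as microcalcifications.
se = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (15, 15))
tophat = cv2.morphologyEx(mammo, cv2.MORPH_TOPHAT, se)
candidates = tophat > tophat.mean() + 3 * tophat.std()   # crude candidate mask
```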
Show Figures

Figure 1: Overview of the CNN architecture summarizing the methods section. It illustrates the process from input mammogram, through convolutional, ReLU, and maxpooling layers, to the final classification of mammograms as positive or negative for MCs. The green circle marks the location of the MCs.
Figure 2: CNN architecture for detecting microcalcifications in mammograms, including convolutional, ReLU, maxpooling, flattening, dense, and output layers.
Figure 3: Pipeline for localizing MCs in mammographic images. The process includes preprocessing, applying morphological operations (erosion and dilation) with a disk-shaped structuring element, and using the Top Hat transformation to highlight microcalcifications in the final processed image.
Figure 4: (a) Mammograms without VOI-LUT transformation, showing unnormalized contrast and visible annotations from medical examinations and equipment; (b) mammograms with VOI-LUT transformation, demonstrating normalized contrast and the removal of annotations, resulting in improved visibility of microcalcifications.
Figure 5: Feature extraction stage using Prewitt, GLCM, and Gabor kernels. The columns show the original mammogram, the application of each kernel, and the resulting feature maps after convolution, maxpooling, and ReLU stages in the CNN. The red box denotes the section that was processed to illustrate this example.
Figure 6: Training and validation loss and accuracy over 50 epochs, demonstrating the convergence and stability of the CNN model during the learning process with a dataset of 5121 mammograms.
Figure 7: Analysis of hyperparameter tuning impact.
Figure 8: ROC curves for the non-tuned and tuned models demonstrate the improvement in performance after hyperparameter tuning. The true positive rate (sensitivity) is plotted against the false positive rate (1 − specificity) for various threshold values.
Figure 9: Violin plots of predicted probabilities for true positives, false positives, true negatives, and false negatives. The plots display the distribution of predicted probabilities, combining box plots and density plots to provide a comprehensive view of the data.
Figure 10: Counts of model prediction outcomes verified by radiologist.
17 pages, 14796 KiB  
Article
Application of Gabor, Log-Gabor, and Adaptive Gabor Filters in Determining the Cut-Off Wavelength Shift of TFBG Sensors
by Sławomir Cięszczyk
Appl. Sci. 2024, 14(15), 6394; https://doi.org/10.3390/app14156394 - 23 Jul 2024
Cited by 1 | Viewed by 997
Abstract
Tilted fibre Bragg gratings are optical fibre structures used as sensors of various physical quantities. Their unique measurement capabilities result from the high complexity of the optical spectrum consisting of several dozen cladding mode resonances. TFBG spectra demodulation methods generate signal features that highlight changes in the spectrum due to changes in the interacting quantities. Such methods should enable the distinction between two slightly different values of the measured quantity. The paper presents an effective method of processing the TFBG spectrum for use in measuring the refractive index of liquids. The use of Gabor and log-Gabor filters and their adaptive version eliminates the problem of discontinuity in determining the SRI value related to the existence of the cladding mode comb. The Gabor filters used make visible the shifting and fading of spectral features related to the decrease in the intensity of leaking modes. Subsequent modifications of the proposed algorithm led to an increase in the quality factor of the processed spectrum. Full article
(This article belongs to the Section Optics and Lasers)
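The core frequency-domain operation described above can be sketched in a few lines: build a 1-D log-Gabor transfer function centred on the cladding-mode comb frequency, filter the transmission spectrum, and take the analytic-signal envelope whose steepest change tracks the cut-off wavelength. The centre frequency and bandwidth ratio below are placeholder assumptions, not values from the paper.

```python
import numpy as np
from scipy.signal import hilbert

def log_gabor_envelope(spectrum, f0=0.05, sigma_ratio=0.55):
    """Filter a 1-D TFBG transmission spectrum (uniform wavelength grid) with a
    log-Gabor transfer function and return its envelope plus the index of the
    steepest envelope change. f0 is the normalised centre frequency of the
    cladding-mode comb (an assumed placeholder)."""
    n = len(spectrum)
    freqs = np.fft.rfftfreq(n)                       # normalised frequency axis
    lg = np.zeros_like(freqs)
    nz = freqs > 0                                   # log-Gabor is zero at DC
    lg[nz] = np.exp(-(np.log(freqs[nz] / f0) ** 2)
                    / (2 * np.log(sigma_ratio) ** 2))

    filtered = np.fft.irfft(np.fft.rfft(spectrum) * lg, n)
    envelope = np.abs(hilbert(filtered))             # analytic-signal envelope
    cutoff_index = int(np.argmax(np.abs(np.diff(envelope))))
    return envelope, cutoff_index

# Applying this to spectra recorded at different SRI values and tracking
# cutoff_index (or the envelope-derivative maximum) yields a calibration curve.
```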
Show Figures

Figure 1: Schematic view of the experimental setup.
Figure 2: Transmission spectra of a TFBG grating immersed in solutions with different SRI values.
Figure 3: Fourier transform of the optical spectrum of the TFBG grating for the 15 SRI values considered.
Figure 4: Impulse response of the real and imaginary parts of the Gabor filters adjusted to the main frequency associated with the occurrence of cladding modes.
Figure 5: Unprocessed TFBG spectrum and its real part after filtration with a Gabor filter.
Figure 6: Real and imaginary part of the spectrum after Gabor filtering, along with the envelope.
Figure 7: Envelopes of the TFBG spectra after Gabor filtering for different SRI values.
Figure 8: Derivatives (first differences) of the envelopes from Figure 6.
Figure 9: Shift of the maximum value of the derivative as a function of SRI.
Figure 10: Impulse response of the real and imaginary parts of the Gabor filter adjusted to the frequency of the second harmonic of the TFBG transmission spectrum.
Figure 11: Frequency spectra calculated on the basis of the TFBG transmission spectra, along with the frequency response of the Gabor filter adjusted to the second harmonic of the cladding mode comb.
Figure 12: The real and imaginary parts of the spectrum of cladding modes after filtration with a Gabor filter, along with the envelope.
Figure 13: Derivatives of the cladding mode envelopes for the second harmonic of the optical spectra.
Figure 14: Shift of the cut-off wavelength as a function of the SRI value for the spectra of the second harmonic components of cladding modes.
Figure 15: TFBG frequency spectra and frequency response of the log-Gabor filter.
Figure 16: Derivatives of the envelopes calculated using the log-Gabor filter matched to the first harmonic of the optical spectra.
Figure 17: Adaptive matching of centre frequency of the log-Gabor filter.
Figure 18: Envelope derivatives of optical spectra processed by the adaptive log-Gabor filter.
Figure 19: Frequency response of the adaptive log-Gabor filter matched to frequency spectra of the optical spectra.
Figure 20: Envelope derivatives for an adaptive log-Gabor filter fitted to the second harmonic of the optical spectra.
Figure 21: Envelopes determined with a log-Gabor filter based on the first derivative of the TFBG transmission spectra.
Figure 22: Envelope derivatives for an adaptive log-Gabor filter fitted to the first harmonic based on the first derivative of the TFBG spectra.
Figure 23: Dependence of the cut-off wavelength shift on the SRI coefficient value for optical spectra processed by an adaptive log-Gabor filter.
Figure 24: Envelope derivatives for the first derivative of TFBG spectra processed by an adaptive log-Gabor filter fitted to the second harmonic.
Figure 25: Dependence of the cut-off wavelength shift on the SRI value.
22 pages, 3024 KiB  
Article
Augmenting Aquaculture Efficiency through Involutional Neural Networks and Self-Attention for Oplegnathus Punctatus Feeding Intensity Classification from Log Mel Spectrograms
by Usama Iqbal, Daoliang Li, Zhuangzhuang Du, Muhammad Akhter, Zohaib Mushtaq, Muhammad Farrukh Qureshi and Hafiz Abbad Ur Rehman
Animals 2024, 14(11), 1690; https://doi.org/10.3390/ani14111690 - 5 Jun 2024
Cited by 4 | Viewed by 1206
Abstract
Understanding the feeding dynamics of aquatic animals is crucial for aquaculture optimization and ecosystem management. This paper proposes a novel framework for analyzing fish feeding behavior based on a fusion of spectrogram-extracted features and a deep learning architecture. Raw audio waveforms are first transformed into Log Mel Spectrograms, and a fusion of features such as the Discrete Wavelet Transform, the Gabor filter, the Local Binary Pattern, and the Laplacian High Pass Filter, followed by a well-adapted deep model, is proposed to capture crucial spectral and temporal information that can help distinguish between the various forms of fish feeding behavior. The Involutional Neural Network (INN)-based deep learning model is used for classification, achieving an accuracy of up to 97% across various temporal segments. The proposed methodology is shown to be effective in accurately classifying the feeding intensities of Oplegnathus punctatus, enabling insights pertinent to aquaculture enhancement and ecosystem management. Future work may include additional feature extraction modalities and multi-modal data integration to further our understanding and contribute towards the sustainable management of marine resources. Full article
(This article belongs to the Special Issue Animal Health and Welfare in Aquaculture)
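A minimal sketch of the Log Mel Spectrogram front end described in the abstract, using librosa; the filename, sample rate, FFT length, hop size, and mel-band count are illustrative assumptions rather than the paper's acquisition settings.

```python
import numpy as np
import librosa

# Hypothetical feeding-sound clip; the real recordings come from the tank setup.
y, sr = librosa.load("feeding_clip.wav", sr=22050)

mel = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=1024,
                                     hop_length=512, n_mels=128)
log_mel = librosa.power_to_db(mel, ref=np.max)   # Log Mel Spectrogram in dB

# log_mel is a 2-D, image-like array; the texture descriptors named above
# (DWT, Gabor, LBP, Laplacian high-pass) and the INN classifier operate on it.
```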
Show Figures

Figure 1: The structure of the experimental system of recirculating aquaculture.
Figure 2: Audio waveforms captured from aquaculture: (a) Audio of None class, (b) Audio of Medium class, and (c) Audio of Strong class.
Figure 3: Conversion of audio waveforms to Log Mel Spectrograms (using magma colormap): (a) audio waveform, (b) corresponding short-term Fourier Transform, (c) Mel spectrogram, and (d) Log Mel Spectrogram.
Figure 4: Segmentation of audio waveforms and their corresponding Log Mel Spectrograms (using magma colormap): (a) Audio of None class, (b) Audio of Medium class, (c) Audio of Strong class, (d) Log Mel Spectrogram of None class, (e) Log Mel Spectrogram of Medium class, and (f) Log Mel Spectrogram of Strong class.
Figure 5: Discrete wavelet transform extracted from the corresponding Log Mel Spectrogram Images: (a) None, (b) Medium, and (c) Strong.
Figure 6: Gabor filter applied on Log Mel Spectrogram Images of each class: (a) None, (b) Medium, and (c) Strong.
Figure 7: Local Binary Pattern extracted from Log Mel Spectrogram Images of each class: (a) None, (b) Medium, and (c) Strong.
Figure 8: Laplacian High Pass filter applied on Log Mel Spectrogram Images of each class: (a) None, (b) Medium, and (c) Strong.
Figure 9: Combined Features extracted from LMS for a sample of 'Strong' class as an input to the model.
Figure 10: Involutional Neural Network: (a) involution layer, (b) involution layer with self-attention.
Figure A1: Confusion matrices: (a) Involutional Neural Network, (b) VGG16, (c) VGG19, (d) ResNet50, (e) Xception, (f) EfficientNet-b0, (g) InceptionNetV3, and (h) MobileNetV2.