[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (2,160)

Search Parameters:
Keywords = deep forest

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 8074 KiB  
Article
Automated Segmentation of Breast Cancer Focal Lesions on Ultrasound Images
by Dmitry Pasynkov, Ivan Egoshin, Alexey Kolchev, Ivan Kliouchkin, Olga Pasynkova, Zahraa Saad, Anis Daou and Esam Mohamed Abuzenar
Sensors 2025, 25(5), 1593; https://doi.org/10.3390/s25051593 - 5 Mar 2025
Abstract
Ultrasound (US) remains the main modality for the differential diagnosis of changes revealed by mammography. However, the US images themselves are subject to various types of noise and artifacts from reflections, which can worsen the quality of their analysis. Deep learning methods have [...] Read more.
Ultrasound (US) remains the main modality for the differential diagnosis of changes revealed by mammography. However, the US images themselves are subject to various types of noise and artifacts from reflections, which can worsen the quality of their analysis. Deep learning methods have a number of disadvantages, including the often insufficient substantiation of the model, and the complexity of collecting a representative training database. Therefore, it is necessary to develop effective algorithms for the segmentation, classification, and analysis of US images. The aim of the work is to develop a method for the automated detection of pathological lesions in breast US images and their segmentation. A method is proposed that includes two stages of video image processing: (1) searching for a region of interest using a random forest classifier, which classifies normal tissues, (2) selecting the contour of the lesion based on the difference in brightness of image pixels. The test set included 52 ultrasound videos which contained histologically proven suspicious lesions. The average frequency of lesion detection per frame was 91.89%, and the average accuracy of contour selection according to the IoU metric was 0.871. The proposed method can be used to segment a suspicious lesion. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

Figure 1
<p>Frames from two different ultrasound video sequences; the arrows indicate (<b>a</b>) histologically proven mucinous breast cancer; and (<b>b</b>) histologically proven ductal breast cancer.</p>
Full article ">Figure 2
<p>Block diagram of the proposed method for identifying the contour of a lesion in ultrasound video frames.</p>
Full article ">Figure 3
<p>(<b>a</b>,<b>c</b>) Original ultrasound images; (<b>b</b>,<b>d</b>) ultrasound images with marked tissues on them (skin, fat, fibrous tissue, glandular tissue, and artifacts).</p>
Full article ">Figure 4
<p>(<b>a</b>) Original ultrasound image; (<b>b</b>) result of tissue classification by the random forest classifier; (<b>c</b>) after applying the morphological dilation operation and median filter; (<b>d</b>) ground truth.</p>
Full article ">Figure 5
<p>(<b>a</b>) Rays drawn from the center of lesion <span class="html-italic">A</span>; (<b>b</b>) brightness values of the pixels <span class="html-italic">P<sub>i</sub></span> lying on the rays; (<b>c</b>) graphs of the brightness gradients Δ<span class="html-italic">P<sub>i</sub></span> of the pixels lying on the rays.</p>
Full article ">Figure 6
<p>(<b>a</b>) Points corresponding to the extrema of the brightness gradient on the rays; (<b>b</b>) constructed averaged cubic regression for them (solid thick yellow line).</p>
Full article ">Figure 7
<p>Results of tissue classification on frames from the ultrasound video sequences corresponding to <a href="#sensors-25-01593-f001" class="html-fig">Figure 1</a>. (<b>a</b>) Histologically proven mucinous breast carcinoma; (<b>b</b>) histologically proven ductal breast carcinoma from different video sequences of another patient; (<b>c</b>,<b>d</b>) ground truths.</p>
Full article ">Figure 7 Cont.
<p>Results of tissue classification on frames from the ultrasound video sequences corresponding to <a href="#sensors-25-01593-f001" class="html-fig">Figure 1</a>. (<b>a</b>) Histologically proven mucinous breast carcinoma; (<b>b</b>) histologically proven ductal breast carcinoma from different video sequences of another patient; (<b>c</b>,<b>d</b>) ground truths.</p>
Full article ">Figure 8
<p>The result of identifying unclassified objects in the frames of ultrasound video sequences corresponding to <a href="#sensors-25-01593-f001" class="html-fig">Figure 1</a>. (<b>a</b>,<b>b</b>) Show the highlighted objects after additional shape filtering, (<b>c</b>,<b>d</b>) show the rectangular areas circled around them, which will be the ROI.</p>
Full article ">Figure 9
<p>(<b>a</b>,<b>b</b>) Segmented contours of lesions using pixel intensity gradient detection along rays drawn from the center of gravity of the contour (yellow); (<b>c</b>,<b>d</b>) contours of the lesion outlined by the specialist physician (red). The images correspond to <a href="#sensors-25-01593-f001" class="html-fig">Figure 1</a>.</p>
Full article ">Figure 10
<p>Frame-by-frame processing of the video ((<b>left</b>) to (<b>right</b>), (<b>top</b>) to (<b>bottom</b>): frame 1, frame 10, frame 20, frame 30, frame 40, frame 50). In frames 40 and 50, the ROI was not detected due to the absence of a suspicious lesion.</p>
Full article ">
30 pages, 5634 KiB  
Article
Evaluating Ecosystem Service Trade-Offs and Recovery Dynamics in Response to Urban Expansion: Implications for Sustainable Management Strategies
by Mohammed J. Alshayeb
Sustainability 2025, 17(5), 2194; https://doi.org/10.3390/su17052194 - 3 Mar 2025
Viewed by 136
Abstract
Land use land cover (LULC) changes due to rapid urbanization pose critical challenges to sustainable development, particularly in arid and semi-arid regions like Saudi Arabia, where cities such as Abha are experiencing unprecedented expansion. Urban sprawl is accelerating environmental degradation, affecting key natural [...] Read more.
Land use land cover (LULC) changes due to rapid urbanization pose critical challenges to sustainable development, particularly in arid and semi-arid regions like Saudi Arabia, where cities such as Abha are experiencing unprecedented expansion. Urban sprawl is accelerating environmental degradation, affecting key natural resources such as vegetation, water bodies, and barren land. This study introduces an advanced machine learning (ML) and deep learning (DL)-based framework for high-accuracy LULC classification, urban sprawl quantification, and ecosystem service assessment, providing a more precise and scalable approach compared to traditional remote sensing techniques. A hybrid methodology combining ML models—Random Forest, Artificial Neural Networks, Gradient Boosting Machine, and LightGBM—with a 1D Convolutional Neural Network (CNN) was fine-tuned using grid search optimization to enhance classification accuracy. The integration of deep learning improves feature extraction and classification consistency, achieving an AUC of 0.93 for Dense Vegetation and 0.82 for Cropland, outperforming conventional classification methods. The study also applies the Markov transition model to project land cover changes, offering a probabilistic understanding of urban expansion trends and ecosystem dynamics, providing a significant improvement over static LULC assessments by quantifying transition probabilities and predicting future land cover transformations. The results reveal that urban areas in Abha expanded by 120.74 km2 between 2014 and 2023, with barren land decreasing by 557.09 km2 and cropland increasing by 205.14 km2. The peak ecosystem service value (ESV) loss was recorded at USD 125,662.7 between 2017 and 2020, but subsequent land management efforts improved ESV to USD 96,769.5 by 2023. The resilience and recovery of natural land cover types, particularly barren land (44,163 km2 recovered by 2023), indicate the potential for targeted restoration strategies. This study advances urban sustainability research by integrating state-of-the-art deep learning models with Markov-based land change predictions, enhancing the accuracy and predictive capability of LULC assessments. The findings highlight the need for proactive land management policies to mitigate the adverse effects of urban sprawl and promote sustainable ecosystem service recovery. The methodological advancements presented in this study provide a scalable and adaptable framework for future urbanization impact assessments, particularly in rapidly developing regions. Full article
Show Figures

Figure 1

Figure 1
<p>Study area.</p>
Full article ">Figure 2
<p>Training and validation loss curves for a 1D CNN model.</p>
Full article ">Figure 3
<p>Confusion matrices of ML and DL models for LULC classification evaluating RF, ANN, GBM, LightGBM, and 1D CNN models, highlighting classification accuracy and misclassification trends across land cover classes.</p>
Full article ">Figure 4
<p>ROC curves and AUC values for Random Forest, ANN, GBM, LightGBM, and 1D CNN models for six land cover classes.</p>
Full article ">Figure 5
<p>Spatiotemporal distribution of LULC classes for the years (<b>a</b>) 2014, (<b>b</b>) 2017, (<b>c</b>) 2020, and (<b>d</b>) 2023.</p>
Full article ">Figure 6
<p>Land cover area for different classes for the years 2014, 2017, 2020, and 2023.</p>
Full article ">Figure 7
<p>Probability-based Markov transition matrices depicting the dynamic land cover changes between 2014–2017, 2017–2020, 2020–2023, and overall, for 2014–2023, quantifying transformation trends among LULC categories.</p>
Full article ">Figure 8
<p>Temporal analysis (2014 to 2023) showing trends in urban growth metrics over time, including urban growth rate (<b>top left</b>), Shannon’s entropy (<b>top right</b>), urban fragmentation (<b>bottom left</b>), and urban edge growth (<b>bottom right</b>), highlighting spatial and structural changes in urban expansion.</p>
Full article ">
15 pages, 2161 KiB  
Article
Enhancing Lymph Node Metastasis Risk Prediction in Early Gastric Cancer Through the Integration of Endoscopic Images and Real-World Data in a Multimodal AI Model
by Donghoon Kang, Han Jo Jeon, Jie-Hyun Kim, Sang-Il Oh, Ye Seul Seong, Jae Young Jang, Jung-Wook Kim, Joon Sung Kim, Seung-Joo Nam, Chang Seok Bang and Hyuk Soon Choi
Cancers 2025, 17(5), 869; https://doi.org/10.3390/cancers17050869 - 3 Mar 2025
Viewed by 124
Abstract
Objectives: The accurate prediction of lymph node metastasis (LNM) and lymphovascular invasion (LVI) is crucial for determining treatment strategies for early gastric cancer (EGC). This study aimed to develop and validate a deep learning-based clinical decision support system (CDSS) to predict LNM including [...] Read more.
Objectives: The accurate prediction of lymph node metastasis (LNM) and lymphovascular invasion (LVI) is crucial for determining treatment strategies for early gastric cancer (EGC). This study aimed to develop and validate a deep learning-based clinical decision support system (CDSS) to predict LNM including LVI in EGC using real-world data. Methods: A deep learning-based CDSS was developed by integrating endoscopic images, demographic data, biopsy pathology, and CT findings from the data of 2927 patients with EGC across five institutions. We compared a transformer-based model to an image-only (basic convolutional neural network (CNN)) model and a multimodal classification (CNN with random forest) model. Internal testing was conducted on 449 patients from the five institutions, and external validation was performed on 766 patients from two other institutions. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), probability density function, and clinical utility curve. Results: In the training, internal, and external validation cohorts, LNM/LVI was observed in 379 (12.95%), 49 (10.91%), 15 (9.09%), and 41 (6.82%) patients, respectively. The transformer-based model achieved an AUC of 0.9083, sensitivity of 85.71%, and specificity of 90.75%, outperforming the CNN (AUC 0.5937) and CNN with random forest (AUC 0.7548). High sensitivity and specificity were maintained in internal and external validations. The transformer model distinguished 91.8% of patients with LNM in the internal validation dataset, and 94.0% and 89.1% in the two different external datasets. Conclusions: We propose a deep learning-based CDSS for predicting LNM/LVI in EGC by integrating real-world data, potentially guiding treatment strategies in clinical settings. Full article
(This article belongs to the Collection Artificial Intelligence and Machine Learning in Cancer Research)
Show Figures

Graphical abstract

Graphical abstract
Full article ">Figure 1
<p>Flowchart of the study design. CDSS—clinical decision support system; EGC—early gastric cancer; LNM—lymph node metastasis; LVI—lymphovascular invasion.</p>
Full article ">Figure 2
<p>Mechanisms of models. (<b>a</b>) Basic CNN model; (<b>b</b>) CNN with random forest model; (<b>c</b>) transformer-based model. CNN: convolutional neural network.</p>
Full article ">Figure 3
<p>The training convergence process of the transformer-based model in recall scores.</p>
Full article ">Figure 4
<p>The attention ranks inferred by the transformer-based model. Each row is images from a patient, whereas each column is the attention rank between images. The heatmap was extracted by using the Grad-CAM method. In the heatmap, the red color means high attention for the 2D image space. (<b>a</b>) A case predicted as positive by the transformer-based model: A 64-year-old male with AWD and a 13 mm lesion, showing normal findings on CT. The actual result was LVI-positive, which matched the prediction. (<b>b</b>) A case predicted as positive: A 52-year-old male with SRC and a 15 mm lesion, showing fold thickness on CT. The actual result was LNM-positive, which matched the prediction. (<b>c</b>) A case predicted as negative: A 56-year-old male with AWD and a 22 mm lesion, showing normal findings on CT. The actual results were LNM-negative and LVI-negative, which matched the prediction.</p>
Full article ">Figure 5
<p>Probability density function and clinical utility curve. (<b>a</b>) Probability density function. (<b>b</b>) Clinical utility curves to decide clinical utility thresholds suggested 36.2% as a threshold probability for guiding the diagnosis of lymph node metastasis (LNM) or lymphovascular invasion (LVI), which could distinguish approximately 91.8% of patients with LNM/LVI, 80.0% (external validation set 1), and 85.4% (external validation set 2).</p>
Full article ">
36 pages, 1811 KiB  
Review
Progress and Limitations in Forest Carbon Stock Estimation Using Remote Sensing Technologies: A Comprehensive Review
by Weifeng Xu, Yaofei Cheng, Mengyuan Luo, Xuzhi Mai, Wenhuan Wang, Wei Zhang and Yinghui Wang
Forests 2025, 16(3), 449; https://doi.org/10.3390/f16030449 - 2 Mar 2025
Viewed by 209
Abstract
Forests play a key role in carbon sequestration and oxygen production. They significantly contribute to carbon peaking and carbon neutrality goals. Accurate estimation of forest carbon stocks is essential for a precise understanding of the carbon sequestration capacity of forest ecosystems. Remote sensing [...] Read more.
Forests play a key role in carbon sequestration and oxygen production. They significantly contribute to carbon peaking and carbon neutrality goals. Accurate estimation of forest carbon stocks is essential for a precise understanding of the carbon sequestration capacity of forest ecosystems. Remote sensing technology, with its wide observational coverage, strong timeliness, and low cost, is essential for carbon stock estimation research. However, challenges in data acquisition and processing include data variability, signal saturation in dense forests, and environmental limitations. These factors hinder accurate carbon stock estimation. This review summarizes the current state of research on forest carbon stock estimation from two aspects, namely remote sensing data and estimation methods, highlighting both the advantages and the limitations of various data sources and models. It also explores technological innovations and cutting-edge research in the field, focusing on deep learning techniques, optical vegetation thickness estimation methods, and the impact of forest–climate interactions on carbon stock estimation. Finally, the review discusses the current challenges in the field, including issues related to remote sensing data quality, model adaptability, forest stand complexity, and uncertainties in the estimation process. Based on these challenges, the paper looks ahead to future trends, proposing potential technological breakthroughs and pathways. The aim of this study is to provide theoretical support and methodological guidance for researchers in related fields. Full article
(This article belongs to the Section Forest Meteorology and Climate Change)
Show Figures

Figure 1

Figure 1
<p>Comparison of traditional forest survey methods and remote sensing estimation methods for forest carbon storage mapping.</p>
Full article ">Figure 2
<p>Trends, geographic distribution of the authors or research institutions, and source titles of publications on forest carbon stock estimation using remote sensing. (<b>a</b>) Annual number of publications and citations from 2000 to 2024, showing a significant increase in both over time. (<b>b</b>) Distribition of publications by country/region, highlighting major contributing countries such as the USA and China. (<b>c</b>) Top publication sources, listing the most frequently appearing journals in this research field, with “Remote Sensing” leading the count.</p>
Full article ">Figure 3
<p>Frequency of remote sensing data sources used in forest carbon stock estimation studies.</p>
Full article ">
30 pages, 1605 KiB  
Article
From Misinformation to Insight: Machine Learning Strategies for Fake News Detection
by Despoina Mouratidis, Andreas Kanavos and Katia Kermanidis
Information 2025, 16(3), 189; https://doi.org/10.3390/info16030189 - 28 Feb 2025
Viewed by 189
Abstract
In the digital age, the rapid proliferation of misinformation and disinformation poses a critical challenge to societal trust and the integrity of public discourse. This study presents a comprehensive machine learning framework for fake news detection, integrating advanced natural language processing techniques and [...] Read more.
In the digital age, the rapid proliferation of misinformation and disinformation poses a critical challenge to societal trust and the integrity of public discourse. This study presents a comprehensive machine learning framework for fake news detection, integrating advanced natural language processing techniques and deep learning architectures. We rigorously evaluate a diverse set of detection models across multiple content types, including social media posts, news articles, and user-generated comments. Our approach systematically compares traditional machine learning classifiers (Naïve Bayes, SVMs, Random Forest) with state-of-the-art deep learning models, such as CNNs, LSTMs, and BERT, while incorporating optimized vectorization techniques, including TF-IDF, Word2Vec, and contextual embeddings. Through extensive experimentation across multiple datasets, our results demonstrate that BERT-based models consistently achieve superior performance, significantly improving detection accuracy in complex misinformation scenarios. Furthermore, we extend the evaluation beyond conventional accuracy metrics by incorporating the Matthews Correlation Coefficient (MCC) and Receiver Operating Characteristic–Area Under the Curve (ROC–AUC), ensuring a robust and interpretable assessment of model efficacy. Beyond technical advancements, we explore the ethical implications of automated misinformation detection, addressing concerns related to censorship, algorithmic bias, and the trade-off between content moderation and freedom of expression. This research not only advances the methodological landscape of fake news detection but also contributes to the broader discourse on safeguarding democratic values, media integrity, and responsible AI deployment in digital environments. Full article
(This article belongs to the Special Issue Information Extraction and Language Discourse Processing)
Show Figures

Graphical abstract

Graphical abstract
Full article ">Figure 1
<p>Comparative architecture and performance of machine learning models.</p>
Full article ">Figure 2
<p>Standard deviation of the Mean F1 and Val F1 metrics for each vectorization technique.</p>
Full article ">Figure 3
<p>ROC-AUC comparison across models.</p>
Full article ">Figure 4
<p>SHAP feature importance for fake news across models.</p>
Full article ">Figure 5
<p>SHAP feature importance for fake news across datasets.</p>
Full article ">Figure 6
<p>LIME interpretability for fake news across models.</p>
Full article ">Figure 7
<p>LIME interpretability for fake news across datasets.</p>
Full article ">
19 pages, 21661 KiB  
Article
U-SwinFusionNet: High Resolution Snow Cover Mapping on the Tibetan Plateau Based on FY-4A
by Xi Kan, Xu Liu, Zhou Zhou, Jing Wang, Linglong Zhu, Lei Gong and Jiangeng Wang
Water 2025, 17(5), 706; https://doi.org/10.3390/w17050706 - 28 Feb 2025
Viewed by 130
Abstract
The Qinghai–Tibet Plateau (QTP), one of China’s most snow-rich regions, has an extremely fragile ecosystem, with drought being the primary driver of ecological degradation. Given that the water resources in this region predominantly exist in the form of snow, high-spatiotemporal-resolution snow mapping is [...] Read more.
The Qinghai–Tibet Plateau (QTP), one of China’s most snow-rich regions, has an extremely fragile ecosystem, with drought being the primary driver of ecological degradation. Given that the water resources in this region predominantly exist in the form of snow, high-spatiotemporal-resolution snow mapping is essential for understanding snow distribution and managing snow water resources effectively. However, although FY-4A/AGRI is capable of obtaining wide-area remote sensing data, only the first to third bands have a resolution of 1 km, which greatly limits its ability to produce high-resolution snow maps. This study proposes U-SwinFusionNet (USFNet), a deep learning-based snow cover retrieval algorithm that leverages the multi-scale advantages of FY-4A/AGRI remote sensing data in the shortwave infrared and visible bands. By integrating 1 km and 2 km resolution remote sensing imagery with auxiliary terrain information, USFNet effectively enhances snow cover mapping accuracy. The proposed model innovatively combines Swin Transformer and convolutional neural networks (CNNs) to capture both global contextual information and local spatial details. Additionally, an Attention Feature Fusion Module (AFFM) is introduced to align and integrate features from different modalities through an efficient attention mechanism, while the Feature Complementation Module (FCM) facilitates interactions between the encoded and decoded features. As a result, USFNet produces snow cover maps with a spatial resolution of 1 km. Experimental comparisons with Artificial Neural Networks (ANNs), Random Forest (RF), U-Net, and ResNet-FSC demonstrate that USFNet exhibits superior robustness, enhanced snow cover continuity, and lower error rates. The model achieves a correlation coefficient of 0.9126 and an R2 of 0.7072. Compared to the MOD10A1 snow product, USFNet demonstrates an improved sensitivity to fragmented and low-snow-cover areas while ensuring more natural snow boundary transitions. Full article
Show Figures

Figure 1

Figure 1
<p>Overall structure of the USFNet.</p>
Full article ">Figure 2
<p>Schematic diagram of the study area. The selected regions outlined represent the areas included in the dataset after filtering.</p>
Full article ">Figure 3
<p>Intermediate image of the FSC True Value Labeling process.</p>
Full article ">Figure 4
<p>Structural diagrams of various modules in the proposed model. (<b>a</b>) CSFEM structure. (<b>b</b>) AFFM structure. (<b>c</b>) FSM structure.</p>
Full article ">Figure 5
<p>Loss curve. The “Train loss with 480 TIF” and “Validation loss with 480 TIF” represent the train and validation loss curves over the entire train set. The “Train loss with 240 TIF” and “Validation loss with 240 TIF” correspond to the train and validation loss curves computed on half of the train set.</p>
Full article ">Figure 6
<p>Comparison of inversion results from various models. The highlighted areas of the figure show where the differences between the methods are more significant.</p>
Full article ">Figure 7
<p>Comparison between the FSC estimation results of the USFNet and the MOD10A1 snow product. The highlighted area shows the significant difference between the MOD10A1 FSC and FY-4A/AGRI FSC. The spatial resolution of FY-4A, Landsat8 FSC, MOD10A1 FSC, and OUR FSC is 1 km, and the MOD10A1 NDSI has a native resolution of 500 m.</p>
Full article ">Figure 8
<p>FY-4A/AGRI remote sensing images of the corresponding area in <a href="#water-17-00706-f007" class="html-fig">Figure 7</a> from 9:00 a.m. to 2:00 p.m.</p>
Full article ">
26 pages, 4362 KiB  
Article
EQLC-EC: An Efficient Voting Classifier for 1D Mass Spectrometry Data Classification
by Lin Guo, Yinchu Wang, Zilong Liu, Fengyi Zhang, Wei Zhang and Xingchuang Xiong
Electronics 2025, 14(5), 968; https://doi.org/10.3390/electronics14050968 - 28 Feb 2025
Viewed by 122
Abstract
Mass spectrometry (MS) data present challenges for machine learning (ML) classification due to their high dimensionality, complex feature distributions, batch effects, and intensity discrepancies, often hindering model generalization and efficiency. To address these issues, this study introduces the Efficient Quick 1D Lite Convolutional [...] Read more.
Mass spectrometry (MS) data present challenges for machine learning (ML) classification due to their high dimensionality, complex feature distributions, batch effects, and intensity discrepancies, often hindering model generalization and efficiency. To address these issues, this study introduces the Efficient Quick 1D Lite Convolutional Neural Network (CNN) Ensemble Classifier (EQLC-EC), integrating 1D convolutional networks with reshape layers and dual voting mechanisms for enhanced feature representation and classification performance. Validation was performed on five publicly available MS datasets, each featured in high-impact publications. EQLC-EC underwent comprehensive evaluation against classical machine learning (ML) models (e.g., support vector machine (SVM), random forest) and the leading deep learning methods reported in these studies. EQLC-EC demonstrated dataset-specific improvements, including enhanced classification accuracy (1–5% increase) and reduced standard deviation (1–10% reduction). Performance differences between soft and hard voting mechanisms were negligible (<1% variation in accuracy and standard deviation). EQLC-EC presents a powerful and efficient tool for MS data analysis with potential applications across metabolomics and proteomics. Full article
Show Figures

Figure 1

Figure 1
<p>One-Dimensional Mass Spectrometry Data: Sources, Characteristics, and Challenges: (<b>a</b>) This figure shows the sources of mass spectrometry data from various fields, including environmental science, metabolomics, single-cell analysis, and medicine. These sources generate a large amount of diverse one-dimensional mass spectrometry data, encompassing a wide range of samples from aerosol particles to blood and urine. (<b>b</b>) One-dimensional mass spectrum and its three characteristics: Extreme range, highly scattered, and high-dimensional.</p>
Full article ">Figure 2
<p>Sample distribution of the development dataset CHD. (<b>a</b>) Distribution of CHD patient samples and control group samples across the four batches. The sectors corresponding to CHD patients are shown in their original colors, while the sectors representing the control group samples in the corresponding batch are shown with 40% transparency. Different batches represent variations in experimental conditions or sample sources. (<b>b</b>) The CHD dataset’s training set consists of the sum of data from two randomly selected batches, and the test set consists of one of the other two batches. There are twelve possible data partitioning methods, and this figure shows two of them. Split 1’s training set is the sum of all data from batch 1 and batch 2, and its test set is the data from batch 3. Split 2’s training set is the sum of data from batch 3 and batch 4, and its test set is the data from batch 2. These two partitioning methods are examples of random sampling. The sector colors in (<b>a</b>,<b>b</b>) are consistent to facilitate understanding of the data partitioning method.</p>
Full article ">Figure 3
<p>Sample distributions of the validation datasets: (<b>a</b>) MI, this dataset shares a similar structure with the CHD dataset in <a href="#electronics-14-00968-f002" class="html-fig">Figure 2</a>, featuring three batches. Data partitioning also follows the same approach: The sum of two batches forms the training set, while the remaining batch constitutes the test set, resulting in three possible partitioning methods. This dataset is used to evaluate the model’s applicability to datasets exhibiting batch effects. (<b>b</b>) ICC, the test set accounts for 20% of the total data. (<b>c</b>) HIP_CER, the test set accounts for 20% of the total data. (<b>d</b>) TOMATO, the test set accounts for 29% of the total data. Since each sample in the TOMATO dataset underwent 10 data acquisitions, a 30% split was used to create the test set, yielding 470 data points, in order to prevent data leakage. This approach ensures that the test set’s sample distribution closely aligns with that of the entire dataset.</p>
Full article ">Figure 4
<p>Network architecture of the EQLC base model. The input is One-Dimensional Mass Spectrometry Input (1DMS INPUT). Key components include convolutional layers Convolutional Layer 1 (env1/conv1) and Convolutional Layer 2 (env2/conv2), followed by max-pooling layers Max Pooling Layer 1 (maxpool-1) and Max Pooling Layer 2 (maxpool-2), and Rectified Linear Unit (ReLU) activation functions. Dropout 0.2 indicates a Dropout layer with a 0.2 dropout rate for regularization. Feature processing concludes with a flatten layer and Fully Connected layers Fully Connected Layer 1 (FC1) and Fully Connected Layer 2 (FC2). The output Classification employs Hard Voting (HardVoting) and Soft Voting (SoftVoting) for ensemble decisions, providing class label and class probability (proba), respectively.</p>
Full article ">Figure 5
<p>Structure of the EQLC-EC ensemble model.</p>
Full article ">Figure 6
<p>Test results corresponding to 12 dataset partitioning methods in the CHD dataset. Taking 12_3 as an example, it represents that the training set is composed of the sum of batch 1 and batch 2, and the test set is batch 3. Box plots show the F1-scores of two voting methods, EQLC and EQLC-EC. Each plot contains 100 F1-scores, derived from 100 test results of the test set reconstructed by the bootstrap method. The narrower the box body of the box plot, the more stable the model; the higher the horizontal line of the median, the better the classification performance of the model. Bar charts show the standard deviation of 100 F1-scores. The shorter the bar, the better the stability of the model.</p>
Full article ">Figure 7
<p>MI dataset results. Test result comparison between EQLC-EC and methods reported in high-impact studies. The three bar graphs represent algorithm performance variability across test sets, with shaded areas included for visual clarity only. Dashed lines denote the mean values reported in the literature, while solid circles and hollow circles represent EQLC-EC’s mean performance.</p>
Full article ">Figure 8
<p>Box plots depict the F1-score performance comparison of EQLC-EC with other methods on the ICC dataset. The blue XGBoost method represents the current state-of-the-art (SOTA) method. Red boxes denote the two voting modes of EQLC-EC. Bar charts illustrate the standard deviation of F1-scores for EQLC-EC and other methods on the ICC dataset.</p>
Full article ">Figure 9
<p>Box plots depict the F1-score performance comparison of EQLC-EC with other methods on the HIP_CER dataset. The blue XGBoost method represents the current state-of-the-art (SOTA) method. Red boxes denote the two voting modes of EQLC-EC. Bar charts illustrate the standard deviation of F1-scores for EQLC-EC and other methods on the HIP_CER dataset.</p>
Full article ">Figure 10
<p>For the TOMATO dataset, the state-of-the-art (SOTA) method in the literature is Random Forest, and only its best classification accuracy is available, which is represented as a dashed line in the figure. Therefore, the comparison of methods on the TOMATO dataset can only be conducted through accuracy. Box plots represent accuracy, and bar charts represent the standard deviation of accuracy.</p>
Full article ">
21 pages, 2600 KiB  
Article
A Particle Swarm Optimization-Based Ensemble Broad Learning System for Intelligent Fault Diagnosis in Safety-Critical Energy Systems with High-Dimensional Small Samples
by Jiasheng Yan, Yang Sui and Tao Dai
Mathematics 2025, 13(5), 797; https://doi.org/10.3390/math13050797 - 27 Feb 2025
Viewed by 197
Abstract
Intelligent fault diagnosis (IFD) plays a crucial role in reducing maintenance costs and enhancing the reliability of safety-critical energy systems (SCESs). In recent years, deep learning-based IFD methods have achieved high fault diagnosis accuracy extracting implicit higher-order correlations between features. However, the excessive [...] Read more.
Intelligent fault diagnosis (IFD) plays a crucial role in reducing maintenance costs and enhancing the reliability of safety-critical energy systems (SCESs). In recent years, deep learning-based IFD methods have achieved high fault diagnosis accuracy extracting implicit higher-order correlations between features. However, the excessive long training time of deep learning models conflicts with the requirements of real-time analysis for IFD, hindering their further application in practical industrial environments. To address the aforementioned challenge, this paper proposes an innovative IFD method for SCES that combines the particle swarm optimization (PSO) algorithm and the ensemble broad learning system (EBLS). Specifically, the broad learning system (BLS), known for its low time complexity and high classification accuracy, is adopted as an alternative to deep learning for fault diagnosis in SCES. Furthermore, EBLS is designed to enhance model stability and classification accuracy with high-dimensional small samples by incorporating the random forest (RF) algorithm and an ensemble strategy into the traditional BLS framework. In order to reduce the computational cost of the EBLS, which is constrained by the selection of its hyperparameters, the PSO algorithm is employed to optimize the hyperparameters of the EBLS. Finally, the model is validated through simulated data from a complex nuclear power plant (NPP). Numerical experiments reveal that the proposed method significantly improved the diagnostic efficiency while maintaining high accuracy. In summary, the proposed approach shows great promise for boosting the capabilities of the IFD models for SCES. Full article
Show Figures

Figure 1

Figure 1
<p>Architecture of BLS.</p>
Full article ">Figure 2
<p>Computational flowchart of BLS.</p>
Full article ">Figure 3
<p>Architecture of the EBLS.</p>
Full article ">Figure 4
<p>Flowchart of the PSO-EBLS.</p>
Full article ">Figure 5
<p>Personal computer transient analyzer.</p>
Full article ">
29 pages, 13392 KiB  
Article
Enhanced Data-Driven Machine Learning Models for Predicting Total Organic Carbon in Marine–Continental Transitional Shale Reservoirs
by Sizhong Peng, Congjun Feng, Zhen Qiu, Qin Zhang, Wen Liu and Wanli Gao
Sustainability 2025, 17(5), 2048; https://doi.org/10.3390/su17052048 - 27 Feb 2025
Viewed by 157
Abstract
Natural gas, as a sustainable and cleaner energy source, still holds a crucial position in the energy transition stage. In shale gas exploration, total organic carbon (TOC) content plays a crucial role, with log data proving beneficial in predicting total organic carbon content [...] Read more.
Natural gas, as a sustainable and cleaner energy source, still holds a crucial position in the energy transition stage. In shale gas exploration, total organic carbon (TOC) content plays a crucial role, with log data proving beneficial in predicting total organic carbon content in shale reservoirs. However, in complex coal-bearing layers like the marine–continental transitional Shanxi Formation, traditional prediction methods exhibit significant errors. Therefore, this study proposes an advanced, cost- and time-saving deep learning approach to predict TOC in marine–continental transitional shale. Five well log records from the study area were used to evaluate five machine learning models: K-Nearest Neighbors (KNNs), Random Forest (RF), Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGB), and Deep Neural Network (DNN). The predictive results were compared with conventional methods for accurate TOC predictions. Through K-fold cross-validation, the ML models showed superior accuracy over traditional models, with the DNN model displaying the lowest root mean square error (RMSE) and mean absolute error (MAE). To enhance prediction accuracy, δR was integrated as a new parameter into the ML models. Comparative analysis revealed that the improved DNN-R model reduced MAE and RMSE by 57.1% and 70.6%, respectively, on the training set, and by 59.5% and 72.5%, respectively, on the test set, compared to the original DNN model. The Williams plot and permutation importance confirmed the reliability and effectiveness of the enhanced DNN-R model. The results indicate the potential of machine learning technology as a valuable tool for predicting crucial parameters, especially in marine–continental transitional shale reservoirs lacking sufficient core samples and relying solely on basic well-logging data, signifying its importance for effective shale gas assessment and development. Full article
(This article belongs to the Topic Recent Advances in Diagenesis and Reservoir 3D Modeling)
Show Figures

Figure 1

Figure 1
<p>Flow chart of data intelligence paradigms to predict TOC in this study.</p>
Full article ">Figure 2
<p>Geological context. (<b>a</b>) Location of the study area in the Ordos Basin, China. (<b>b</b>) Stratigraphic column of the study area. (<b>c</b>) Simplified geological map of the study area with the locations of well logs.</p>
Full article ">Figure 3
<p>Pearson correlation heatmap between logging curves and TOC of the dataset.</p>
Full article ">Figure 4
<p>Well-logging curves evaluated. (<b>A</b>) Well DJ3-4 (depth: 2166–2162 m). (<b>B</b>) Well DJ53 (depth: 1918–1948 m).</p>
Full article ">Figure 5
<p>Models developed for TOC prediction of (<b>a</b>) Random Forest, (<b>b</b>) Gradient Boosting Decision Tree, (<b>c</b>) Extreme Gradient Boosting, and (<b>d</b>) Deep Neural Network.</p>
Full article ">Figure 6
<p>Schematic diagram of K-fold cross validation.</p>
Full article ">Figure 7
<p>TOC prediction errors for four models.</p>
Full article ">Figure 8
<p>Cross-plots showing the predicted and measured TOC from the testing and validation sets by one conventional method and five ML models.</p>
Full article ">Figure 9
<p>TOC prediction errors for improved models.</p>
Full article ">Figure 10
<p>Cross-plots showing the predicted and measured TOC from the testing and validation sets by five improved ML models.</p>
Full article ">Figure 11
<p>Taylor chart for the measured and predicted TOC in improved models.</p>
Full article ">Figure 12
<p>Comparison of MAE, RMSE, MRE, and R<sup>2</sup> values between the original model and the model with the inclusion of δR applied to all models for predicting TOC.</p>
Full article ">Figure 13
<p>Comparison of the measured TOC values with the predicted TOC values using different models for testing well H15.</p>
Full article ">Figure 14
<p>Comparison of the measured TOC values with the predicted TOC values using different models for testing well H17.</p>
Full article ">Figure 15
<p>Applicable verification for DNN-R model based on a Williams plot.</p>
Full article ">Figure 16
<p>Permutation importance of the input features for the predictive performance of the DNN-R model.</p>
Full article ">Figure 17
<p>(<b>a</b>) TOC distribution contour map of Shan<sub>2</sub><sup>3</sup> in the study area. (<b>b</b>) TOC distribution contour map of Shan<sub>2</sub><sup>3</sup>-3 sub-bed in the study area. (<b>c</b>) TOC distribution contour map of Shan<sub>2</sub><sup>3</sup>-2 sub-bed in the study area. (<b>d</b>) TOC distribution contour map of Shan<sub>2</sub><sup>3</sup>-1 sub-bed in the study area.</p>
Full article ">Figure 18
<p>Schematic diagram of important parameters in marine–continental transitional shales gas production.</p>
Full article ">
21 pages, 1277 KiB  
Article
HeartEnsembleNet: An Innovative Hybrid Ensemble Learning Approach for Cardiovascular Risk Prediction
by Syed Ali Jafar Zaidi, Attia Ghafoor, Jun Kim, Zeeshan Abbas and Seung Won Lee
Healthcare 2025, 13(5), 507; https://doi.org/10.3390/healthcare13050507 - 26 Feb 2025
Viewed by 175
Abstract
Background: Cardiovascular disease (CVD) is a prominent determinant of mortality, accounting for 17 million lives lost across the globe each year. This underscores its severity as a critical health issue. Extensive research has been undertaken to refine the forecasting of CVD in patients [...] Read more.
Background: Cardiovascular disease (CVD) is a prominent determinant of mortality, accounting for 17 million lives lost across the globe each year. This underscores its severity as a critical health issue. Extensive research has been undertaken to refine the forecasting of CVD in patients using various supervised, unsupervised, and deep learning approaches. Methods: This study presents HeartEnsembleNet, a novel hybrid ensemble learning model that integrates multiple machine learning (ML) classifiers for CVD risk assessment. The model is evaluated against six classical ML classifiers, including support vector machine (SVM), gradient boosting (GB), decision tree (DT), logistic regression (LR), k-nearest neighbor (KNN), and random forest (RF). Additionally, we compare HeartEnsembleNet with Hybrid Random Forest Linear Models (HRFLM) and ensemble techniques including stacking and voting. Results: Employing a dataset of 70,000 cardiac patients with 12 clinical attributes, our proposed model achieves a notable accuracy of 92.95% and a precision of 93.08%. Conclusions: These results highlight the effectiveness of hybrid ensemble learning in enhancing CVD risk prediction, offering a promising framework for clinical decision support. Full article
(This article belongs to the Special Issue Artificial Intelligence in Healthcare: Opportunities and Challenges)
Show Figures

Figure 1

Figure 1
<p>Holistic structural workflow of the innovative approach, HeartEnsembleNet, for cardiovascular disease detection.</p>
Full article ">Figure 2
<p>Architectural analysis of novel feature selection approach presented for cardiac failure diagnosis.</p>
Full article ">Figure 3
<p>Skewness and Kurtosis evaluation for normality assessment for cardiovascular disease dataset features.</p>
Full article ">Figure 4
<p>Outlier handling for data quality and model performance in cardiovascular disease dataset.</p>
Full article ">Figure 5
<p>Pair plot of continuous features in the cardiovascular disease datase.</p>
Full article ">Figure 6
<p>Correlation heatmap of numerical features in the cardiovascular disease dataset.</p>
Full article ">Figure 7
<p>Classical machine learning approach performance analysis.</p>
Full article ">Figure 8
<p>Performance analysis of stacking and voting classifiers.</p>
Full article ">Figure 9
<p>Holistic analysis of Hybrid Random Forest Linear Model.</p>
Full article ">Figure 10
<p>Ensemble voting classifier performance analysis.</p>
Full article ">Figure 11
<p>Ensemble stacking classifier performance aanalysis.</p>
Full article ">Figure 12
<p>Comparative analysis with SOTA techniques.</p>
Full article ">
16 pages, 86590 KiB  
Article
Automated Detection of Araraucaria angustifolia (Bertol.) Kuntze in Urban Areas Using Google Earth Images and YOLOv7x
by Mauro Alessandro Karasinski, Ramon de Sousa Leite, Emmanoella Costa Guaraná, Evandro Orfanó Figueiredo, Eben North Broadbent, Carlos Alberto Silva, Erica Kerolaine Mendonça dos Santos, Carlos Roberto Sanquetta and Ana Paula Dalla Corte
Remote Sens. 2025, 17(5), 809; https://doi.org/10.3390/rs17050809 - 25 Feb 2025
Viewed by 377
Abstract
This study addresses the urgent need for effective methods to monitor and conserve Araucaria angustifolia, a critically endangered species of immense ecological and cultural significance in southern Brazil. Using high-resolution satellite images from Google Earth, we apply the YOLOv7x deep learning model [...] Read more.
This study addresses the urgent need for effective methods to monitor and conserve Araucaria angustifolia, a critically endangered species of immense ecological and cultural significance in southern Brazil. Using high-resolution satellite images from Google Earth, we apply the YOLOv7x deep learning model to detect this species in two distinct urban contexts in Curitiba, Paraná: isolated trees across the urban landscape and A. angustifolia individuals within forest remnants. Data augmentation techniques, including image rotation, hue and saturation adjustments, and mosaic augmentation, were employed to increase the model’s accuracy and robustness. Through a 5-fold cross-validation, the model achieved a mean Average Precision (AP) of 90.79% and an F1-score of 88.68%. Results show higher detection accuracy in forest remnants, where the homogeneous background of natural landscapes facilitated the identification of trees, compared to urban areas where complex visual elements like building shadows presented challenges. To reduce false positives, especially misclassifications involving palm species, additional annotations were introduced, significantly enhancing performance in urban environments. These findings highlight the potential of integrating remote sensing with deep learning to automate large-scale forest inventories. Furthermore, the study highlights the broader applicability of the YOLOv7x model for urban forestry planning, offering a cost-effective solution for biodiversity monitoring. The integration of predictive data with urban forest maps reveals a spatial correlation between A. angustifolia density and the presence of forest fragments, suggesting that the preservation of these areas is vital for the species’ sustainability. The model’s scalability also opens the door for future applications in ecological monitoring across larger urban areas. As urban environments continue to expand, understanding and conserving key species like A. angustifolia is critical for enhancing biodiversity, resilience, and addressing climate change. Full article
Show Figures

Figure 1

Figure 1
<p>Location of the study area in the city of Curitiba, Paraná, Brazil. The highlighted neighborhoods (Batel, Centro, Jardim Botânico, Jardim das Américas, Rebouças, and Santa Felicidade) were used to train and test the YOLOv7x model. The gray area indicates regions where the available images did not have the same quality as the others and, therefore, were not included in the study.</p>
Full article ">Figure 2
<p>Components of a bounding box. (bx, by) represent the X and Y coordinates of the center of the bounding box; w represents the width and h the height of the bounding box.</p>
Full article ">Figure 3
<p>Learning curve performance of YOLOv7x in the detection of <span class="html-italic">A. angustifolia</span> in the city of Curitiba, Paraná, Brazil.</p>
Full article ">Figure 4
<p>Frequency distribution of individuals classified as forest and isolated individuals.</p>
Full article ">Figure 5
<p>Overview of <span class="html-italic">A. angustifolia</span> distribution by YOLOv7x in Curitiba, Paraná. (<b>a</b>) Forest areas. (<b>b</b>) Kernel Density Map (trees/ha). (<b>c</b>) Predicted trees. (<b>d</b>) Uncertainty distribution for predicted trees.</p>
Full article ">Figure 6
<p>Examples of prediction results: (<b>a</b>) Detection in the context of isolated trees. (<b>b</b>) Detection in forest fragments. (<b>c</b>) Example of a false negative caused by building shadows. (<b>d</b>) Example of a false positive due to confusion with palm trees. (<b>e</b>) Example of a false positive caused by confusion with the shadow projection of an <span class="html-italic">A. angustifolia</span>.</p>
Full article ">
15 pages, 2283 KiB  
Article
Forecasting Covered Call Exchange-Traded Funds (ETFs) Using Time Series, Machine Learning, and Deep Learning Models
by Chigozie Andy Ngwaba
J. Risk Financial Manag. 2025, 18(3), 120; https://doi.org/10.3390/jrfm18030120 - 25 Feb 2025
Viewed by 421
Abstract
This study explores the application of time series, machine learning (ML), and deep learning (DL) models to predict the prices and performance of covered call ETFs. Utilizing historical data from major covered call ETFs like QYLD, XYLD, JEPI, JEPQ, and RYLD, the research [...] Read more.
This study explores the application of time series, machine learning (ML), and deep learning (DL) models to predict the prices and performance of covered call ETFs. Utilizing historical data from major covered call ETFs like QYLD, XYLD, JEPI, JEPQ, and RYLD, the research assesses the predictive accuracy and reliability of different forecasting approaches. It compares traditional time series methods, including ARIMA and Heterogeneous Autoregressive Model (HAR), with advanced ML techniques such as Random Forests (RF) and Support Vector Regression (SVR), as well as DL models like Recurrent Neural Networks (RNN) and Convolutional Neural Networks (CNN). Model performance is evaluated using metrics such as Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC). Results indicate that the DL models are effective at identifying the nonlinear patterns and temporal dependencies in the price movements of covered call ETFs, outperforming both traditional time series and ML techniques. These findings enhance the existing financial forecasting literature and offer valuable insights for investors and portfolio managers aiming to improve their strategies using covered call ETFs. Full article
(This article belongs to the Special Issue Machine Learning, Economic Forecasting, and Financial Markets)
Show Figures

Figure 1

Figure 1
<p>Daily ETF Prices (January 2019–December 2024).</p>
Full article ">Figure 2
<p>Realized volatility for Covered Call ETFs (January 2019–December 2024).</p>
Full article ">Figure 3
<p>Forecast model comparison of actual and predicted values (RNN Model).</p>
Full article ">Figure 4
<p>Forecast model comparison of actual and predicted value (CNN Model).</p>
Full article ">
16 pages, 5222 KiB  
Article
Remaining Useful Life Prediction for Lithium-Ion Batteries Based on Hybrid Ensembles Allied with Data-Driven Approach
by Shuai Zhao, Daming Sun, Yan Liu and Yuqi Liang
Energies 2025, 18(5), 1114; https://doi.org/10.3390/en18051114 - 25 Feb 2025
Viewed by 206
Abstract
Capacity fade in lithium-ion batteries (LIBs) poses challenges for various industries. Predicting and preventing this fade is crucial, and hybrid methods for estimating remaining useful life (RUL) have become prevalent and achieved significant advancements. In this paper, we introduce a hybrid voting ensemble [...] Read more.
Capacity fade in lithium-ion batteries (LIBs) poses challenges for various industries. Predicting and preventing this fade is crucial, and hybrid methods for estimating remaining useful life (RUL) have become prevalent and achieved significant advancements. In this paper, we introduce a hybrid voting ensemble that combines Gradient Boosting, Random Forest, and K-Nearest Neighbors to forecast the fading capacity trend and knee point. We conducted extensive experiments using the CALCE CS2 datasets. The results indicate that our proposed approach outperforms single deep learning methods for RUL prediction and accurately identifies the knee point. Beyond prediction, this innovative method can potentially be integrated into real-world applications for broader use. Full article
(This article belongs to the Section D: Energy Storage and Application)
Show Figures

Figure 1

Figure 1
<p>Framework of proposed approach.</p>
Full article ">Figure 2
<p>The methods for battery RUL prediction and inflection point estimation.</p>
Full article ">Figure 3
<p>Data processing flow for battery RUL prediction and knee point estimation.</p>
Full article ">Figure 4
<p>Correlation of features with capacity.</p>
Full article ">Figure 5
<p>Capacity and voltage change for CS2_35.</p>
Full article ">Figure 6
<p>Voltage and internal resistance change for CS2_35.</p>
Full article ">Figure 7
<p>Estimation process workflow.</p>
Full article ">Figure 8
<p>Voltage changes for battery CS2_36.</p>
Full article ">Figure 9
<p>Prediction results for battery CS2_36.</p>
Full article ">Figure 10
<p>Voltage changes for battery CS2_37.</p>
Full article ">Figure 11
<p>Prediction results for battery CS2_37.</p>
Full article ">Figure 12
<p>Voltage changes for battery CS2_38.</p>
Full article ">Figure 13
<p>Prediction results for battery CS2_38.</p>
Full article ">
14 pages, 3572 KiB  
Article
Explainable Siamese Neural Networks for Detection of High Fall Risk Older Adults in the Community Based on Gait Analysis
by Christos Kokkotis, Kyriakos Apostolidis, Dimitrios Menychtas, Ioannis Kansizoglou, Evangeli Karampina, Maria Karageorgopoulou, Athanasios Gkrekidis, Serafeim Moustakidis, Evangelos Karakasis, Erasmia Giannakou, Maria Michalopoulou, Georgios Ch Sirakoulis and Nikolaos Aggelousis
J. Funct. Morphol. Kinesiol. 2025, 10(1), 73; https://doi.org/10.3390/jfmk10010073 - 22 Feb 2025
Viewed by 212
Abstract
Background/Objectives: Falls among the older adult population represent a significant public health concern, often leading to diminished quality of life and serious injuries that escalate healthcare costs, and they may even prove fatal. Accurate fall risk prediction is therefore crucial for implementing timely [...] Read more.
Background/Objectives: Falls among the older adult population represent a significant public health concern, often leading to diminished quality of life and serious injuries that escalate healthcare costs, and they may even prove fatal. Accurate fall risk prediction is therefore crucial for implementing timely preventive measures. However, to date, there is no definitive metric to identify individuals with high risk of experiencing a fall. To address this, the present study proposes a novel approach that transforms biomechanical time-series data, derived from gait analysis, into visual representations to facilitate the application of deep learning (DL) methods for fall risk assessment. Methods: By leveraging convolutional neural networks (CNNs) and Siamese neural networks (SNNs), the proposed framework effectively addresses the challenges of limited datasets and delivers robust predictive capabilities. Results: Through the extraction of distinctive gait-related features and the generation of class-discriminative activation maps using Grad-CAM, the random forest (RF) machine learning (ML) model not only achieves commendable accuracy (83.29%) but also enhances explainability. Conclusions: Ultimately, this study underscores the potential of advanced computational tools and machine learning algorithms to improve fall risk prediction, reduce healthcare burdens, and promote greater independence and well-being among the older adults. Full article
Show Figures

Figure 1

Figure 1
<p>Visual representation of the single 2D matrix per subject [<a href="#B12-jfmk-10-00073" class="html-bibr">12</a>].</p>
Full article ">Figure 2
<p>Proposed workflow pipeline for the prediction of high fall risk older adults in the community. (<b>a</b>) The concept of the Siamese neural network. (<b>b</b>) The proposed pipeline for classifying the images.</p>
Full article ">Figure 3
<p>Dissimilarity distance between two non-faller images compared to the dissimilarity distance between a non-faller and a faller image.</p>
Full article ">Figure 4
<p>Loss function through iterations during training.</p>
Full article ">Figure 5
<p>Grad-CAM in a non-faller. Boxes highlight the critical points according to the Grad-CAM algorithm.</p>
Full article ">Figure 6
<p>Grad-CAM in a faller. Boxes highlight the critical points according to the Grad-CAM algorithm.</p>
Full article ">Figure 7
<p>Mean Grad-CAM masks for the testing group of the employed fallers and non-fallers. Specifically, (<b>a</b>) presents non-faller participants and (<b>b</b>) presents fallers.</p>
Full article ">
25 pages, 7898 KiB  
Article
Document Relevance Filtering by Natural Language Processing and Machine Learning: A Multidisciplinary Case Study of Patents
by Raj Bridgelall
Appl. Sci. 2025, 15(5), 2357; https://doi.org/10.3390/app15052357 - 22 Feb 2025
Viewed by 439
Abstract
The exponential growth of patent datasets poses a significant challenge in filtering relevant documents for research and innovation. Traditional semantic search methods based on keywords often fail to capture the complexity and variability in multidisciplinary terminology, leading to inefficiencies. This study addresses the [...] Read more.
The exponential growth of patent datasets poses a significant challenge in filtering relevant documents for research and innovation. Traditional semantic search methods based on keywords often fail to capture the complexity and variability in multidisciplinary terminology, leading to inefficiencies. This study addresses the problem by systematically evaluating supervised and unsupervised machine learning (ML) techniques for document relevance filtering across five technology domains: solid-state batteries, electric vehicle chargers, connected vehicles, electric vertical takeoff and landing aircraft, and light detecting and ranging (LiDAR) sensors. The contributions include benchmarking the performance of 10 classical models. These models include extreme gradient boosting, random forest, and support vector machines; a deep artificial neural network; and three natural language processing methods: latent Dirichlet allocation, non-negative matrix factorization, and k-means clustering of a manifold-learned reduced feature dimension. Applying these methods to more than 4200 patents filtered from a database of 9.6 million patents revealed that most supervised ML models outperform the unsupervised methods. An average of seven supervised ML models achieved significantly higher precision, recall, and F1-scores across all technology domains, while unsupervised methods show variability depending on domain characteristics. These results offer a practical framework for optimizing document relevance filtering, enabling researchers and practitioners to efficiently manage large datasets and enhance innovation. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

Figure 1
<p>The methodological workflow.</p>
Full article ">Figure 2
<p>SSB top bigrams in (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 3
<p>SSB term co-occurrence for (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 4
<p>EVC top bigrams in (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 5
<p>EVC term co-occurrence for (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 6
<p>CV top bigrams in (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 7
<p>CV term co-occurrence for (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 8
<p>eVTOL top bigrams in (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 9
<p>eVTOL term co-occurrence for (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 10
<p>LiDAR top bigrams in (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 11
<p>LiDAR term co-occurrence for (<b>a</b>) irrelevant and (<b>b</b>) relevant documents.</p>
Full article ">Figure 12
<p>Visualization of the confusion index from the unsupervised ML for each technology domain.</p>
Full article ">
Back to TopTop