[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (5,446)

Search Parameters:
Keywords = decision trees

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
46 pages, 17108 KiB  
Article
Predicting the Effect of RSW Parameters on the Shear Force and Nugget Diameter of Similar and Dissimilar Joints Using Machine Learning Algorithms and Multilayer Perceptron
by Marwan T. Mezher, Alejandro Pereira and Tomasz Trzepieciński
Materials 2024, 17(24), 6250; https://doi.org/10.3390/ma17246250 (registering DOI) - 20 Dec 2024
Abstract
Resistance spot-welded joints are crucial parts in contemporary manufacturing technology due to their ubiquitous use in the automobile industry. The necessity of improving manufacturing efficiency and quality at an affordable cost requires deep knowledge of the resistance spot welding (RSW) process and the [...] Read more.
Resistance spot-welded joints are crucial parts in contemporary manufacturing technology due to their ubiquitous use in the automobile industry. The necessity of improving manufacturing efficiency and quality at an affordable cost requires deep knowledge of the resistance spot welding (RSW) process and the development of artificial neural network (ANN)- and machine learning (ML)-based modelling techniques, apt for providing essential tools for design, planning, and incorporation in the welding process. Tensile shear force and nugget diameter are the most crucial outputs for evaluating the quality of a resistance spot-welded specimen. This study uses ML and ANN models to predict shear force and nugget diameter responses to RSW parameters. The RSW analysis was executed on similar and dissimilar AISI 304 and grade 2 titanium alloy joints with equal and unequal thicknesses. The input parameters included welding current, pressure, welding duration, squeezing time, holding time, pulse welding, and sheet thickness. Linear regression, Decision tree, Support vector machine (SVM), Random forest (RF), Gradient-boosting, CatBoost, K-Nearest Neighbour (KNN), Ridge, Lasso, and ElasticNet machine learning algorithms, along with two different structures of Multilayer Perceptron, were utilized for studying the impact of the RSW parameters on the shear force and nugget diameter. Different validation metrics were applied to assess each model’s quality. Two equations were developed to determine the shear force and nugget diameter based on the investigation parameters. The current research also presents a prediction of the Relative Importance (RI) of RSW factors. Shear force and nugget diameter predictions were examined using SHapley (SHAP) Additive Explanations for the first time in the RSW field. Trainbr as the training function and Logsig as the transfer function delivered the best ANN model for predicting shear force in a one-output structure. Trainrp with Tansig made the most accurate predictions for nugget diameter in a one-output structure and for shear force and diameter in a two-output structure. Depending on validation metrics, the Random forest model outperformed the other ML algorithms in predicting shear force or nugget diameter in a one-output model, while the Decision tree model gave the best prediction using a two-output structure. Linear regression made the worst ML predictions for shear force, while ElasticNet made the worst nugget diameter forecasts in a one-output model. However, in two-output models, Lasso made the worst predictions. Full article
(This article belongs to the Section Metals and Alloys)
Show Figures

Figure 1

Figure 1
<p>Schematic illustration of the tensile RSW sample.</p>
Full article ">Figure 2
<p>Schematic illustration of RSW process.</p>
Full article ">Figure 3
<p>Resistance spot-welded specimens of the ten cases (<b>A</b>–<b>J</b>).</p>
Full article ">Figure 3 Cont.
<p>Resistance spot-welded specimens of the ten cases (<b>A</b>–<b>J</b>).</p>
Full article ">Figure 4
<p>Fractured RSW samples of the ten cases (<b>A</b>–<b>J</b>).</p>
Full article ">Figure 4 Cont.
<p>Fractured RSW samples of the ten cases (<b>A</b>–<b>J</b>).</p>
Full article ">Figure 5
<p>Schematic of the linear regression curve.</p>
Full article ">Figure 6
<p>Schematic of the Decision tree model.</p>
Full article ">Figure 7
<p>Diagrams of the SVR ML model: (<b>a</b>) linear SVR, (<b>b</b>) non-linear SVR.</p>
Full article ">Figure 8
<p>Diagram of the KNN ML model.</p>
Full article ">Figure 9
<p>Diagram of the Gradient-boosting ML algorithm.</p>
Full article ">Figure 10
<p>Diagram of the Random forest ML algorithm.</p>
Full article ">Figure 11
<p>Neural network structure of the RSW process, (<b>a</b>) one output, (<b>b</b>) two outputs.</p>
Full article ">Figure 12
<p>ANN model with one-output structure.</p>
Full article ">Figure 13
<p>MSE of shear force using various training and transfer functions.</p>
Full article ">Figure 14
<p>MSE of shear force using different ML models.</p>
Full article ">Figure 15
<p>Actual and predicted shear force using different ML models.</p>
Full article ">Figure 15 Cont.
<p>Actual and predicted shear force using different ML models.</p>
Full article ">Figure 16
<p>MSE of nugget diameter using various training and transfer functions.</p>
Full article ">Figure 17
<p>MSE of nugget diameter using different ML models.</p>
Full article ">Figure 18
<p>Actual and predicted nugget diameter using different ML models.</p>
Full article ">Figure 18 Cont.
<p>Actual and predicted nugget diameter using different ML models.</p>
Full article ">Figure 19
<p>ANN model with two-output structure.</p>
Full article ">Figure 20
<p>MSE of shear force and nugget diameter using various training and transfer functions.</p>
Full article ">Figure 21
<p>Regression curve of actual and predicted data of the best two-output ANN model using Trainrp with Tansig.</p>
Full article ">Figure 22
<p>MSE of shear force and nugget diameter in the two-output structure using different ML models.</p>
Full article ">Figure 23
<p>Relative importance of the RSW parameters on the shear force based on (<b>a</b>) Gradient-boosting, (<b>b</b>) CatBoost, (<b>c</b>) Random forest, (<b>d</b>) Decision tree models.</p>
Full article ">Figure 24
<p>Relative importance of the RSW parameters on the nugget diameter based on (<b>a</b>) Gradient boosting, (<b>b</b>) CatBoost, (<b>c</b>) Random forest, (<b>d</b>) Decision tree models.</p>
Full article ">Figure 25
<p>Summary plot of SHAP value impact on shear force for different algorithms: (<b>a</b>) Gradient boosting, (<b>b</b>) CatBoost, (<b>c</b>) Random forest, (<b>d</b>) Decision tree, (<b>e</b>) SVM, (<b>f</b>) KNN, (<b>g</b>) Linear regression, (<b>h</b>) Lasso, (<b>i</b>) ElasticNet, (<b>j</b>) Ridge.</p>
Full article ">Figure 25 Cont.
<p>Summary plot of SHAP value impact on shear force for different algorithms: (<b>a</b>) Gradient boosting, (<b>b</b>) CatBoost, (<b>c</b>) Random forest, (<b>d</b>) Decision tree, (<b>e</b>) SVM, (<b>f</b>) KNN, (<b>g</b>) Linear regression, (<b>h</b>) Lasso, (<b>i</b>) ElasticNet, (<b>j</b>) Ridge.</p>
Full article ">Figure 26
<p>Summary plot of SHAP value impact on nugget diameter for different algorithms: (<b>a</b>) Gradient boosting, (<b>b</b>) CatBoost, (<b>c</b>) Random forest, (<b>d</b>) Decision tree, (<b>e</b>) SVM, (<b>f</b>) KNN, (<b>g</b>) Linear regression, (<b>h</b>) Lasso, (<b>i</b>) ElasticNet, (<b>j</b>) Ridge.</p>
Full article ">Figure 26 Cont.
<p>Summary plot of SHAP value impact on nugget diameter for different algorithms: (<b>a</b>) Gradient boosting, (<b>b</b>) CatBoost, (<b>c</b>) Random forest, (<b>d</b>) Decision tree, (<b>e</b>) SVM, (<b>f</b>) KNN, (<b>g</b>) Linear regression, (<b>h</b>) Lasso, (<b>i</b>) ElasticNet, (<b>j</b>) Ridge.</p>
Full article ">Figure 27
<p>SHAP decision plot of shear force for different algorithms: (<b>a</b>) Random forest, (<b>b</b>) Decision tree, (<b>c</b>) Linear regression, (<b>d</b>) Ridge.</p>
Full article ">Figure 28
<p>SHAP decision plot of nugget diameter for different algorithms: (<b>a</b>) Random forest, (<b>b</b>) Decision tree, (<b>c</b>) Linear regression, (<b>d</b>) Ridge.</p>
Full article ">Figure 29
<p>SHAP dependence plot of shear force for different algorithms: (<b>a</b>) Decision tree, (<b>b</b>) Ridge.</p>
Full article ">Figure 30
<p>SHAP dependence plot of nugget diameter for different algorithms: (<b>a</b>) Decision tree, (<b>b</b>) Ridge.</p>
Full article ">
24 pages, 46652 KiB  
Article
Hyperspectral Reconstruction Method Based on Global Gradient Information and Local Low-Rank Priors
by Chipeng Cao, Jie Li, Pan Wang, Weiqiang Jin, Runrun Zou and Chun Qi
Remote Sens. 2024, 16(24), 4759; https://doi.org/10.3390/rs16244759 - 20 Dec 2024
Abstract
Hyperspectral compressed imaging is a novel imaging detection technology based on compressed sensing theory that can quickly acquire spectral information of terrestrial objects in a single exposure. It combines reconstruction algorithms to recover hyperspectral data from low-dimensional measurement images. However, hyperspectral images from [...] Read more.
Hyperspectral compressed imaging is a novel imaging detection technology based on compressed sensing theory that can quickly acquire spectral information of terrestrial objects in a single exposure. It combines reconstruction algorithms to recover hyperspectral data from low-dimensional measurement images. However, hyperspectral images from different scenes often exhibit high-frequency data sparsity and existing deep reconstruction algorithms struggle to establish accurate mapping models, leading to issues with detail loss in the reconstruction results. To address this issue, we propose a hyperspectral reconstruction method based on global gradient information and local low-rank priors. First, to improve the prior model’s efficiency in utilizing information of different frequencies, we design a gradient sampling strategy and training framework based on decision trees, leveraging changes in the loss function gradient information to enhance the model’s predictive capability for data of varying frequencies. Second, utilizing the local low-rank prior characteristics of the representative coefficient matrix, we develop a sparse sensing denoising module to effectively improve the local smoothness of point predictions. Finally, by establishing a regularization term for the reconstruction process based on the semantic similarity between the denoised results and prior spectral data, we ensure spatial consistency and spectral fidelity in the reconstruction results. Experimental results indicate that the proposed method achieves better detail recovery across different scenes, demonstrates improved generalization performance for reconstructing information of various frequencies, and yields higher reconstruction quality. Full article
Show Figures

Figure 1

Figure 1
<p>Structural composition of the DCCHI system and data structure of SD-CASSI detector sampling.</p>
Full article ">Figure 2
<p>Reconstruction algorithm framework.</p>
Full article ">Figure 3
<p>RGB images from the KAIST, Harvard, and hyperspectral remote sensing datasets.</p>
Full article ">Figure 4
<p>Selected spectral curves of the pixel point with coordinates (180, 70), showing a visual comparison of different methods in the spectral dimension and comparing the pseudocolor images and local spatial detail information under different wavelengths.</p>
Full article ">Figure 5
<p>Selected spectral curves of the pixel point with coordinates (150, 100), showing a visual comparison of different methods in the spectral dimension and comparing the pseudocolor images and local spatial detail information of different wavelengths.</p>
Full article ">Figure 6
<p>Comparison of the spectral consistency of reconstruction results with different methods on the PaviaU hyperspectral remote sensing datast at sample point coordinates (180, 110), along with a comparison of the spatial detail information of the reconstruction results at different wavelengths.</p>
Full article ">Figure 7
<p>Comparison of the spectral consistency of reconstruction results with different methods on the PaviaC hyperspectral remote sensing dataset at sample point coordinates (190, 50), along with a comparison of the spatial detail information of the reconstruction results at different wavelengths.</p>
Full article ">Figure 8
<p>Comparison of spectral reconstruction results for different crops.</p>
Full article ">Figure 9
<p>Impact of hyperparameter settings on reconstruction quality.</p>
Full article ">Figure 10
<p>Comparison of pseudocolor images generated from the predictions of different prior models for the Harvard Scene 04 hyperspectral data at the 5th, 12th, and 25th bands, along with the spectral differences of the predictions at different wavelengths.</p>
Full article ">Figure 11
<p>Comparison of pseudocolor images generated from the predictions of different prior models for the PaviaU hyperspectral remote sensing data at the 3rd, 13th, and 26th bands, along with the spectral differences of the predictions at different wavelengths.</p>
Full article ">Figure 12
<p>Variation in reconstruction quality with increasing iteration count under the same solving framework for different regularization constraint methods.</p>
Full article ">
18 pages, 3499 KiB  
Article
The Artificial Tree: Integrating Microalgae into Sustainable Architecture for CO2 Capture and Urban Efficiency—A Comprehensive Analysis
by Rosa Cervera, María Rosa Villalba and Javier Sánchez
Buildings 2024, 14(12), 4045; https://doi.org/10.3390/buildings14124045 - 20 Dec 2024
Abstract
The Artificial Tree project, developed by the authors, presents an innovative approach to urban sustainability by integrating microalgae cultivation systems for CO2 capture, biomass production, and urban cooling. This study evaluates the project’s feasibility and effectiveness in transforming urban furniture into functional [...] Read more.
The Artificial Tree project, developed by the authors, presents an innovative approach to urban sustainability by integrating microalgae cultivation systems for CO2 capture, biomass production, and urban cooling. This study evaluates the project’s feasibility and effectiveness in transforming urban furniture into functional photobioreactors that enhance environmental quality. Inspired by natural aesthetics, the Artificial Tree functions as both a CO2 sink and a biofertilizer producer. Using Scenedesmus microalgae, the system captures 50 kg of CO2 annually per unit and generates 28 kg of biomass, which further reduces emissions when utilized as a biofertilizer. To assess its impact, a multi-criteria decision analysis (MCDA) method was employed, considering factors such as CO2 capture, biomass production, social engagement, visual appeal, and scalability. This methodology incorporated a three-level qualitative scale and criteria tailored to compare similar projects with at least three months of operation and available data on microalgae productivity. Results highlight that the Artificial Tree achieves up to 2.5 times more CO2 fixation than a mature tree while combining environmental benefits with public engagement. Its modular and aesthetic design supports educational outreach and inspires larger-scale implementation. This project demonstrates significant potential to redefine urban spaces sustainably by seamlessly integrating functionality, artistic expression, and public interaction. Full article
(This article belongs to the Special Issue Climate-Responsive Architectural and Urban Design)
Show Figures

Figure 1

Figure 1
<p>(<b>a</b>) Photo of the upper part of the “Artificial Tree”; (<b>b</b>) top plans of the system.</p>
Full article ">Figure 2
<p>(<b>a</b>) Photo showing the “Artificial Tree” with a human scale; (<b>b</b>) render created prior to construction.</p>
Full article ">Figure 3
<p>(<b>a</b>) Elements of the “Artificial Tree”; (<b>b</b>) schematic diagram of the PBR principle [<a href="#B4-buildings-14-04045" class="html-bibr">4</a>].</p>
Full article ">Figure 4
<p>The Liquid3 project: (<b>a</b>) description of the functionality of the system; (<b>b</b>) photo of the first installation on Makedonska Street in Belgrade. Courtesy of Dr. Ivan Spasojevic [<a href="#B28-buildings-14-04045" class="html-bibr">28</a>].</p>
Full article ">Figure 5
<p>Radar chart representing the points obtained from the three-level qualitative scale given in <a href="#buildings-14-04045-t001" class="html-table">Table 1</a>.</p>
Full article ">Figure 6
<p>Bar diagram representing the cumulative value by category and the total value of points earned by each project.</p>
Full article ">
26 pages, 6567 KiB  
Article
Forecasting Electricity Production in a Small Hydropower Plant (SHP) Using Artificial Intelligence (AI)
by Dawid Maciejewski, Krzysztof Mudryk and Maciej Sporysz
Energies 2024, 17(24), 6401; https://doi.org/10.3390/en17246401 - 19 Dec 2024
Abstract
This article devises the Artificial Intelligence (AI) methods of designing models of short-term forecasting (in 12 h and 24 h horizons) of electricity production in a selected Small Hydropower Plant (SHP). Renewable Energy Sources (RESs) are difficult to predict due to weather variability. [...] Read more.
This article devises the Artificial Intelligence (AI) methods of designing models of short-term forecasting (in 12 h and 24 h horizons) of electricity production in a selected Small Hydropower Plant (SHP). Renewable Energy Sources (RESs) are difficult to predict due to weather variability. Electricity production by a run-of-river SHP is marked by the variability related to the access to instantaneous flow in the river and weather conditions. In order to develop predictive models of an SHP facility (installed capacity 760 kW), which is located in Southern Poland on the Skawa River, hourly data from nearby meteorological stations and a water gauge station were collected as explanatory variables. Data on the water management of the retention reservoir above the SHP were also included. The variable to be explained was the hourly electricity production, which was obtained from the tested SHP over a period of 3 years and 10 months. Obtaining these data to build models required contact with state institutions and private entrepreneurs of the SHP. Four AI methods were chosen to create predictive models: two types of Artificial Neural Networks (ANNs), Multilayer Perceptron (MLP) and Radial Base Functions (RBFs), and two types of decision trees methods, Random Forest (RF) and Gradient-Boosted Decision Trees (GBDTs). Finally, after applying forecast quality measures of Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R2), the most effective model was indicated. The decision trees method proved to be more accurate than ANN models. The best GBDT models’ errors were MAPE 3.17% and MAE 9.97 kWh (for 12 h horizon), and MAPE 3.41% and MAE 10.96 kWh (for 24 h horizon). MLPs had worse results: MAPE from 5.41% to 5.55% and MAE from 18.02 kWh to 18.40 kWh (for 12 h horizon), and MAPE from 7.30% to 7.50% and MAE from 24.12 kWh to 24.83 kWh (for 24 h horizon). Forecasts using RBF were not made due to the very low quality of training and testing (the correlation coefficient was approximately 0.3). Full article
Show Figures

Figure 1

Figure 1
<p>Location of SHP infrastructure elements: 1—SHP building, 2—transformer, 3—weir, 4—water inlet to the fish ladder, 5—water inlet to the canal with turbines, 6—water outlet from the fish ladder, 7—water outlet from the canal with turbines.</p>
Full article ">Figure 2
<p>A map with locations: 1—SHP, 2—Inwałd Meteorological Station, 3—Kalwaria Zebrzydowska Meteorological Station, 4—Wadowice Meteorological Station, 5—Wadowice Water Gauge Station, 6—Świnna Poręba Reservoir.</p>
Full article ">Figure 3
<p>Świnna Poręba Reservoir and Mucharskie Lake [<a href="#B29-energies-17-06401" class="html-bibr">29</a>].</p>
Full article ">Figure 4
<p>Sample scheme of MLP [<a href="#B40-energies-17-06401" class="html-bibr">40</a>].</p>
Full article ">Figure 5
<p>Architecture of RBF network [<a href="#B43-energies-17-06401" class="html-bibr">43</a>].</p>
Full article ">Figure 6
<p>RF prediction scheme [<a href="#B48-energies-17-06401" class="html-bibr">48</a>].</p>
Full article ">Figure 7
<p>Sample scheme of a GBDT [<a href="#B51-energies-17-06401" class="html-bibr">51</a>].</p>
Full article ">Figure 8
<p>Fragment of the file with parameters for forecasting.</p>
Full article ">Figure 9
<p>Comparison of real SHP’s electricity production in the SHP and 12 h prediction made by MLPs.</p>
Full article ">Figure 10
<p>Comparison of real SHP’s electricity production in the SHP and 24 h prediction made by MLPs.</p>
Full article ">Figure 11
<p>Comparison of real SHP’s electricity production in the SHP and a 12 h horizon prediction made by RFs.</p>
Full article ">Figure 12
<p>Comparison of real SHP’s electricity production in the SHP and a 24 h horizon prediction made by RFs.</p>
Full article ">Figure 13
<p>Comparison of real SHP’s electricity production in the SHP and a 12 h horizon prediction made by GBDTs.</p>
Full article ">Figure 14
<p>Comparison of real SHP’s electricity production in the SHP and a 24 h horizon prediction made by GBDTs.</p>
Full article ">
24 pages, 2119 KiB  
Article
Reframing Forest Harvest Scheduling Models for Ecosystem Services Management
by Silvana Ribeiro Nobre, Marc Eric McDill, Luiz Carlos Estraviz Rodriguez and Luis Diaz-Balteiro
Forests 2024, 15(12), 2236; https://doi.org/10.3390/f15122236 - 19 Dec 2024
Abstract
Linear programming models have been used in forest management planning since the 1960s. These models have been formulated in three basic ways: Models I, II, and III, which are defined by the sequences of management unit states represented by the variables. In Model [...] Read more.
Linear programming models have been used in forest management planning since the 1960s. These models have been formulated in three basic ways: Models I, II, and III, which are defined by the sequences of management unit states represented by the variables. In Model I, variables represent sequences of states from the beginning of the planning horizon to the end. In Model II, variables represent sequences of states from one intervention to the next. Finally, in Model III, variables represent a single arc in a management unit’s decision tree, i.e., two states. The objectives of this paper are to clarify the definitions of these model variations and evaluate the advantages and disadvantages of each model. This second objective is to test the hypothesis that the relative performance of these models varies with the increasing number of ecosystem services (ES) incorporated into the models. This objective was achieved by formulating a case study problem using each model type. The case study includes three increasingly complex scenarios, each incorporating additional ecosystem services. Results show that despite having more variables and constraints, Model III requires the least time to formulate due to its less dense parameter matrix. Model II has the shortest solution times, followed closely by Model III, while Model I requires the longest times for both formulation and solution. These results are increasingly apparent in more complex scenarios. Full article
(This article belongs to the Special Issue Multiple-Use and Ecosystem Services of Forests—2nd Edition)
Show Figures

Figure 1

Figure 1
<p>Decision tree graph representation of management alternatives for a single example forest management unit. The brown node is the initial node, intervention nodes are blue, and non-intervention nodes are gray.</p>
Full article ">Figure 2
<p>Model I representation of the management alternatives for the example management unit shown in <a href="#forests-15-02236-f001" class="html-fig">Figure 1</a>.</p>
Full article ">Figure 3
<p>Model II representation of the management alternatives for the example management unit shown in <a href="#forests-15-02236-f001" class="html-fig">Figure 1</a>.</p>
Full article ">Figure 4
<p>Model-building time plus solution time for Models I, II, and III in three scenarios.</p>
Full article ">
26 pages, 1777 KiB  
Systematic Review
Machine Learning Models in Sepsis Outcome Prediction for ICU Patients: Integrating Routine Laboratory Tests—A Systematic Review
by Florentina Mușat, Dan Nicolae Păduraru, Alexandra Bolocan, Cosmin Alexandru Palcău, Andreea-Maria Copăceanu, Daniel Ion, Viorel Jinga and Octavian Andronic
Biomedicines 2024, 12(12), 2892; https://doi.org/10.3390/biomedicines12122892 - 19 Dec 2024
Viewed by 151
Abstract
Background. Sepsis presents significant diagnostic and prognostic challenges, and traditional scoring systems, such as SOFA and APACHE, show limitations in predictive accuracy. Machine learning (ML)-based predictive survival models can support risk assessment and treatment decision-making in the intensive care unit (ICU) by accounting [...] Read more.
Background. Sepsis presents significant diagnostic and prognostic challenges, and traditional scoring systems, such as SOFA and APACHE, show limitations in predictive accuracy. Machine learning (ML)-based predictive survival models can support risk assessment and treatment decision-making in the intensive care unit (ICU) by accounting for the numerous and complex factors that influence the outcome in the septic patient. Methods. A systematic literature review of studies published from 2014 to 2024 was conducted using the PubMed database. Eligible studies investigated the development of ML models incorporating commonly available laboratory and clinical data for predicting survival outcomes in adult ICU patients with sepsis. Study selection followed the PRISMA guidelines and relied on predefined inclusion criteria. All records were independently assessed by two reviewers, with conflicts resolved by a third senior reviewer. Data related to study design, methodology, results, and interpretation of the results were extracted in a predefined grid. Results. Overall, 19 studies were identified, encompassing primarily logistic regression, random forests, and neural networks. Most used datasets were US-based (MIMIC-III, MIMIC-IV, and eICU-CRD). The most common variables used in model development were age, albumin levels, lactate levels, and ventilator. ML models demonstrated superior performance metrics compared to conventional methods and traditional scoring systems. The best-performing model was a gradient boosting decision tree, with an area under curve of 0.992, an accuracy of 0.954, and a sensitivity of 0.917. However, several critical limitations should be carefully considered when interpreting the results, such as population selection bias (i.e., single center studies), small sample sizes, limited external validation, and model interpretability. Conclusions. Through real-time integration of routine laboratory and clinical data, ML-based tools can assist clinical decision-making and enhance the consistency and quality of sepsis management across various healthcare contexts, including ICUs with limited resources. Full article
(This article belongs to the Special Issue Artificial Intelligence Applications in Cancer and Other Diseases)
Show Figures

Figure 1

Figure 1
<p>Literature review flow—PRISMA diagram.</p>
Full article ">Figure 2
<p>Prevalence of the most important variables for sepsis mortality prediction based on the extracted data. BUN—blood urea nitrogen. SpO<sub>2</sub>—blood oxygen saturation.</p>
Full article ">Figure 3
<p>Model classification by count of studies. Others: DCQMFF—double coefficient quadratic multivariate fitting function, KNN—k-nearest neighbor, RFS—random survival forest, RVM—relevance vector machine, naïve Bayes.</p>
Full article ">Figure 4
<p>Accuracy and AUC at validation for the ML models with the best performance metrics from each study. * For the eICU-CRD dataset. ** Model developed with a subset of selected variables. *** Model developed with all available variables. <sup>†</sup> Model developed using the physiological and prognostic variables. <sup>‡</sup> Model developed using the clinical care variables. <sup>§</sup> Performance determined in comparison with predictions from physicians, abbMEDS, mREMS, and SOFA. CNN—convolutional neural network, DCQMFF—double coefficient quadratic multivariate fitting function, GBDT—gradient boosting decision tree, GBM—gradient boosting machine, LSTM—long short-term memory networks, MLP-NN—multilayer perceptron neural network, RF—random forest, SVM—support vector machine [<a href="#B6-biomedicines-12-02892" class="html-bibr">6</a>,<a href="#B8-biomedicines-12-02892" class="html-bibr">8</a>,<a href="#B9-biomedicines-12-02892" class="html-bibr">9</a>,<a href="#B11-biomedicines-12-02892" class="html-bibr">11</a>,<a href="#B12-biomedicines-12-02892" class="html-bibr">12</a>,<a href="#B22-biomedicines-12-02892" class="html-bibr">22</a>,<a href="#B32-biomedicines-12-02892" class="html-bibr">32</a>,<a href="#B34-biomedicines-12-02892" class="html-bibr">34</a>,<a href="#B35-biomedicines-12-02892" class="html-bibr">35</a>,<a href="#B36-biomedicines-12-02892" class="html-bibr">36</a>,<a href="#B37-biomedicines-12-02892" class="html-bibr">37</a>,<a href="#B38-biomedicines-12-02892" class="html-bibr">38</a>,<a href="#B39-biomedicines-12-02892" class="html-bibr">39</a>,<a href="#B40-biomedicines-12-02892" class="html-bibr">40</a>,<a href="#B41-biomedicines-12-02892" class="html-bibr">41</a>,<a href="#B42-biomedicines-12-02892" class="html-bibr">42</a>].</p>
Full article ">
19 pages, 4979 KiB  
Article
Current and Potential Land Use/Land Cover (LULC) Scenarios in Dry Lands Using a CA-Markov Simulation Model and the Classification and Regression Tree (CART) Method: A Cloud-Based Google Earth Engine (GEE) Approach
by Elsayed A. Abdelsamie, Abdel-rahman A. Mustafa, Abdelbaset S. El-Sorogy, Hanafey F. Maswada, Sattam A. Almadani, Mohamed S. Shokr, Ahmed I. El-Desoky and Jose Emilio Meroño de Larriva
Sustainability 2024, 16(24), 11130; https://doi.org/10.3390/su162411130 - 19 Dec 2024
Viewed by 205
Abstract
Rapid population growth accelerates changes in land use and land cover (LULC), straining natural resource availability. Monitoring LULC changes is essential for managing resources and assessing climate change impacts. This study focused on extracting LULC data from 1993 to 2024 using the classification [...] Read more.
Rapid population growth accelerates changes in land use and land cover (LULC), straining natural resource availability. Monitoring LULC changes is essential for managing resources and assessing climate change impacts. This study focused on extracting LULC data from 1993 to 2024 using the classification and regression tree (CART) method on the Google Earth Engine (GEE) platform in Qena Governorate, Egypt. Moreover, the cellular automata (CA) Markov model was used to anticipate the future changes in LULC for the research area in 2040 and 2050. Three multispectral satellite images—Landsat thematic mapper (TM), enhanced thematic mapper (ETM+), and operational land imager (OLI)—were analyzed and verified using the GEE code editor. The CART classifier, integrated into GEE, identified four major LULC categories: urban areas, water bodies, cultivated soils, and bare areas. From 1993 to 2008, urban areas expanded by 57 km2, while bare and cultivated soils decreased by 12.4 km2 and 42.7 km2, respectively. Between 2008 and 2024, water bodies increased by 24.4 km2, urban areas gained 24.2 km2, and cultivated and bare soils declined by 22.2 km2 and 26.4 km2, respectively. The CA-Markov model’s thematic maps highlighted the spatial distribution of forecasted LULC changes for 2040 and 2050. The results indicated that the urban areas, agricultural land, and water bodies will all increase. However, as anticipated, the areas of bare lands shrank during the years under study. These findings provide valuable insights for decision makers, aiding in improved land-use management, strategic planning for land reclamation, and sustainable agricultural production programs. Full article
(This article belongs to the Special Issue Sustainable Development and Land Use Change in Tropical Ecosystems)
Show Figures

Figure 1

Figure 1
<p>Location of the study area.</p>
Full article ">Figure 2
<p>Flowchart of the methodology framework followed in the current study.</p>
Full article ">Figure 3
<p>The classified image of the study area in 1993.</p>
Full article ">Figure 4
<p>The classified image of the study area in 2008.</p>
Full article ">Figure 5
<p>The classified image of the study area in 2024.</p>
Full article ">Figure 6
<p>The changes in km<sup>2</sup> of each LULC category for 1993–2008.</p>
Full article ">Figure 7
<p>The changes in km<sup>2</sup> of each LULC category for 2008–2024.</p>
Full article ">Figure 8
<p>LULC change distribution over time in km<sup>2</sup> between 1993, 2008, and 2024.</p>
Full article ">Figure 9
<p>Spatial distribution map of forecasted LULC changes during 2040.</p>
Full article ">Figure 10
<p>Spatial distribution map of forecasted LULC changes during 2050.</p>
Full article ">Figure 11
<p>The comparison between simulated and real in km<sup>2</sup> of 2008 and 2024.</p>
Full article ">
20 pages, 19148 KiB  
Article
Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning
by Dan Jiang, Fei Guo, Ziteng Zhang, Xiaoqing Yu, Jing Dong, Hongchi Zhang and Zhen Zhang
Buildings 2024, 14(12), 4024; https://doi.org/10.3390/buildings14124024 - 18 Dec 2024
Viewed by 223
Abstract
The relationship between coronary heart disease (CHD) and complex urban built environments remains a subject of considerable uncertainty. The development of predictive models via machine learning to explore the underlying mechanisms of this association, as well as the formulation of intervention policies and [...] Read more.
The relationship between coronary heart disease (CHD) and complex urban built environments remains a subject of considerable uncertainty. The development of predictive models via machine learning to explore the underlying mechanisms of this association, as well as the formulation of intervention policies and planning strategies, has emerged as a pivotal area of research. A cross-sectional dataset of hospital admissions for CHD over the course of a year from a hospital in Dalian City, China, was assembled and matched with multi-source built environment data via residential addresses. This study evaluates five machine learning models, including decision tree (DT), random forest (RF), eXtreme gradient boosting (XGBoost), multi-layer perceptron (MLP), and support vector machine (SVM), and compares them with multiple linear regression models. The results show that DT, RF, and XGBoost exhibit superior predictive capabilities, with all R2 values exceeding 0.70. The DT model performed the best, with an R2 value of 0.818, and the best performance was based on metrics such as MAE and MSE. Additionally, using explainable AI techniques, this study reveals the contribution of different built environment factors to CHD and identifies the significant factors influencing CHD in cold regions, ranked as age, Digital Elevation Model (DEM), house price (HP), sky view factor (SVF), and interaction factors. Stratified analyses by age and gender show variations in the influencing factors for different groups: for those under 60 years old, Road Density is the most influential factor; for the 61–70 age group, house price is the top factor; for the 71–80 age group, age is the most significant factor; for those over 81 years old, building height is the leading factor; in males, GDP is the most influential factor; and in females, age is the most influential factor. This study explores the feasibility and performance of machine learning in predicting CHD risk in the built environment of cold regions and provides a comprehensive methodology and workflow for predicting cardiovascular disease risk based on refined neighborhood-level built environment factors, offering scientific support for the construction of sustainable healthy cities. Full article
Show Figures

Figure 1

Figure 1
<p>Location map of Dalian city and the study area.</p>
Full article ">Figure 2
<p>Patient distribution in Dalian city.</p>
Full article ">Figure 3
<p>Distribution of urban built environment variables in the four districts of Dalian city: (<b>a</b>) BH, (<b>b</b>) SVF, (<b>c</b>) UR, (<b>d</b>) FAI, (<b>e</b>) DEM, (<b>f</b>) MNDWI, (<b>g</b>) NDBI, (<b>h</b>) NDVI, (<b>i</b>) hospital, (<b>j</b>) food, (<b>k</b>) PT, (<b>l</b>) RS, (<b>m</b>) AP, (<b>n</b>) Sports, (<b>o</b>) GDP, (<b>p</b>) HP, (<b>q</b>) POP.</p>
Full article ">Figure 4
<p>Detailed workflow for this study.</p>
Full article ">Figure 5
<p>Scatter plot of model fitting performance: machine learning models vs. linear regression.</p>
Full article ">Figure 6
<p>Error histogram: machine learning models vs. linear regression.</p>
Full article ">Figure 7
<p>Results of CHD–built environment element association analyses for all populations: (<b>a</b>) order of importance of SHAP values; (<b>b</b>) scatterplot of different SHAP values.</p>
Full article ">Figure 8
<p>Results of the analysis of the interaction effect of CHD–built environment elements: (<b>a</b>) order of importance of SHAP values; (<b>b</b>) scatterplot of different SHAP values.</p>
Full article ">Figure 9
<p>Results of CHD–built environment factor analyses for different age groups: (<b>a</b>) ranking of SHAP importance for ≤60 years age group, (<b>b</b>) scatter plot of different SHAP values for ≤60 years age group, (<b>c</b>) ranking of SHAP importance for 61–70 years age group, (<b>d</b>) scatter plot of different SHAP values for 61–70 years age group, (<b>e</b>) ranking of SHAP importance for 71–80 years age group, (<b>f</b>) scatterplot of different SHAP values for age group 71–80 years, (<b>g</b>) ranking of SHAP importance for ≥80 years age group, and (<b>h</b>) scatterplot of different SHAP values for age group ≥80 years.</p>
Full article ">Figure 10
<p>Results of CHD–built environment factor analyses in different gender groups: (<b>a</b>) ranking of SHAP importance in male group, (<b>b</b>) scatter plot of different SHAP values in male group, (<b>c</b>) ranking of SHAP importance in female group, and (<b>d</b>) scatter plot of different SHAP values in female group.</p>
Full article ">
24 pages, 3564 KiB  
Article
Optimizing Femtosecond Texturing Process Parameters Through Advanced Machine Learning Models in Tribological Applications
by Yassmin Seid Ahmed
Lubricants 2024, 12(12), 454; https://doi.org/10.3390/lubricants12120454 - 18 Dec 2024
Viewed by 360
Abstract
Surface texturing plays a vital role in enhancing tribological performance, reducing friction and wear, and improving durability in industrial applications. This study introduces an innovative approach by employing machine learning models—specifically, decision trees, support vector machines, and artificial neural networks—to predict optimal femtosecond [...] Read more.
Surface texturing plays a vital role in enhancing tribological performance, reducing friction and wear, and improving durability in industrial applications. This study introduces an innovative approach by employing machine learning models—specifically, decision trees, support vector machines, and artificial neural networks—to predict optimal femtosecond laser surface texturing parameters for tungsten carbide tested with WS2 and TiCN coatings. Traditionally, the selection of laser parameters has relied heavily on a trial-and-error method, which is both time-consuming and inefficient. By integrating machine learning, this study advances beyond conventional methods to accurately predict the depth and quality of textured features. The ANN demonstrated superior predictive accuracy among the models tested, outperforming SVM and Decision Trees. This machine learning-based approach not only optimizes the surface texturing process by reducing experimental effort but also enhances the resultant surface performance, making it well-suited for applications in sectors such as automotive and oil and gas. Full article
Show Figures

Graphical abstract

Graphical abstract
Full article ">Figure 1
<p>Machine learning process flow.</p>
Full article ">Figure 2
<p>Main concept of decision trees.</p>
Full article ">Figure 3
<p>Artificial neural network framework.</p>
Full article ">Figure 4
<p>Genetic algorithm flowchart.</p>
Full article ">Figure 5
<p>Machine learning methodology.</p>
Full article ">Figure 6
<p>Friction coefficient values at different conditions.</p>
Full article ">Figure 7
<p>Wear rate values.</p>
Full article ">Figure 8
<p>Hardness values.</p>
Full article ">Figure 9
<p>Hyperparameter results.</p>
Full article ">Figure 10
<p>Prediction values for the textured WS<sub>2</sub>: (<b>a</b>) depth and (<b>b</b>) COF.</p>
Full article ">Figure 11
<p>Scatter plot comparing model predictions (red dots) with experimental values (blue dots) across various surface characteristics.</p>
Full article ">
28 pages, 16088 KiB  
Article
A Hierarchical Machine Learning-Based Strategy for Mapping Grassland in Manitoba’s Diverse Ecoregions
by Mirmajid Mousavi, James Kobina Mensah Biney, Barbara Kishchuk, Ali Youssef, Marcos R. C. Cordeiro, Glenn Friesen, Douglas Cattani, Mustapha Namous and Nasem Badreldin
Remote Sens. 2024, 16(24), 4730; https://doi.org/10.3390/rs16244730 - 18 Dec 2024
Viewed by 276
Abstract
Accurate and reliable knowledge about grassland distribution is essential for farmers, stakeholders, and government to effectively manage grassland resources from agro-economical and ecological perspectives. This study developed a novel pixel-based grassland classification approach using three supervised machine learning (ML) algorithms, which were assessed [...] Read more.
Accurate and reliable knowledge about grassland distribution is essential for farmers, stakeholders, and government to effectively manage grassland resources from agro-economical and ecological perspectives. This study developed a novel pixel-based grassland classification approach using three supervised machine learning (ML) algorithms, which were assessed in the province of Manitoba, Canada. The grassland classification process involved three stages: (1) to distinguish between vegetation and non-vegetation covers, (2) to differentiate grassland from non-grassland landscapes, and (3) to identify three specific grassland classes (tame, native, and mixed grasses). Initially, this study investigated different satellite data, such as Sentinel-1 (S1), Sentinel-2 (S2), and Landsat 8 and 9, individually and combined, using the random forest (RF) method, with the best performance at the first two steps achieved using a combination of S1 and S2. The combination was then utilized to conduct the first two steps of classification using support vector machine (SVM) and gradient tree boosting (GTB). In step 3, after filtering out non-grassland pixels, the performance of RF, SVM, and GTB classifiers was evaluated with combined S1 and S2 data to distinguish different grassland types. Eighty-nine multitemporal raster-based variables, including spectral bands, SAR backscatters, and digital elevation models (DEM), were input for ML models. RF had the highest classification accuracy at 69.96% overall accuracy (OA) and a Kappa value of 0.55. After feature selection, the variables were reduced to 61, increasing OA to 72.62% with a Kappa value of 0.58. GTB ranked second, with its OA and Kappa values improving from 67.69% and 0.50 to 72.18% and 0.58 after feature selection. The impact of raster data quality on grassland classification accuracy was assessed through multisensor image fusion. Grassland classification using the Hue, Saturation, and Value (HSV) fused images showed higher OA (59.18%) and Kappa values (0.36) than the Brovey Transform (BT) and non-fused images. Finally, a web map was created to show grassland results within the Soil Landscapes of Canada (SLC) polygons, relating soil landscapes to grassland distribution and providing valuable information for decision-makers and researchers. Future work may include extending the current methodology by considering other influential variables, like meteorological parameters or soil properties, to create a comprehensive grassland inventory across the whole Prairie ecozone of Canada. Full article
Show Figures

Figure 1

Figure 1
<p>Geographical location of study area. Manitoba’s PE, with different ecoregions, is located in the province of Manitoba, Canada.</p>
Full article ">Figure 2
<p>The spatial distribution of the ground-truthing sampling for all LULC classes included in the classification of Manitoba’s PE grasslands.</p>
Full article ">Figure 3
<p>General overview of the major steps and workflow of the novel strategy for grassland classification, which is developed to improve ecological monitoring using multisource RS data and advanced ML techniques. This workflow integrates data from the S1, S2, L8, and L9 satellites. It involves major stages of image preprocessing, multitemporal composition, image fusion using HSV and BT, and advanced ML classifiers, including RF, SVM, and GTB. The classification is performed in three steps to achieve fine-scale identification of native, tame, and mixed grasses, starting from basic vegetation classification (Step 1) to detailed grassland class differentiation (Step 3); # represents the generated grassland map from each ML process. Ancillary field data, topographic features, and LULC information were incorporated as inputs to generate the final grassland maps for web-based visualization.</p>
Full article ">Figure 4
<p>Atmospheric effects on RS imagery demonstrate the influence of atmospheric conditions on the quality of RS data, specifically how clouds and shadows can occlude pixels and impact the accuracy of the reflected signal received by MSS sensors. (<b>a</b>) The pixel occluded by a shadow shows a scenario where a shadow, cast by an obstacle like a cloud, causes the pixel to be occluded, leading to distorted signals received by the sensor. (<b>b</b>) The pixel occluded by a cloud shows a situation where a cloud directly occludes the pixel, resulting in inaccurate data due to the cloud’s interference with the reflected sunlight that reaches the sensor.</p>
Full article ">Figure 5
<p>The steps of HSV method modified from Al-Wassai et al. [<a href="#B85-remotesensing-16-04730" class="html-bibr">85</a>]. After transforming the RGB image to HSV format, its V channel was replaced with the HR channel, which was then converted back to RGB mode.</p>
Full article ">Figure 6
<p>Comparison of step 2 grassland classification results using different ML models: (<b>a</b>) RF, (<b>b</b>) SVM, and (<b>c</b>) GTB.</p>
Full article ">Figure 7
<p>Classification accuracy varies with different input features ranked based on ANOVA for the RF, SVM, and GTB.</p>
Full article ">Figure 8
<p>Sampled areas before and after image fusion for image quality improvement: (<b>a</b>) Landsat 30 m, (<b>b</b>) HSV fused image, and (<b>c</b>) BT fused image.</p>
Full article ">Figure 9
<p>Scatter plots of fused and non-fused bands using BT and HSV approaches. Except for band 2 (B2), HSV had a higher r-squared. Around 1900 points were selected to build the scatter plots, and the color bar represents the point density, speeded from low density (Blue) to high density (Red).</p>
Full article ">Figure 10
<p>The detailed grassland classification of Manitoba’s PE using RF supervised ML classification model and S1 + S2 data combination.</p>
Full article ">Figure 11
<p>Distribution of mixed, tamed, and native grasslands across three ecoregions, highlighting the areas covered by each grassland class. The percentage listed for each ecoregion shows its proportion of the total grassland area, with Southwest Manitoba Uplands at 2.42%, Lake Manitoba Plain at 41.83%, and Aspen Parkland at 55.75%. The relative dominance of each grassland type across the Aspen Parkland, Lake Manitoba Plain, and Southwest Manitoba Uplands illustrates regional differences in land use and ecological composition.</p>
Full article ">Figure A1
<p>The list of all features with their scores. The numbers following spectral bands, VIs and backscatter variables indicate multiple composite images created during the growing season. Red points show the features that were excluded from classification models to achieve their highest OA and Kappa coefficient; (<b>a</b>) ANOVA F-Value of RF; (<b>b</b>) ANOVA F-Value of SVM; and (<b>c</b>) ANOVA F-Value of GTB.</p>
Full article ">Figure A2
<p>The classification maps of pixel-level fusion with RF approach using (<b>a</b>) Multispectral image, (<b>b</b>) HSV fused image, and (<b>c</b>) BT fused image.</p>
Full article ">Figure A3
<p>Out-of-bag (OOB) error for different numbers of trees and number of variables per split was calculated. Different numbers of variables per split tested are the square root of the total number of variables (SQRT), the total number of variables (ALL), and the natural logarithm of the total number of variables (Ln).</p>
Full article ">Figure A4
<p>(<b>a</b>) OA of classification for different kernel types in the SVM Model. (<b>b</b>) Grid search to find the best value for the Cost/Regularization parameter for Linear kernel in SVM.</p>
Full article ">Figure A5
<p>The effect of the number of trees on OA in GTB classification.</p>
Full article ">
18 pages, 5341 KiB  
Article
Comparing and Optimizing Four Machine Learning Approaches to Radar-Based Quantitative Precipitation Estimation
by Miaomiao Liu, Juncheng Zuo, Jianguo Tan and Dongwei Liu
Remote Sens. 2024, 16(24), 4713; https://doi.org/10.3390/rs16244713 - 17 Dec 2024
Viewed by 261
Abstract
To improve radar-based quantitative precipitation estimation (QPE) methods, this study investigated the relationship between radar reflectivity (Z) and hourly rainfall intensity (R) using data from 289 precipitation events in Shanghai between September 2020 and March 2024. Two Z-R relationship models were compared in [...] Read more.
To improve radar-based quantitative precipitation estimation (QPE) methods, this study investigated the relationship between radar reflectivity (Z) and hourly rainfall intensity (R) using data from 289 precipitation events in Shanghai between September 2020 and March 2024. Two Z-R relationship models were compared in terms of their fitting performance: Z = 270.81 R1.09 (empirically fitted relationship) and Z = 300 R1.4 (standard relationship). The results show that the Z = 270.81 R1.09 model outperforms the Z = 300 R1.4 model in terms of fitting accuracy. Specifically, the Z = 270.81 R1.09 model more effectively captures the nonlinear relationship between radar reflectivity and rainfall intensity, with a higher degree of agreement between the fitted curve and the observed data points. This model demonstrated superior performance across all 289 precipitation events. This study evaluated the performance of four machine learning approaches while incorporating five meteorological features: specific differential phase shift (KDP), echo-top height (ET), vertical liquid water content (VIL), differential reflectivity (ZDR), and correlation coefficient (CC). Nine QPE models were constructed using these inputs. The key findings are as follows: (1) For models with a single-variable input, the KAN deep learning model outperformed Random Forest, Gradient Boosting Decision Trees, Support Vector Machines, and the traditional Z-R relationship. (2) When six features were used as inputs, the accuracy of the machine learning models improved significantly, with the KAN deep learning model outperforming other machine learning methods. Compared to using only radar reflectivity, the KAN deep learning model reduced the MRE by 20.78%, MAE by 4.07%, and RMSE by 12.74%, while increasing the coefficient of determination (R2) by 18.74%. (3) The integration of multiple meteorological features and machine learning optimization significantly enhanced QPE accuracy, with the KAN deep learning model performing best under varying meteorological conditions. This approach offers a promising method for improving radar-based QPE, particularly considering seasonal, weather system, and precipitation stage differentiation. Full article
Show Figures

Figure 1

Figure 1
<p>Distribution of automatic weather stations (blue dots) and the Qingpu radar (red triangle) in Shanghai.</p>
Full article ">Figure 2
<p>Schematic diagram of the 5 × 5 radar range bin data above the automatic weather station.</p>
Full article ">Figure 3
<p>Workflow diagram for the relationship between the Z and R model, SVM, GBDT, RFR, and the KAN deep learning model for single-variable and multivariable precipitation estimation.</p>
Full article ">Figure 4
<p>Single-variable KAN deep learning neural network architecture.</p>
Full article ">Figure 5
<p>Multivariable KAN deep learning neural network architecture.</p>
Full article ">Figure 6
<p>Comparison of the estimation effects of two Z-R relationships.</p>
Full article ">Figure 7
<p>Scatter density plots of estimated vs. actual precipitation for five single-variable models: (<b>a</b>) Z = 270.81 R<sup>1.09</sup>; (<b>b</b>) SVM; (<b>c</b>) RF; (<b>d</b>) GBDT; and the (<b>e</b>) KAN deep learning method. The black solid line represents the ideal scenario where estimated values are perfectly aligned with observed values (<span class="html-italic">y = x</span>), while the red solid line indicates the actual relationship between estimated and observed values, highlighting the bias between them.</p>
Full article ">Figure 8
<p>Map of radar reflectivity and spatial distribution of univariate precipitation estimates using five different models at 06:00 UTC on 24 June 2024: (<b>a</b>) radar reflectivity; (<b>b</b>) Z = 270.81 R<sup>1.09</sup>; (<b>c</b>) Support Vector Machine model; (<b>d</b>) Random Forest model; (<b>e</b>) Gradient Boosting Decision Tree model; and (<b>f</b>) KAN deep learning model.</p>
Full article ">Figure 8 Cont.
<p>Map of radar reflectivity and spatial distribution of univariate precipitation estimates using five different models at 06:00 UTC on 24 June 2024: (<b>a</b>) radar reflectivity; (<b>b</b>) Z = 270.81 R<sup>1.09</sup>; (<b>c</b>) Support Vector Machine model; (<b>d</b>) Random Forest model; (<b>e</b>) Gradient Boosting Decision Tree model; and (<b>f</b>) KAN deep learning model.</p>
Full article ">Figure 9
<p>Scatter density plots of estimated vs. actual precipitation for four multivariable models: (<b>a</b>) SVM (multivariable); (<b>b</b>) GBDT (multivariable); (<b>c</b>) RF (multivariable); and (<b>d</b>) KAN deep learning method (multivariable). The red solid line represents the ideal scenario where estimated values are perfectly aligned with observed values (<span class="html-italic">y = x</span>), while the black solid line indicates the actual relationship between estimated and observed values, highlighting the bias between them.</p>
Full article ">Figure 10
<p>Map of radar reflectivity and spatial distribution of multivariable precipitation estimates using four different models at 06:00 UTC on June 24, 2024: (<b>a</b>) radar reflectivity map; (<b>b</b>) Support Vector Machine model; (<b>c</b>) Random Forest model; (<b>d</b>) Gradient Boosting Decision Tree model; and (<b>e</b>) KAN deep learning model.</p>
Full article ">
38 pages, 926 KiB  
Article
Data Mining Approaches in Predicting Entrepreneurial Intentions Based on Internet Marketing Applications
by Milan Krivokuća, Mihalj Bakator, Dragan Ćoćkalo, Marijana Vidas-Bubanja, Vesna Makitan, Luka Djordjević, Borivoj Novaković and Stefan Ugrinov
Appl. Sci. 2024, 14(24), 11778; https://doi.org/10.3390/app142411778 - 17 Dec 2024
Viewed by 350
Abstract
Amidst the globalization of markets, there has been a continuous intensification of competitiveness between enterprises. The modern business environment has caused a shift in how business is conducted. Opportunities and challenges arise, which put a tremendous pressure on enterprises regardless of size and [...] Read more.
Amidst the globalization of markets, there has been a continuous intensification of competitiveness between enterprises. The modern business environment has caused a shift in how business is conducted. Opportunities and challenges arise, which put a tremendous pressure on enterprises regardless of size and industry. Entrepreneurship in enterprises plays an important role in obtaining a competitive edge in the market. Thus, entrepreneurial intentions in enterprises can often shape the future and survival of the enterprise. In this paper, the prediction of entrepreneurial intentions in enterprises through Internet marketing predictors is addressed. For this, several statistical methods in data mining were used. First, simpler approaches such as linear regression, logistic regression were used. Afterward, classifier decision trees QUEST (quick, unbiased, efficient, statistical tree), and CHAID (chi-squared automatic interaction detection) were used. The sample for analysis was 137 enterprises from Serbia. Furthermore, a supervised machine learning algorithm, support vector machine (SVM) was used. Finally, a feed-forward neural network (FNN) was applied. The results varied across the applied approach, thus providing significant insights into the dynamics of data mining for prediction outcomes in an enterprise setting. Full article
Show Figures

Figure 1

Figure 1
<p>Graphical presentation of the linear regression model.</p>
Full article ">Figure 2
<p>QUEST decision tree algorithm for predicting the entrepreneurship intentions.</p>
Full article ">
30 pages, 9613 KiB  
Article
Mapping Soil Properties in Tropical Rainforest Regions Using Integrated UAV-Based Hyperspectral Images and LiDAR Points
by Yiqing Chen, Tiezhu Shi, Qipei Li, Chao Yang, Zhensheng Wang, Zongzhu Chen and Xiaoyan Pan
Forests 2024, 15(12), 2222; https://doi.org/10.3390/f15122222 - 17 Dec 2024
Viewed by 244
Abstract
For tropical rainforest regions with dense vegetation cover, the development of effective large-scale soil mapping methods is crucial to improve soil management practices to replace the time-consuming and laborious conventional approaches. While machine learning (ML) algorithms demonstrate superior predictability of soil properties over [...] Read more.
For tropical rainforest regions with dense vegetation cover, the development of effective large-scale soil mapping methods is crucial to improve soil management practices to replace the time-consuming and laborious conventional approaches. While machine learning (ML) algorithms demonstrate superior predictability of soil properties over linear models, their practical and automated application for predicting soil properties using remote sensing data requires further assessment. Therefore, this study aims to integrate Unmanned Aerial Vehicles (UAVs)-based hyperspectral images and Light Detection and Ranging (LiDAR) points to predict the soil properties indirectly in two tropical rainforest mountains (Diaoluo and Limu) in Hainan Province, China. A total of 175 features, including texture features, vegetation indices, and forest parameters, were extracted from two study sites. Six ML models, Partial Least Squares Regression (PLSR), Random Forest (RF), Adaptive Boosting (AdaBoost), Gradient Boosting Decision Trees (GBDT), Extreme Gradient Boosting (XGBoost), and Multilayer Perceptron (MLP), were constructed to predict soil properties, including soil acidity (pH), total nitrogen (TN), soil organic carbon (SOC), and total phosphorus (TP). To enhance model performance, a Bayesian optimization algorithm (BOA) was introduced to obtain optimal model hyperparameters. The results showed that compared with the default parameter tuning method, BOA always improved models’ performances in predicting soil properties, achieving average R2 improvements of 202.93%, 121.48%, 8.90%, and 38.41% for soil pH, SOC, TN, and TP, respectively. In general, BOA effectively determined the complex interactions between hyperparameters and prediction features, leading to an improved model performance of ML methods compared to default parameter tuning models. The GBDT model generally outperformed other ML methods in predicting the soil pH and TN, while the XGBoost model achieved the highest prediction accuracy for SOC and TP. The fusion of hyperspectral images and LiDAR data resulted in better prediction of soil properties compared to using each single data source. The models utilizing the integration of features derived from hyperspectral images and LiDAR data outperformed those relying on one single data source. In summary, this study highlights the promising combination of UAV-based hyperspectral images with LiDAR data points to advance digital soil property mapping in forested areas, achieving large-scale soil management and monitoring. Full article
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)
Show Figures

Figure 1

Figure 1
<p>Workflow of the soil property mapping method in tropical rainforest regions.</p>
Full article ">Figure 2
<p>(<b>a</b>) Geographic location of Hainan Province, China; spatial distribution of soil samples in (<b>b</b>) Diaoluo, and (<b>c</b>) Limu mountain.</p>
Full article ">Figure 3
<p>Top and side 3D view of LiDAR point cloud of (<b>a</b>) Diaoluo and (<b>b</b>) Limu mountains.</p>
Full article ">Figure 4
<p>Comparison of soil properties between samples from Diaoluo and Limu mountains: (<b>a</b>) pH; (<b>b</b>) soil organic carbon (SOC); (<b>c</b>) total nitrogen (TN); and (<b>d</b>) total phosphorus (TP). Dashed lines represent the mean value.</p>
Full article ">Figure 4 Cont.
<p>Comparison of soil properties between samples from Diaoluo and Limu mountains: (<b>a</b>) pH; (<b>b</b>) soil organic carbon (SOC); (<b>c</b>) total nitrogen (TN); and (<b>d</b>) total phosphorus (TP). Dashed lines represent the mean value.</p>
Full article ">Figure 5
<p>Importance ranking of the 15 selected features for predicting the (<b>a</b>) pH, (<b>b</b>) soil organic carbon (SOC), (<b>c</b>) total nitrogen (TN), and (<b>d</b>) total phosphorus (TP).</p>
Full article ">Figure 6
<p>Scatter plots of the measured values against soil property levels predicted by the optimal models: (<b>a</b>) pH predicted by the GBDT model; (<b>b</b>) soil organic carbon (SOC) predicted by the XGBoost model; (<b>c</b>) total nitrogen (TN) predicted by the GBDT model; (<b>d</b>) total phosphorus (TP) predicted by the XGBoost model.</p>
Full article ">Figure 7
<p>Spatial distributions of the soil properties, including pH, soil organic carbon (SOC), total nitrogen (TN), and total phosphorus (TP), in (<b>a</b>) Diaoluo and (<b>b</b>) Limu mountains.</p>
Full article ">Figure 7 Cont.
<p>Spatial distributions of the soil properties, including pH, soil organic carbon (SOC), total nitrogen (TN), and total phosphorus (TP), in (<b>a</b>) Diaoluo and (<b>b</b>) Limu mountains.</p>
Full article ">
20 pages, 10226 KiB  
Article
Coffee Leaf Rust Disease Detection and Implementation of an Edge Device for Pruning Infected Leaves via Deep Learning Algorithms
by Raka Thoriq Araaf, Arkar Minn and Tofael Ahamed
Sensors 2024, 24(24), 8018; https://doi.org/10.3390/s24248018 - 16 Dec 2024
Viewed by 312
Abstract
Global warming and extreme climate conditions caused by unsuitable temperature and humidity lead to coffee leaf rust (Hemileia vastatrix) diseases in coffee plantations. Coffee leaf rust is a severe problem that reduces productivity. Currently, pesticide spraying is considered the most effective [...] Read more.
Global warming and extreme climate conditions caused by unsuitable temperature and humidity lead to coffee leaf rust (Hemileia vastatrix) diseases in coffee plantations. Coffee leaf rust is a severe problem that reduces productivity. Currently, pesticide spraying is considered the most effective solution for mitigating coffee leaf rust. However, the application of pesticide spray is still not efficient for most farmers worldwide. In these cases, pruning the most infected leaves with leaf rust at coffee plantations is important to help pesticide spraying to be more efficient by creating a more targeted, accessible treatment. Therefore, detecting coffee leaf rust is important to support the decision on pruning infected leaves. The dataset was acquired from a coffee farm in Majalengka Regency, Indonesia. Only images with clearly visible spots of coffee leaf rust were selected. Data collection was performed via two devices, a digital mirrorless camera and a phone camera, to diversify the dataset and test it with different datasets. The dataset, comprising a total of 2024 images, was divided into three sets with a ratio of 70% for training (1417 images), 20% for validation (405 images), and 10% for testing (202 images). Images with leaves infected by coffee leaf rust were labeled via LabelImg® with the label “CLR”. All labeled images were used to train the YOLOv5 and YOLOv8 algorithms through the convolutional neural network (CNN). The trained model was tested with a test dataset, a digital mirrorless camera image dataset (100 images), a phone camera dataset (100 images), and real-time detection with a coffee leaf rust image dataset. After the model was trained, coffee leaf rust was detected in each frame. The mean average precision (mAP) and recall for the trained YOLOv5 model were 69% and 63.4%, respectively. For YOLOv8, the mAP and recall were approximately 70.2% and 65.9%, respectively. To evaluate the performance of the two trained models in detecting coffee leaf rust on trees, 202 original images were used for testing with the best-trained weight from each model. Compared to YOLOv5, YOLOv8 demonstrated superior accuracy in detecting coffee leaf rust. With a mAP of 73.2%, YOLOv8 outperformed YOLOv5, which achieved a mAP of 70.5%. An edge device was utilized to deploy real-time detection of CLR with the best-trained model. The detection was successfully executed with high confidence in detecting CLR. The system was further integrated into pruning solutions for Arabica coffee farms. A pruning device was designed using Autodesk Fusion 360® and fabricated for testing on a coffee plantation in Indonesia. Full article
(This article belongs to the Special Issue Deep Learning for Intelligent Systems: Challenges and Opportunities)
Show Figures

Figure 1

Figure 1
<p>Coffee leaf rust disease under severe conditions.</p>
Full article ">Figure 2
<p>Conceptual research framework.</p>
Full article ">Figure 3
<p>Arabica coffee farm in Majalengka District.</p>
Full article ">Figure 4
<p>Image augmentation: (<b>a</b>) original image, (<b>b</b>) flipped horizontally, (<b>c</b>) flipped vertically, (<b>d</b>) rotated 90°, (<b>e</b>) rotated 180°, and (<b>f</b>) rotated 270°.</p>
Full article ">Figure 5
<p>Stages of coffee leaf rust disease in the dataset: (<b>a</b>) healthy, (<b>b</b>) early stage, (<b>c</b>) severe stage with chlorosis, (<b>d</b>) severe stage with chlorosis and defoliation.</p>
Full article ">Figure 6
<p>CLR detection process on the YOLO framework.</p>
Full article ">Figure 7
<p>YOLOv5 training configuration.</p>
Full article ">Figure 8
<p>YOLOv8 training configuration.</p>
Full article ">Figure 9
<p>Detection results on the testing dataset comprising images from the digital mirrorless camera: (<b>a</b>) original image, (<b>b</b>) YOLOv5 detection, and (<b>c</b>) YOLOv8 detection.</p>
Full article ">Figure 10
<p>Detection results on the testing dataset comprising images from the phone camera: (<b>a</b>) original image, (<b>b</b>) YOLOv5 detection, and (<b>c</b>) YOLOv8 detection.</p>
Full article ">Figure 11
<p>Real-time detection of CLR on Jetson Nano using YOLOv8: (<b>a</b>) infected and (<b>b</b>) healthy images.</p>
Full article ">Figure 12
<p>Challenge in dataset quality: (<b>a</b>) low-quality image (blurry) and (<b>b</b>) occluded object.</p>
Full article ">Figure 13
<p>Design of the pruning device components. (<b>a</b>) Pruning device, i.e., cutting part, (<b>b</b>) slicing part, and (<b>c</b>) vacuum tank.</p>
Full article ">Figure 14
<p>Design of the test for real-time detection before deployment in the field at an Arabica coffee farm: (<b>a</b>) Device components and (<b>b</b>) real-time detection.</p>
Full article ">
24 pages, 12705 KiB  
Article
Site Selection of Elderly Care Facilities Based on Multi-Source Spatial Big Data and Integrated Learning
by Yin Zhang, Junhong Zhu, Fangyi Li and Yingjie Wang
ISPRS Int. J. Geo-Inf. 2024, 13(12), 451; https://doi.org/10.3390/ijgi13120451 - 15 Dec 2024
Viewed by 365
Abstract
This study explores a method to improve the site selection for elderly care facilities in an aging region, using Hefei City, China, as the study area. It combines topographic conditions, population distribution, economic development status, and other multi-source spatial big data at a [...] Read more.
This study explores a method to improve the site selection for elderly care facilities in an aging region, using Hefei City, China, as the study area. It combines topographic conditions, population distribution, economic development status, and other multi-source spatial big data at a 500 m grid scale; constructs a prediction model for the suitability of sites for elderly care facilities based on integrated learning; and carries out a comprehensive evaluation and feature importance analysis. Finally, it uses trained random forest (RF) and gradient boosting decision tree (GBDT) models to predict preliminary site selection results for elderly care facilities. A second screening that compares three degrees of population aging is conducted to obtain the final site selection results. The results show the following: (1) The comprehensive evaluation indexes of the two integrated learning models, RF and GBDT, are above or below 80% as needed, which is better than the four single learning models. (2) The prediction results of the RF and GBDT models have 87.9% and 78.4% fit to existing elderly facilities, respectively, which indicates that the methods are reasonable and reliable. (3) The results of both the RF and GBDT models indicate that the closest distance to healthcare facilities and the size of the population distribution are the two most important factors affecting the location of elderly care facilities. (4) The results of the preliminary site selection show an overall spatial distribution of higher suitability in the main urban area and lower suitability in the suburban counties. The secondary screening finds that priority needs to be given to the periphery of the main urban area and to Lujiang County and other surrounding townships that have a more serious degree of aging as soon as possible in the site selection of new elderly care facilities. Full article
Show Figures

Figure 1

Figure 1
<p>Research framework.</p>
Full article ">Figure 2
<p>Administrative divisions and distribution of existing elderly facilities in Hefei City.</p>
Full article ">Figure 3
<p>Terrain conditions. (<b>a</b>) Altitude. (<b>b</b>) Slope. (<b>c</b>) Curvature.</p>
Full article ">Figure 4
<p>Economic situation. (<b>d</b>) GDP. (<b>e</b>) Average house price. (<b>f</b>) Land transaction price. (<b>g</b>) Companies. (<b>h</b>) Financial institutions. (<b>i</b>) Shopping. (<b>j</b>) Business locations. (<b>k</b>) Hotel accommodation.</p>
Full article ">Figure 5
<p>Transportation accessibility. (<b>l</b>) The closest distance to the road network. (<b>m</b>) The closest distance to the river network. (<b>n</b>) Transportation facilities. (<b>o</b>) Automobile-related services.</p>
Full article ">Figure 6
<p>Healthcare facilities. (<b>p</b>) The closest distance to a healthcare facility. (<b>q</b>) Density of healthcare facilities.</p>
Full article ">Figure 7
<p>Ease of living. (<b>r</b>) Dining. (<b>s</b>) Leisure and entertainment. (<b>t</b>) Tourist attractions. (<b>u</b>) Sports and fitness. (<b>v</b>) Life services. (<b>w</b>) Science and cultural education.</p>
Full article ">Figure 8
<p>Population distribution. (<b>x</b>) Population distribution scale. (<b>y</b>) Population distribution density. (<b>z</b>) Distribution scale of elderly population.</p>
Full article ">Figure 9
<p>Comparison of the six algorithms’ evaluation metrics.</p>
Full article ">Figure 10
<p>Comparison of preliminary site selection results based on RF and GBDT models. (<b>a</b>) Preliminary site selection results based on RF. (<b>b</b>) Preliminary site selection results based on GBDT.</p>
Full article ">Figure 11
<p>Screening results for a population of ≥8000 persons aged 65 years with ≥7% aging. (<b>a</b>) Screening results based on RF. (<b>b</b>) Screening results based on GBDT.</p>
Full article ">Figure 12
<p>Screening results for a population of ≥8000 persons aged 65 years with ≥14% aging. (<b>a</b>) Screening results based on RF. (<b>b</b>) Screening results based on GBDT.</p>
Full article ">Figure 13
<p>Screening results for a population of ≥8000 persons aged 65 years with ≥20% aging. (<b>a</b>) Screening results based on RF. (<b>b</b>) Screening results based on GBDT.</p>
Full article ">
Back to TopTop