ANN-based swarm intelligence for predicting expansive soil swell pressure and compression strength

Fazal E. Jalal^1,2,
Mudassir Iqbal³,
Waseem Akhtar Khan⁴,
Arshad Jamal⁵,
Kennedy Onyelowe⁶ &
…
Lekhraj⁷

2407 Accesses
19 Citations
Explore all metrics

An Author Correction to this article was published on 06 November 2024

This article has been updated

Abstract

This research suggests a robust integration of artificial neural networks (ANN) for predicting swell pressure and the unconfined compression strength of expansive soils (P_sUCS-ES). Four novel ANN-based models, namely ANN-PSO (i.e., particle swarm optimization), ANN-GWO (i.e., grey wolf optimization), ANN-SMA (i.e., slime mould algorithm) alongside ANN-MPA (i.e., marine predators’ algorithm) were deployed to assess the P_sUCS-ES. The models were trained using the nine most influential parameters affecting P_sUCS-ES, collected from a broader range of 145 published papers. The observed results were compared with the predictions made by the ANN-based metaheuristics models. The efficacy of all these formulated models was evaluated by utilizing mean absolute error (MAE), Nash–Sutcliffe (NS) efficiency, performance index ρ, regression coefficient (R²), root mean square error (RMSE), ratio of RMSE to standard deviation of actual observations (RSR), variance account for (VAF), Willmott’s index of agreement (WI), and weighted mean absolute percentage error (WMAPE). All the developed models for P_s-ES had an R significantly > 0.8 for the overall dataset. However, ANN-MPA excelled in yielding high R values for training dataset (TrD), testing dataset (TsD), and validation dataset (VdD). This model also exhibited the lowest MAE of 5.63%, 5.68%, and 5.48% for TrD, TsD, and VdD, respectively. The results of the UCS model’s performance revealed that R exceeded 0.9 in the TrD. However, R decreased for TsD and VdD. Also, the ANN-MPA model yielded higher R values (0.89, 0.93, and 0.94) and comparatively low MAE values (5.11%, 5.67, and 3.61%) in the case of PSO, GWO, and SMA, respectively. The UCS models witnessed an overfitting problem because the aforementioned R values of the metaheuristics were 0.62, 0.56, and 0.58 (TsD), respectively. On the contrary, no significant observation was recorded in the VdD of UCS models. All the ANN-base models were also tested using the a-20 index. For all the formulated models, maximum points were recorded to lie within ± 20% error. The results of sensitivity as well as monotonicity analyses depicted trending results that corroborate the existing literature. Therefore, it can be inferred that the recently built swarm-based ANN models, particularly ANN-MPA, can solve the complexities of tuning the hyperparameters of the ANN-predicted P_sUCS-ES that can be replicated in practical scenarios of geoenvironmental engineering.

A Comparative Analysis of Hybrid Computational Models Constructed with Swarm Intelligence Algorithms for Estimating Soil Compression Index

Article 16 May 2022

Harnessing Nature-Inspired Soft Computing for Reinforced Soil Bearing Capacity Prediction: A Neuro-nomograph Approach for Efficient Design

Article 27 July 2023

Advancing earth science in geotechnical engineering: A data-driven soft computing technique for unconfined compressive strength prediction in soft soil

Article 17 August 2024

Introduction

Expansive behaviour of swelling clay is a complicated process as prominent clay minerals, for instance, kaolinite, illite, montmorillonite etc. are present, which leads to higher swell-shrink as the moisture fluctuates. The physicochemical properties of the expansive soils (ES) are immensely perplexed. Their volume change behaviour is attributed to a typical S-shaped swelling characteristics curve in the form of a three-phase swelling which can be further compartmentalized as preliminary, primary and secondary swelling stages^{1,2,3,4,5,6,7,8}.

Firstly, the larger stresses in the form of swelling pressures (P_s), ASTM D4546, are generated when the volume change is blocked. The swelling pressure of ES (P_s-ES) is a fundamental parameter in estimating the behaviour of soft clays as well as an imperative characteristic of designing geotechnical structures^7,9,10. According to Meshram et al.¹¹, it offers comparatively better correlations using mineralogical, geotechnical and microfabric characteristics. Several direct and indirect techniques are available to predict the P_s-ES such that the latter methods are based on experimental results and engineering judgement. Furthermore, Du et al.¹² and Yin et al.¹³ suggested that to characterize the P_s under various conditions, numerous predictive models have been developed, such as Gouy–Chapman diffused double layer models, heat-driven/energy-related models, and data-driven/hybrid models are the three types of existent models¹⁴. Secondly, the unconfined compressive strength of ES (UCS-ES) is a desideratum for various parameters used in road design, primarily for highway construction^15,16,17. Also, the brittle behaviour of the ES yields low tensile strength thus leading to lesser UCS, and ASTM D2166, which could be improved by soil stabilization¹⁸. For instance, the UCS of lime-treated expansive soil increases at higher CaO content for various conditions, and additionally, the other engineering properties are also enhanced^15,19,20. The highest UCS of CaO-stabilized ES was recorded for the samples compacted at their optimum moisture content (OMC)¹⁵. While evaluating the UCS for various drying-wetting cycles, Wu et al.²¹ reported that the UCS-ES decreased by around 50% after the first drying-wetting cycle (i.e., UCS is inversely related to the drying-wetting cycles), whereas it perpetually increased at extended curing periods.

A rich amount of literature exists on the influence of the ES characteristics, (such as distribution of the grain sizes, consistency limits, compaction characteristics, and swelling, among others) on their mechanical properties. For instance, the plasticity index (PI) increases at higher montmorillonite content which ultimately increases the P_sUCS-ES. This cohesive nature can be associated with the low specific surface area (SSA) with higher cation exchange capacity (CEC) value of the smectites in the ES²². Similarly, maximum dry density (MDD) is another major indicator of the compressibility of the ES, and its high value depicts larger UCS and lesser P_s, whereas the OMC behaves vice-versa²³. Additionally, the natural water content (w_n) also substantially impacts the swell-strength characteristics of various ES. At high values of the w_n, more water enters the clay minerals which increases the swelling thereby leading to higher P_s and lesser values of the UCS-ES^24,25.

Various machine learning (ML) algorithms approaches have been widely considered in the recent past that are capable of accurately predicting many real-world problems^26,27,28,29. The recently developed AI techniques include artificial neural networks (ANNs)³⁰, genetic-based programming³¹, eXtreme gradient boosting (XGBoost)^32,33, multivariate adaptive regression splines (MARS)³⁴, alternate decision trees (ADTs), logistic regression (LR), M5 model trees, genetic algorithm (GA) among others^35,36. Giustolisi et al.³⁷ classified the mathematical models, i.e., white, black, and grey box models (WBM, BBM, and GBM, respectively), such that the WBMs exhibit parameters based on physical laws which form accurate physical associations, but their hidden mechanism has not been fully understood. The BBMs incorporate regressive data-driven systems wherein the active associations are not known and require to be predicted. While the GBMs are methodical systems wherein a mathematical framework efficaciously determines the overall behaviour. In this regard, the ANN is classified as ‘BBM’ due to lesser transparency and their inability to form closed-form prediction equations^38,39. The ML models are deployed to compute the P_sUCS-ES, which are imperative for designing foundations as well as constructing pavements resting on swelling soils. In addition, these laboratory tests are time-consuming, whereas the problematic soils are found in over 40 countries across the globe³¹. The main advantage of the ANN approach to calculate P_sUCS-ES is the capability to model complex, non-linear relationships between input variables and ES characteristics which lead to robust predictions compared to conventional methods The PSO is advantageous because of its rapid convergence ability, requiring only fewer parameters to adjust thus proving to be efficacious in dealing optimization problems⁴⁰. GWO is advantageous owing to its balanced exploration and exploitation techniques that lead to enhanced convergence speed. Moreover, this algorithm is simple to implement and understand which renders it accessible to researchers and practitioners. SMA exhibits various merits because it is easy to implement, adaptable, and bio-inspired, and it explores the search space efficiently by simulating the growth and foraging behaviour of slime moulds. Finally, inspired by the hunting behaviour of marine predators the MPA is advantageous because of its diversity in maintenance, adaptive strategy, and efficient convergence which makes it suitable for various real-world applications^{40,41,42,43,44}.

ANNs are computer programs which are used to estimate and categorize issues related to the information handling of the data^45,46. They are inspired by the biological structure of our brain as well as the nervous system which directly captures the association between inputs and outputs, however, there is no empirical formulation yielded^47,48. The formulated ANN model depicted that soil biochar composite having 5% biochar replacement yielded excellent results in lessening soil erosion. The ANN-based model forecasted the soil water characteristics curves reasonably well⁴⁹. On the contrary, it was found by Das et al.⁵⁰ that the SVM model outclassed the developed ANN models. In yet another study on P_s-ES and UCS-ES (known as P_sUCS-ES), the results of ANN modelling yielded the most satisfactory values in terms of R-value in the case of training as well as testing datasets (TrD and TsD, respectively). The comparison results showed that both the GEP and ANN are efficient and robust methods to determine the P_sUCS-ES^31,51. Therefore this study incorporates five advanced optimization methods, such as PSO⁵², GWO⁵³, SMA⁵⁴, MPA⁴³ alongside the ANN modelling to enhance the predictive capability. Ikizler et al.⁵⁵ formulated an ANN model to estimate the horizontal and vertical P_s-ES. The ANN formulation decreases the number of laboratory tests thereby attaining cost-effectiveness and robustness. Kumar et al.^32,56 used hybrid ANNs and deep learning-based simulation models (on 81 case histories of static pile load tests conducted in various regions of Vietnam) facilitating the safe and economical designs of eco-friendly piles. In a variety of geotechnical engineering systems, a lesser number of easily calculated input factors were used to model the unsaturated ES for the sake of predicting their mechanical behaviour⁵⁷. In another finding, the modelling results of ANN estimated the mechanical properties of pond ash stabilized ES impressively (with a coefficient of correlation R ≈ 0.96)⁵⁸. Recently, new empirical prediction models were developed by Jalal et al.³¹ for the determination of P_sUCS-ES by deploying neural networks, i.e., ANN, adaptive neuro-fuzzy inference system (ANFIS), and genetic programming approach, i.e., GEP^59,60. The results revealed that both the GEP as well as ANN are efficient methods to accurately compute P_sUCS-ES. Furthermore, they suggested reliable and easy-to-use GEP equations for the prediction of P_sUCS-ES are given in Eqs. (1) and (2), respectively.

$$ P_{s} = CF - \left( {\left( {\frac{7.25}{{G_{s} }}} \right)(OMC - SP + 0.91)} \right) + \left( {\left( {\frac{1}{3.71 + OMC} \times (MDD + 0.72)PI} \right) + OMC} \right) + \left( {\frac{1}{{\left( {\frac{1}{silt} - 52} \right)}} \times ( - 0.43\rho_{d\max }^{2} )} \right) $$

(1)

$$ UCS = \left( {\frac{sand(OMC - CF)}{{2w_{n} - OMC + G_{s} }}} \right) + \left( {sand + \rho_{d\max } + 0.19} \right)(\rho_{d\max } - 9.86) - \left( {\frac{silt}{{CF}}} \right) + (CF - (3 \times sand - 2G_{s} + SP)) $$

(2)

where CF is clay fraction, G_s is specific gravity, MDD is maximum dry density, OMC is optimum moisture content, PI is plasticity index, SP is the swell percent, and w_n is the natural moisture content.

The determination of P_s-ES is time-consuming while the prediction of the UCS-ES is also cumbersome from the standpoint of time and cost. Previously, the P_sUCS-ES have been determined by developing a variety of correlations using traditional statistical analyses (including GEP and ANN) wherein smaller R-values were recorded and the results were also not optimized^31,38. However, Jumaa and Yousif⁶¹ found that the ANN outclassed the GEP model by yielding comparatively accurate performance. From the standpoint of these uncertainties, the existing research utilizes ANN in conjunction with PSO, GWO, SMA, and MPA to improve the past models to determine P_sUCS-ES. Hyperparameter optimization is critical in ML model development which ensures optimal efficiency by fine-tuning parameters such as learning rates and regularization strengths. It is noteworthy to mention that the hyperparameter optimization review often highlights its role in improving model robustness while addressing issues such as computational complexity and overfitting. They also take into consideration the emerging methods, for example, Bayesian optimization and evolutionary algorithms, that encompass more efficacious exploration of hyperparameter spaces for better model generalization as well as robustness. Furthermore, the ANN-optimized models developed in the current study by using easily determinable geomechanical properties corroborated by past research^31,62,63,64. Note that, P_s and UCS were the two output predictor variables. The motive of the research was to optimize the ANN models using recently developed algorithms, and to compare the performance of the developed models, such as (i) ANN-PSO, (ii) ANN-GWO, (iii) ANN-SMA, and (iv) ANN-MPA, for the estimation of P_sUCS-ES by deploying simple geotechnical tests.

Methodology

ANN

These are simple yet dependable algorithmic models^40,41. To accomplish particular tasks, the ANNs try to mimic how the human nervous system and brain work. Their use has significantly increased in recent years across several technical disciplines. In addition, they have also been applied in evaluating different characteristics of the ES³¹. Their structure as well as functioning, such that a distinctive ANN structure comprises many processing elements (i.e., nodes) which have been arranged in layers (like input, output, and hidden layer/s) has been previously described. Note that the best-hidden layer can be found through the trial and error method⁶⁵. The input value of the preceding layer $({x}_{i})$ over every node is multiplied with the help of varying connection weight $\left({w}_{ji}\right).$ The addition process of weighted input signals took place on every node alongside the addition of a threshold value $\left({\phi }_{j}\right)$, too. After that, a non-linear transfer function $\left(f((.\right))$ is used over the joint input $\left({I}_{j}\right)$ for generating node output $\left({y}_{j}\right).$ It is important to state that the transfer functions commonly employed are linear and/or sigmoidal⁶⁶.

$$ I_{ij} = \sum\limits_{i = 1}^{n} {w_{ji} + \phi_{j} } $$

(3)

$$ y_{j} = f(I_{j} ) $$

(4)

The output of a layer acts as input at nodes in subsequent layers whereas this procedure is iteratively repeated. The entire process is given in Fig. 1 whereas the pertaining formulae are expressed in Eqs. (3) and (4). The data is induced to the input layer after which the system weights must be attuned iteratively according to set guidelines for determining the best combination of weights via a ‘training’ procedure with the help of deploying Levenberg–Marquardt backpropagation approach. Finally, after sufficient training, the model is terminated when the changes in resulting error are minimal. Moreover, the entire data is divided into three distinct sets,

i.e., TrD, TsD and VdD. It is important to state that the ANNs use the training set to identify patterns in the data. Also, the network training evaluates the combination of weights ${w}_{ji}$ among different neurons for yielding a global minimum of the error function by(Eq. 5). Furthermore, the main objective of TsD aims at assessing the robustness of the trained network bt finally evaluating the VdD.

$$ y_{k}^{j} = f\left( {\sum\nolimits_{j = 1}^{nk - 1} {w_{ji}^{k} } + y_{j}^{n - 1} } \right) $$

(5)

More information about the ANN algorithm and accompanying mechanism can be found in available literature^22,41,47,64.

PSO

It is another evolutionary programming approach that is influenced by the flocking habits of birds as well as fish. This concept was given by Kennedy and Eberhart⁶⁷ for the first time. The algorithm exhibits its roots in social psychology and artificial lifespan as well as engineering. Like other population-based metaheuristics, PSO has a “population of particles” that fly through the hyperspace solution via set velocities. Note that the velocities of each particle can be stochastically updated at each iteration based on the historical best location. A defined fitness function is used to derive both the particle as well as the best positions in the neighbourhood⁶⁸.

In addition, each particle's motion naturally progresses towards the optimal or nearly optimal solution. At each iteration, the position of an individual particle can be adjusted accordingly. After that, the next generation swarm is produced based on revised particle locations seeing their individual best location (${L}_{best}$) and the entire swarm’s best position (${G}_{best}$) as depicted in Fig. 2. The positions of the particles and their velocities are computed by Eqs. (6) and (7):

$$ V{}_{i}^{t + 1} = wV_{t}^{t} + m_{1} n_{1} (L_{best,i}^{t} - Y_{i}^{t} ) + m_{2} n_{2} (G_{best,i}^{t} - Y_{i}^{t} ) $$

(6)

$$ Y{}_{i}^{t + 1} = Y_{i}^{t} + V{}_{i}^{t + 1} $$

(7)

where, ${V}_{i}^{t+1}$ and ${V}_{i}^{t}$ represent the particle $i$ velocities in the case of iterations t + 1 as well as t, respectively. Similarly, ${Y}_{i}^{t+1}$ and ${Y}_{i}^{t}$ denote the i^th positions in the case of iterations t + 1 and t, respectively. The parameters $w,$ indicates the cognitive social effects, ${m}_{1} and {m}_{2}$ denote the inertial parameters, and ${n}_{1} and {n}_{2}$ correspond to the matrix of arbitrary numbers with range [0,1]. The ${L}_{best}$ and the ${G}_{best}$ in the following generation is obtained using Eqs. (8) and (9):

$$ L_{best,i}^{t + 1} = \left\{ \begin{gathered} Y_{i}^{t + 1} ,h(Y_{i}^{t + 1} ) < h(L_{best,i}^{t} ) \hfill \\ L_{best,i}^{t} ,h(Y_{i}^{t + 1} ) \ge h(L_{best,i}^{t} ) \hfill \\ \end{gathered} \right. $$

(8)

$$ G_{best,i}^{t + 1} = \arg \min \{ h(L_{best,0}^{t + 1} ), \ldots ,h(L_{best,ns}^{t + 1} ),h $$

(9)

where ${n}_{s}$ represents the summation of particles in the swarm.

As the exploration for optimum solution progresses, the random and irregular movement of particles (swarm) in search space now closely replicates the swarm of mosquitoes. The main strength of adopting the PSO for complex real-life problems is that it is not largely influenced by non-linearity. Furthermore, PSO can exhibit better and faster convergence to optimum solutions in a variety of scenarios. It is computationally more exhaustive and robust than a variety of exact mathematical methods. However, like other metaheuristics, a key issue in applying PSO is to establish a reasonable trade-off between intensification (exploitation) as well as diversification (exploration). In recent years the algorithm has witnessed widespread applications such as power systems, traffic control⁷¹, geotechnical investigation⁷² and, rainfall-runoff modelling⁷³.

GWO

Mirjalili et al.⁷⁴ floated the concept of this swarm intelligence optimization approach (metaheuristic algorithm) for the first time. The GWO draws inspiration from the cooperative hunting behaviour observed in grey wolves⁷⁵. Metaheuristic algorithms are designed to generate high-quality solutions from a random population. The generation takes inspiration from natural system behaviours and continues until a specific termination condition is fulfilled⁷⁶. GWO is based on three key steps i.e., surrounding prey, hunting, and sand attacking prey. To mathematically simulate wolf leadership order, assume the finest solution is alpha (α), the preceding one is beta (β), and finally it is the delta (δ). All other possible solutions can be assumed as omega (ω).

During the hunt the grey wolves encircle prey; the following equations (Eqs. 10 and 11) are given to numerically simulate grey wolf encircling behaviour.

$$ \vec{D} = \left| {C.\vec{X}_{prey(t)} - \vec{X}_{wolf} (t)} \right| $$

(10)

$$ \vec{X}_{wolf} (t + 1) = \vec{X}_{prey} (t) - \vec{A}.\vec{D} $$

(11)

$\overrightarrow{A}$ and $\overrightarrow{\text{C}}$ are the coefficient vectors, t represents existing iterations, and the prey position vector is ${\overrightarrow{\text{X}}}_{\text{prey}}$, and the grey wolf position vector is ${\overrightarrow{\text{X}}}_{\text{wolf}}$. The calculation of vectors $\overrightarrow{A}$ and $\overrightarrow{\text{C}}$ is according to Eqs. (12) and (13);

$$ \vec{A} = 2ar_{1} - a $$

(12)

$$ \vec{C} = 2r_{2} $$

(13)

where r₁ and r₂ are random vectors in the interval [0, 1], whereas a is linearly lowered from 2 to 0 throughout iterations.

Alpha (α) has usually guided the hunt, whereas, β as well as δ may take part in hunting occasionally. To mathematically model grey wolf hunting behaviour⁷⁷, the first three optimal solutions are preserved, while ω are required to relocate by Eqs. (14) to (20).

$$ \vec{D}_{alpha} = \left| {C_{1} \cdot \vec{X}_{alpha} - \vec{X}} \right| $$

(14)

$$ \vec{D}_{beta} = \left| {C_{2} \cdot \vec{X}_{beta} - \vec{X}} \right| $$

(15)

$$ \vec{D}_{delta} = \left| {C_{3} \cdot \vec{X}_{delta} - \vec{X}} \right| $$

(16)

$$ \vec{X}_{1} = \vec{X}_{alpha} - A_{1} \cdot \vec{D}_{alpha} $$

(17)

$$ \vec{X}_{2} = \vec{X}_{beta} - A_{2} \cdot \vec{D}_{beta} $$

(18)

$$ \vec{X}_{3} = \vec{X}_{delta} - A_{3} \cdot \vec{D}_{delta} $$

(19)

$$ \vec{X}(t + 1) = \frac{{\vec{X}_{1} + \vec{X}_{2} + \vec{X}_{3} }}{3} $$

(20)

The new solution appears to be positioned at random within α, β, and δ. It is to say that, the new solution position can be evaluated using these three best solutions. The position updating in GWO is presented in Fig. 3. GWO is advantageous to optimize problems because of its viable properties in contrast to other metaheuristics⁷⁸. This metaheuristic algorithm is also known for its simplicity, scalability, and special capability to keep the appropriate balance between diversification and intensification. In recent years, GWO has been employed for numerous engineering implications^79,80,81,82.

SMA

Li et al.⁵⁴ introduced a modified stochastic optimization method, i.e., SMA, that entirely relies on the oscillating behaviour of slime mould (SM). The SMA independently follows the oscillation method, replicating the Physarum polycephalum activation and morphological changes of SM. This is done during exploration, searching and foraging all without finishing the lifespan. The SMA method incorporates highly customized and adaptive weights for modelling and generating true and false responses to the reaction of the SM. Thus, it creates the optimum path to link food using improved space exploration skills and great exploitation tendencies^44,54,83.

The SMA optimization process operates in three distinct phases; (a) searching and approaching food using smell, (b) try wrapping the food as per the quality and composition of food, and (c) swinging and oscillating to seek a superior location^54,84. The comprehensive mathematical explanation of every phase is examined in this section and is given in Fig. 4.

1st Phase (Searching and approaching food)

In the first phase, the SM seek and approach food owing to its odour in the atmosphere as mathematically expressed by Eqs. (21) to (22).

When, $r<q$, then;

$$ Y_{i} = Y_{b} (t) + x_{b} [W_{t} .Y_{A} (t) - Y_{B} (t)] $$

(21)

When, $r\ge q$, then;

$$ Y_{i} = (x_{c} .Y_{i} ) $$

(22)

where, ${Y}_{i}$ refers to the location and orientation of the SM in the current cycle ($t$). ${Y}_{A}$ and ${Y}_{B}$ are two arbitrarily selected SM entities with weight (${W}_{t}$). ${Y}_{b}$ depicts the position of an entity with maximum saturation and concentration of odour. ${x}_{c}$ is the factor which lowers down linearly from 1 to 0. The other additional parameters such as $q$, ${x}_{b}$, and $b$ are specified in Eqs. (23) to (25).

$$ x_{b} = [ - b,b] $$

(23)

$$ b = \arctan h\left\{ { - \left( {\frac{t}{{t_{\max } }}} \right) + 1} \right\} $$

(24)

$$ q = \tanh \left| {S_{i} - F} \right|;i = 1,2, \ldots ,m $$

(25)

where, ${S}_{i}$ as well as $F$ indicate fitness of ${Y}_{i}$ and best performance among the total iterations completed, respectively. The ${W}_{t}$ can be explicitly stated in Eq. (26).

$$ W_{t} (smellindex(i)) = \left\{ {\begin{array}{*{20}c} {1 + r \cdot \log \left( {\frac{{BF - S_{i} }}{BF - WF} + 1} \right);Conditions} \\ {1 - r \cdot \log \left( {\frac{{BF - S_{i} }}{BF - WF} + 1} \right);Others} \\ \end{array} } \right. $$

(26)

$$ Smell - index = Sort(S) $$

(27)

where, $r$ shows the randomized variable between 0 and 1. $WF$ and $BF$ indicate the worst and optimum fitness within the latest iteration or cycle. The $smell index$ shows the arranged collection of best fittest scores, given as Eq. (27).

2nd Phase (Wrapping food as per quality)

In the second phase, the vascular tissues of SM are squeezed. The ${W}_{t}$ of the space is regulated. The exploration and research of additional locations are conducted in this phase. When the bio-oscillator produces stronger and greater waves the cytoplasm starts travelling faster and the thicker and bigger vein receives the heavily saturated, concentrated and healthy food. With the rise of highly concentrated food, the ${W}_{t}$ of the search space rises and it is reduced owing to the low concentration. The algebraic interpretation of this phase is provided in the form of Eqs. (28) to (30).

$$ Y^{*} = r_{and} \cdot (V_{\max } - V_{\min } ) + V_{\min } ;r{}_{and} < z $$

(28)

$$ Y^{*} = Y_{b} + x_{b} \cdot \{ W_{t} \cdot (Y_{A} - Y_{B} )\} ;r < q $$

(29)

$$ Y^{*} = x_{b} \cdot Y;r \ge q $$

(30)

where, ${V}_{min}$ and ${V}_{max}$ show the searching region from minimum to maximum value, respectively.

3rd Phase (waving and oscillation)

The SM depends completely on the propagation and amplification of waves produced during biological activity, such as changing the cytoplasmic flow in the veins. The ${x}_{b}$ varies in the range [− b, b]. It gradually approaches zero with the progression of the algorithm, as the number of iterations increases. While ${x}_{c}$ oscillates between [− 1,1] and it also eventually approaches zero.

Total net level of complexity of SMA:

The total net level of complexity of SMA comprises the complexity of the initialization process, performance assessment, strength or weight transformation and positioning⁵⁴. Mathematically it can be provided in Eq. (31).

$$ SMA_{OverallNetComplexity} = C[d + t_{\max } .m.\{ 1 + \log (m) + d\} ] $$

(31)

where, $m$ and $d$ denote the maximum cells in the SM and the dimensionality of features, respectively.

The absence of an acceleration and mutation strategy may limit the wide-scale adoption of the SMA⁸⁵. Furthermore, it also lacks in offering feature extraction when performed in the binary versions of algorithms.

MPA

Faramarzi et al.⁴³ presented a novel marine predator algorithm (MPA), which works on the effective swarm-inspired metaheuristic. Unlike other evolution algorithms, swarm-inspired algorithms adapt and generate new approaches which are differentiated mainly by their ability to search across many networks for the best response⁸⁶. As shown in Fig. 5, MPA pertains to general foraging tactics of aquatic and marine creatures, like Brownian motion and Levy flight of prey and predator (inspired organisms). It is followed by the optimal encounter rate strategy of biological predator–prey interactions⁴³. The predator forages and eats, whilst the prey gets eaten. The ease and simplicity of the velocity-based MPA approach, along with its excellent performance make it a viable substitute for traditional optimization algorithms⁴³.

Similar to the vast number of population-based metaheuristic algorithms, MPA is initialized with uniform allocation of the objective function and initial response in a search space, as expressed in Eq. (32).

$$ Y_{i,j} = V_{\min ,j} + \{ R \times (V_{\max ,j} - V_{\min ,j} )\} \;\;\;i = 1,{ }2,{ } \ldots ,{ }m{ }\;and\;j = 1,{ }2,{ } \ldots ,{ }d $$

(32)

$R$ is the evenly distributed vector on a random basis with a value ranging from 0 to 1 with ${V}_{min,j}$ and ${V}_{max,j}$ representing the minimum and maximum limits of the variable value to be assessed, respectively. In a search space, $d$ and $m$ indicate the highest dimension and total agents, respectively. ${Y}_{i,j}$ indicates the randomized matrix of the solution set PIcked randomly having $m\times d$ dimensional space.

As per the existence of the fittest concept, the best predators who are better at exploring, foraging and searching for prey are permitted to assemble an elite matrix to record cost-function data, as shown in Eq. (33).

$$ E_{lite} = \left[ {\begin{array}{*{20}c} {Y_{11}^{1} } & {Y_{12}^{1} ....} & {Y_{1\dim }^{1} } \\ \vdots & \ddots & \vdots \\ {Y_{n1}^{1} } & {Y_{n2}^{1} ....} & {Y_{n\dim }^{1} } \\ \end{array} } \right] $$

(33)

Both prey and predators are working as search agents, simultaneously. When the predators explore their prey, the prey simultaneously looks for its feed. Thus, the ${E}_{lite}$ is revised in the end stage of each loop if a leading predator is substituted with a healthier one.

Prey is a separate distinct matrix, equal in dimension to the Elite, that predators have access to change their positions. In short, the initiation of the algorithm produces the first prey, with the finest (predator) evolving into the Elite. Thus, another Eq. (34) is used to describe the prey matrix.

$$ P_{ry} = Y_{ij} = \left[ {\begin{array}{*{20}c} {Y_{11} } & {Y_{12} ....} & {Y_{1\dim } } \\ \vdots & \ddots & \vdots \\ {Y_{n1} } & {Y_{n1} ....} & {Y_{n\dim } } \\ \end{array} } \right] $$

(34)

The optimization of MPA includes three phases for revising, modifying and updating the original response with the search space, which are closely linked to the two foregoing matrices. All three phases are evaluated by the predator–prey velocity ratio. The first, second and third phase refers to a high, unit as well as low-velocity ratio, respectively. The comprehensive mathematical explanation of each stage is given below:

1st Phase (exploration with high velocity)

After the completion of one-third of the total iterations, the predators explore and switch locations quicker than the prey with a high-velocity ratio. Following Eq. (35), the mathematical expression for the exploration can be written as Eqs. (36) and (37).

When;

$$ I < \frac{1}{3}(t_{\max } ) $$

(35)

Then;

$$ S_{i} = R_{b} \otimes (E_{lite(i)} - R_{b} \otimes Y_{i} );i = 1,2,...,m $$

(36)

$$ Y_{i} = Y_{i} + (C.R \otimes S{}_{i});C = constant = 0.5 $$

(37)

${R}_{b}$ is the randomized vector for representing the normally distributed Brownian motion. While $I$ and ${I}_{max}$ describes the present and maximum possible iteration, respectively.

2nd Phase (evolution from exploration to exploitation with unit velocity)

In this phase, the space exploration is transitorily converted to exploitation and both the prey and predator alter location at similar velocity (with velocity-ratio ≈ 1.0). It occurs between one-third and two-thirds of the total iterations. However, if the prey is adopting Levy flight, then the most appropriate motion for the predator is Brownian motion, thus, the population is separated into two. Following Eq. (38), for a first and second half part, the step size and the position of prey can be mathematically expressed as Eqs. (39) to (40) and Eqs. (41) to (43), respectively.

When;

$$ \frac{1}{3}(I_{\max } ) < I < \frac{2}{3}(I_{\max } ) $$

(38)

For the first semi-population;

$$ S_{i} = R_{l} \otimes (E_{lite(i)} - R_{l} \otimes Y_{i} );i = 1,2, \ldots ,\frac{m}{2} $$

(39)

$$ Y_{i} = Y_{i} + (C.R \otimes S{}_{i});C = constant = 0.5 $$

(40)

For other semi-populations;

$$ S_{i} = R_{b} \otimes (R_{b} \otimes E_{lite(i)} - Y_{i} );i = 1,2, \ldots ,m $$

(41)

$$ Y_{i} = E_{lite(i)} + (C.F \otimes S{}_{i});C = constant = 0.5 $$

(42)

$$ CF = \left( {1 - \frac{I}{{I_{\max } }}} \right)^{{\frac{2I}{{I_{\max } }}}} $$

(43)

${R}_{l}$ is the randomized vector for representing the normally distributed Levy flight and $F$ is the adaptable variable governing the Brownian movement of predators.

3rd Phase (exploitation with low velocity)

In the final stage of optimization, when the current iteration surpasses two-thirds of the total iterations, the perfect exploitation occurs. Unlike, the first phase, the predators switch their locations considerably more gradually than the prey with lower velocity-ratio. By Eq. (44); the completely altered position of predators adopting Levy flight is mathematically expressed as Eqs. (45) to (46).

if;

$$ I > \frac{1}{3}(t_{\max } ) $$

(44)

then;

$$ S_{i} = R_{l} \otimes (R_{l} \otimes E_{lite(i)} - Y_{i} );i = 1,2, \ldots ,m $$

(45)

$$ Y_{i} = E_{lite(i)} + (C.F \otimes S{}_{i});C = constant = 0.5 $$

(46)

Eddy's formation with possible impact

MPA incorporates the formation of eddy's and uses Fish Aggregating Devices (FADs) to find an alternative response to the influence of natural and environmental variables and, as a result, modify the predator behaviour^87,88, as can be seen in Eqs. (47) to (50);

if;

$$ p \le (FADs = 2) $$

(47)

then;

$$ Y_{i} = Y_{i} + F \times [Y_{\min } + R \otimes (Y_{\max } - Y_{\min } )] \otimes X $$

(48)

if;

$$ p > (FADs = 0.2) $$

(49)

then;

$$ Y_{i} = Y_{i} + [FAD(1 - p) + p] \times (Y_{r1} - Y_{r2} ) $$

(50)

$p$ denotes the FADs probability and $X$ is the binary vector response. The subscripts $r1$ and $r2$ are representing the random locations of the prey matrix (${Y}_{i}$).

Marine memory

Marine predators are extremely proficient in recognizing the region of productive foraging. As a result, the marine working memory function is also assessed in the MPA optimization process⁴³. The ultimate focus of this new function is to eliminate local points and recall the previous finest position to assist agents in increasing uniform convergence^89,90.

As previously explained, MPA is referred to as solely a velocity-driven method, therefore introducing a binary multi-objective alternative can be a major enhancement^43,91. Finally, Fig. 6 illustrates the construction steps of hybrid ANNs deployed in the current research to evaluate the P_sUCS-ES. This figure outlines the process of using ANNs combined with swarm intelligence algorithms for optimizing the modelling of ES. It begins with the initialization of the swarm size and ANN parameters, followed by setting the metaheuristic parameters for algorithms such as PSO, GWO, SCA, and MPA approaches. Furthermore, this process includes testing the metaheuristics with ANN and selecting the best fit, which is then utilized to calculate the optimized weights and buses for the ANN model. Each model was ranked based on its performance in training and testing, with the highest scores given to the top performers and the lowest to the underperformers for each metric. The final score for each model was calculated by summing these individual rankings. Ultimately, the combined scores from both phases determined the model's overall ranking³². As a result, this leads to a sustainable construction approach by enhancing the understanding of the swell-strength nature of the problematic soils.

Data processing and analysis

Data preprocessing

To formulate the prognostic models, 168 and 145 observations of P_s and UCS, from 61 and 99 internationally published papers (Table 1), respectively, were considered. In addition, nine basic soil characteristics were collected from two separately developed databases. The original database was constructed after an extensive literature study by initially recording 250 datasets (for P_s-ES) and 190 datasets (for UCS-ES). After that, easy-to-determine geotechnical parameters were recorded for developing models to predict the swell-strength properties of ES. After the collection of all data points, numerous ANN trials were run to evaluate the validity. The data points that diverged substantially (around 20% or more) from the general trend were ignored (i.e., 82 records for P_s and 45 records for UCS). Therefore, 168 observation points of P_s-ES and 145 points of UCS-ES were finally deployed to formulate the hybrid models. The important factors affecting the P_sUCS-ES were investigated based on a recent literature review. However, the swell percent, MDD and OMC for some cases were absent and correlations were used to determine the missing values. Similarly, an average value of G_s was considered for some of the datasets³¹. Additionally, the contribution of G_s (between 2.3 and 2.8) on the P_sUCS-ES was negligible owing to its small range, however, it was considered by Akan and Keskin⁹³ for predicting the UCS-ES. The information related to several other geotechnical factors was scarcely present in the existing literature for several datasets. As a result, it could significantly reduce the total number of observations. Also, it may affect the generalization capability of the predicted models. As a result, these parameters were omitted in the development of models in the current study.

Table 1 Researches ID and research references of the two expansive soil databases collected in this study.

Full size table

Descriptive statistics and statistical visualization

Table 2 presents the descriptive statistics of the considered input as well as the two outputs such that these geotechnical indices are observed to affect the P_sUCS-ES. It is shown in Table 2, that the P_sUCS-ES range between 12.5 and 521 Kpa, and 6.4 and 1060 kPa, respectively. Additionally, w_n and sand content values for the P_s have not been included because their impact is lesser for the given range of data. Note that the w_n of the ES is different at different temperatures and drying times. However, the motive for selecting the w_n as an input parameter is due to its close association with the plastic limit and a variety of environmental factors. According to Patel⁹⁴, the swelling capacity of ES primarily relies on its mineral composition, as well as the moisture content and density present in its natural environment. In general, clays with PI > 25, LL > 40, and w_n near the PL or less may witness higher expansion. Also, the ES are problematic owing to their mechanical behaviour which is largely hydrophilic⁹⁵. Also, the Pearson correlation coefficient (r) calculated for the w_n was − 0.23293 for UCS-ES. It is known that the r-values illustrate a higher share of changes in the engineering characteristics of the ES. Moreover, the values given in Table 2 are suggested for the evaluation of P_sUCS-ES using the aforementioned computational intelligence models in the current research study. The efficacy and robustness of the formulated models is significantly affected by the dispersal of various data points⁴⁷. Moreover, to envisage the association among the ES input factors, graphical plots are given in Figs. 7 and 8 which depict the distribution histograms of various input factors as well as the two outputs (P_s and UCS), respectively The distribution of input data for P_s-ES is shown as a box plot in Fig. 9a which shows the 25% to 75% data distribution alongside the visual interpretation of the mean and median of the given dataset. Similarly, the box plot for the P_s-ES is manifested in Fig. 9b. Most of the data points considered in this study vary between 70 and 200. Secondly, the distribution of input data for UCS-ES is supplemented with a box plot shown in Fig. 10a which shows the 25% to 75% data distribution alongside the visual interpretation of the mean and median of the given dataset. Similarly, for the UCS-ES, the box plot is manifested as Fig. 10b. Most of the data points considered in this study vary between 100 and 300 MPa.

Table 2 Descriptive statistics of different input as well as output factors deployed in ANN-based formulated models (ANN-PSO, ANN-GWO, ANN-SMA)³¹.

Full size table

The Spearman rank correlation coefficient for P_s UCS-ES has been plotted in Fig. 11a,b, respectively. One of the most widely employed measures of relationship is Pearson's correlation coefficient, which is generally given by r^31,96. In the current research, nine parameters were selected to model P_s UCS-ES to avoid further complexity of the developed models. Note that, the P_s-ES is largely governed by all parameters especially CF (r = 0.64), OMC (r = − 0.60) and PI (r = 0.45), while, UCS-ES is significantly influenced by sand content (r = 0.58), MDD (r = 0.47) and OMC (r = − 0.39), respectively. By and large, a high correlation prevails in the P_sUCS-ES in the case of all input factors here.

AI-based analysis

The collected databases (168 instances for P_s, and 145 instances for UCS) were distinctly distributed as TrD and TsD. Note that the testing was performed to check the accuracy and robustness of the trained model using unseen data. Therefore, 70% of the dataset was selected randomly as the TrD, while the remaining 30% dataset was employed to test and validate the formulated models. Taherdangkoo et al.⁹⁷ developed an efficient neural network model to determine the maximum P_s of clayey soils by partitioning the dataset into ratios of 70:30. Several other studies in the same field follow the same partitioning ratio^31,98,99.

To evaluate the performance of the formulated models, commonly employed performance indices such as MAE, NSE efficiency, P_i, R², RMSE, RSR, VAF, WI, and WMAPE were determined^100,101,102. The formulae of these indices can be expressed as Eqs. (51) to (59), respectively:

$$ MAE = \frac{{\sum\nolimits_{i = 1}^{n} {\left| {e_{i} - p_{i} } \right|} }}{n} $$

(51)

$$ NS = 1 - \frac{{\sum\nolimits_{i = 1}^{n} {(e_{i} - p_{i} )^{2} } }}{{\sum\nolimits_{i = 1}^{n} {(e_{i} - \overline{e}_{i} )^{2} } }} $$

(52)

$$ P_{i} = adj.R^{2} + 0.01VAF - RMSE $$

(53)

$$ R^{2} = \left( {\frac{{\sum\nolimits_{i = 1}^{n} {(e_{i} - \overline{e}_{i} )(p_{i} - \overline{p}_{i} )} }}{{\sum\nolimits_{i = 1}^{n} {(e_{i} - \overline{e}_{i} )^{2} \sum\nolimits_{i = 1}^{n} {(p_{i} - \overline{p}_{i} )^{2} } } }}} \right)^{2} $$

(54)

$$ RMSE = \sqrt {\frac{{\sum\nolimits_{i = 1}^{n} {(e_{i} - p_{i} )^{2} } }}{n}} $$

(55)

$$ RSR = \frac{RMSE}{{\frac{1}{n}\sum\nolimits_{i = 1}^{n} {(e_{i} - e_{mean} )}^{2} }} $$

(56)

$$ VAF(\% ) = (1 - \frac{{{\text{var}} (e_{i} - p_{i} )}}{{{\text{var}} (e_{i} )}}) \times 100 $$

(57)

$$ WI = 1 - \left[ {\frac{{\sum\nolimits_{i = 1}^{n} {(e_{i} - p_{i} )^{2} } }}{{\sum\nolimits_{i = 1}^{n} {\{ \left| {p_{i} - e_{mean} } \right| + \left| {e_{i} - e_{mean} } \right|\}^{2} } }}} \right] $$

(58)

$$ WMAPE = \frac{{\sum\nolimits_{i = 1}^{n} {\left| {\frac{{e_{i} - p_{i} }}{{e_{i} }}} \right| \times e_{i} } }}{{\sum\nolimits_{i = 1}^{n} {e_{i} } }} $$

(59)

where ${y}_{i}$ and ${\widehat{y}}_{i}$ refer to actual and predicted ith values, $n$ means data samples in a dataset, ${y}_{mean}$ refers to the average of the actual values whereas $p$ means the total input parameters.

Results and discussion

This section presents the detailed results of the developed models to predict the P_sUCS-ES. For both the target variables, similar nine attributes, namely, clay fraction CF, liquid limit LL, plasticity index PI, maximum dry density MDD, optimum moisture content OMC, swell percent SP, natural water content w_n, sand and silt acted to be the input parameters, as mentioned earlier. As a result, 168 experimental results for P_s-ES and 145 records of UCS-ES were employed. Initially, 70% of the data was utilized as the TrD, whereas the remaining data was separated into validation dataset (VdD) and training dataset (TsD). Subsequently, the performance of the formulated models was validated and tested with the help of the aforementioned performance indices. Moreover, the comparison of robustness as well as the general performance of the formulated models is also described. Finally, statistical testing and uncertainty analysis (UA) were performed to determine the overall performance of the ANN-based models.

Configuration of ANN hybrid models

It is a desideratum to initially determine the optimum hyperparameters for the development of ANN-based models which is generally established using a trial-and-error procedure¹⁰³. The optimum number of neurons achieved from trials for both P_s-ES and UCS-ES models varied from 8 to 14, as listed in Table 3. The maximum number of iterations (k), as well as swarm size (n_s), were kept constant during modelling at 500 and 50, respectively, to compare the developed models.

Table 3 Parametric configuration of the developed hybrid ANN models.

Full size table

For developing ANN-PSO hybrid models, first of all, the ANN was initialized using RMSE as a fitness function, and then the PSO algorithm was deployed for optimizing hyperparameters of the ANN. After that, ANN was initialized with 10 input neurons, 10 neurons in the hidden layer, and one output neuron for modelling the P_s-ES. On the contrary, for UCS-ES modelling, 11 neurons were used in the hidden layer to constitute 121 and 133 weights and biases for P_sUCS-ES models, respectively. The optimum hyperparameters for PSO were set equal to 0.30, 1, and 2 as inertial weight (w), social coefficient (c₁), and acceleration coefficient (c₂), respectively.

In the case of ANN-GWO hybrid models, the wolf group was kept equal to 50 individuals. The number of inputs, hidden, and output neurons were adopted such that 97 and 121 weights and biases were obtained in the case of P_sUCS-ES models. Based on the hidden neurons, the number of optimized weights as well as biases in the case of ANN-SMA and ANN-MPA are 145 and 157 for P_s-ES models whereas, 145 and 169 for UCS-ES models, respectively. The deterministic parameter “z” for the ANN-SMA was adopted as 0.20, whereas for ANN-MPA, Fish Aggregating Device (FAD) and P were set as 0.20 and 0.50, respectively, as listed in Table 3. Note that, the process for training the metaheuristic model is identical; however, the values of weights as well as biases in the case of the developed model are not the same in each case.

The convergence of the algorithm in searching local optima may be trapped; therefore, it is essential to investigate the merging behaviour of the optimization algorithm in assessing the robustness of the developed model. Furthermore, Fig. 12 as well as Fig. 13 display the convergence curves in the case of developed hybrid models (P_s-ES and UCS-ES, respectively). It is evident that ANN-PSO and ANN-GWO converge faster (almost equivalent) as compared to the other models, however, ANN-MPA surpasses other models in achieving higher accuracy. It is because the percent difference between ANN-PSO as well as ANN-GWO models is merely 1.5%, in contrast to the 15.11% and 64.58% difference in the case of ANN-SMA as well as ANN-MPA hybrid models, respectively. Moreover, the computational cost for the developed models using MATLAB was observed as 192.74 s, 189.87 s, 224.24 s, and 376.59 s in the case of ANN-PSO, ANN-GWO, ANN-SMA, as well as ANN-MPA, respectively, for 500 iterations of P_s-ES models. Similarly, for UCS-ES models, these values were recorded as 192.71 s, 194.18 s, 211.66 s, and 383.78 s, respectively. It is also stated that the number of iterations were finalized for the sake of comparison and this is why the local results were only derived. The curves show that further iterations may not significantly alter the accuracy of formulated models.

Performance evaluation of the formulated models

This portion evaluates the accuracy analysis of the formulated models by the statistical evaluation equations (Table 4 and Table 5)¹⁰⁴. The performance evaluation of TrD is presented. The performance level for the developed models of P_s-ES was recorded in the range of 79.54% (R² = 0.7954) to 85.4% (R² = 0.854) in terms of coefficient of determination. Similarly, the UCS-ES models yielded an accuracy of 80.07% (R² = 0.8007) to 86.22% (R² = 0.8622). The TrD of both the developed models manifested a correlation (R greater than 0.8 which reflects a strong fit to the observed data points^105,106. The results of the ANN-MPA and ANN-GWO (for P_s-ES), and ANN-MPA (for UCS-ES) were found to have R² exceeding 0.80, and therefore they are considered to be yielding the best performance, i.e., low error indices. On the contrary, the ANN-PSO and ANN-SMA were observed to yield comparatively lower values while computing the swell-strength characteristics of the ES. The best R² values in the case of the ANN and ANN-MPA modelling can be summarized as: (R²_train of ANN = 0.864 and 0.9409, R²_train of ANN-MPA = 0.8541 and 0.8624, and R²_test of ANN = 0.7832 and 0.7921, R²_test of ANN-MPA = 0.8796 and 0.8799). Furthermore, overfitting can be observed in ANN modelling of the P_sUCS-ES where the testing R² in both the cases is below 0.8. However, this issue is refined and the results have having higher degree of accuracy in ANN-MPA modelling where the training and testing R² are almost equivalent.

Table 4 Details of performance indices for P_s-ES during ANN-based modelling.

Full size table

Table 5 Details of performance indices for UCS-ES during ANN-based modelling.

Full size table

The values of MAE were calculated in the range of 5.63% to 6.71% and 5.11% to 6.49% for the TrD of P_s-ES and UCS-ES models, respectively. RMSE values were recorded in the acceptable range of 7.10% to 8.42% and 6.66% to 8.1% for P_s-ES and UCS-ES models, respectively. The results reveal that ANN-MPA outperforms other models from the viewpoint of correlation as well as accuracy. The maximum values of R² were obtained for the ANN-MPA as 0.854 and 0.8624 for P_sUCS-ES models, respectively. Moreover, the lowest MAE (5.63% and 5.11%) and RMSE (7.10% and 6.66%) were also obtained for P_sUCS-ES, respectively, in the case of ANN-MPA models. Apart from correlation and mentioned errors, the models were also evaluated using the Nash–Sutcliffe (NS) performance index. The values for NS (in ANN-MPA models) were recorded in the range of 0.79 to 0.8622, with the maximum value of 0.854 and 0.8622 for P_sUCS-ES, respectively. The values of NS > 0.75 are found to yield excellent performance. Hence, the currently developed models also manifest strong goodness of fit.

The accuracy of the formulated models was also evaluated with the help of an error histogram and slope of the regression line obtained using the plot of experimental to predicted results, as shown in Figs. 14, 15, 16, and 17 (P_s-ES) and Figs. 18, 19, 20, as well as Fig. 21 (UCS-ES), respectively. It is evident that the scatter of data points for all the developed models mainly lies within the slope of ± 20% deviation from the best-fit line, which also represents the close agreement of predicted and actual results³¹. The error histogram showed 78%, 88%, 82%, and 85% of the TrD of P_s-ES models within ± 10% relative error for ANN-PSO, ANN-GWO, ANN-SMA, and ANN-MPA, respectively. Similarly, UCS-ES models yielded 85%, 90%, 78%, and 89% of the predictions within ± 10% relative error for ANN-PSO, ANN-GWO, ANN-SMA, as well as ANN-MPA, respectively.

Furthermore, a few other visual representations, such as Taylor diagrams as well as the Accuracy matrix, are also given to assess the performance of the formulated ANN-based models. The former refers to the mathematical 2-D representation of the comparative evaluation of the model from the standpoint of root mean squared error (RMSE), R (between predicted and experimental values), and the ratio of their standard deviation. Each model is identified within the diagram by a marker, character, or point, which quantifies its evaluation on a linear and radial scale. The position of the marker depicts the model performance; the closer the marker is to the reference point, the higher the accuracy of the developed model. Figure 22 manifests P_s-ES models with R values > 0.8, representing a strong agreement among observed as well as predicted values. The correlation values for UCS-ES models are also ≥ 0.78, depicting a good fit to experimental results (Fig. 23). The marker points of almost all the models are in proximity to reference points, however, the ANN-MPA being the closest one, represents a relatively more robust model.

For evaluating the accuracy of the formulated models, the accuracy matrix is also presented in Figs. 24 and 25. The percentage accuracy of the model is expressed in terms of ρ relative to their ideal values.

For instance, ideal values for mean absolute error (MAE), RMSE, and R² are 0, 0 and 1, respectively. Table 4 shows MAE, RMSE, and R² for the ANN-MPA P_s-ES model observed as 0.0563, 0.0710, and 0.8541, respectively. Hence, the accuracy of the ANN-MPA is 94.37% (100–5.63), 92.51% (100–7.10) and 85.41% from the viewpoint of MAE, RMSE, and R², respectively. Correspondingly, the accuracy of the ANN-MPA model in the case of UCS-ES approaches 94.89%, 93.34%, and 86.24% in terms of MAE, RMSE, and R², respectively.

Validation of the developed models

The attainment of higher accuracy of the VdD indicates a more robust and accurate model. Therefore, in this study, the developed models were validated with the help of two levels of validation. Firstly, 30% of the unused data separated from the main dataset was divided equally among TsD and VdD. In the second level of validation, a simulated dataset was used for parametric analysis, which is presented to see the effect of variable change and its impact on the P_sUCS-ES.

First level validation

A portion of the primary dataset was used in K-fold cross-validation having K = 5 to validate the ANN-based formulated models. The statistical evaluation of all the proposed models is furnished in Tables 4 and 5 for P_sUCS-ES models, respectively. The results reveal that ANN-MPA manifest a more robust model, yielding R² = 0.8826, RMSE = 0.0701, and MAE = 0.0568 and R² = 0.8766, RMSE = 0.066, as well as MAE = 0.0548 for TsD and VdD respectively, for predicting P_s-ES. Similarly, ANN-MPA manifested R² = 0.8608, RMSE = 0.0695, as well as MAE = 0.0567 for test data and R² = 0.8990, RMSE = 0.0511, and MAE = 0.0361 for VdD, in the case of UCS-ES model. It is pertinent to mention that the magnitudes of correlation are greater, whereas the magnitude of errors for the test and VdD lies below the TrD, which represents no overfitting during the training stage of the ANN-MPA. Figure 14b,c also depicts that most of the prediction of the ANN-MPA lies in between ± 20% of the deviation of the best-fit line. In the case of P_s-ES models, the performance of other models is depicted in Figs. 14, 15, 16, and 17, which reflects that the accuracy of the formulated models is equivalent to ANN-MPA. ANN-GWO furnished second robust results in forecasting the P_s-ES, whereas, the UCS-ES prediction exhibited overfitting in the training process (Figs. 18, 19, 20, and 21, respectively).

Uncertainty and statistical testing

The credibility evaluation of a typical AI model is necessary in the case of a prediction model to estimate the target variable for the new dataset. The current study employed UA to evaluate the quantifiable assessment of errors of the developed models to predict P_sUCS-ES. This analysis was performed on 1^st level of validation, i.e., on the TrD, TsD, and VdD, including 118, 25 and 25 for P_s-ES and 101, 22, and 22 experimental results for UCS-ES, as listed in Tables 6 and 7, respectively. To perform the UA, an absolute error was initially calculated between the predicted and experimental values for all three datasets. Subsequently, the mean of error (MOE) and standard deviation (SD) were computed for the said data. Furthermore, the margin of error (ME) was determined at a 95% confidence interval to yield the width of confidence bound (WCB). Upper bound (UB), lower bound (LB), as well as standard error (SE) were also determined to compute WCB. The results of WCB for the formulated models have been provided in Tables 8 and 9 for P_s-ES and UCS-ES, respectively.

Table 6 Results of Uncertainty analysis (UA) for P_s-ES during ANN-based modelling.

Full size table

Table 7 Results of Uncertainty analysis (UA) for UCS-ES during ANN-based modelling.

Full size table

Table 8 Results of one-tailed t-test for P_s-ES during ANN-based modelling.

Full size table

Table 9 Results of one-tailed t-test for UCS-ES during ANN-based modelling.

Full size table

Second level validation

The value of WCB for a good model shall be as small as possible; hence, the model with minimum WCB reflects a robust model.

For both cases, P_s-ES and UCS-ES, the ANN-MPA manifested minimum WCB, therefore, it ranked first in robustness for TrD, TsD, and VdD data, which is also depicted in Fig. 23.

Second-level validation

Owing to the overfitting problem while formulation of AI models, the models generated in this study were validated on different sets. For this purpose, simulated datasets were created as shown in Table 10. Moreover, as depicted in Fig. 26 the effect of changing parameters has been studied by keeping remaining variables constant. The details of the parametric and sensitivity analysis are given below.

Table 10 Details of simulated datasets for P_sUCS-ES for validation purposes.

Full size table

Parametric analysis

Table 10 illustrates the details of the simulated datasets produced alongside the fluctuating range of the considered input parameters¹⁰⁷. It is pertinent to mention that, the summation of all the input parameters had been 100% the same everywhere to simulate the real-world scenario. Moreover, the LL, PI, G_s, MDD, OMC, SP, w_n, sand, and silt were designated at their minimum, maximum, and mean entities.

It is depicted in Fig. 26 that, as anticipated, all the trends are shown by smooth curves. Figure 26a–e,g depict the expected increase in P_s-ES with rising CF, LL, PI, G_s and MDD, respectively, while, Fig. 26f, i–j displays the reverse decreasing trend in P_s-ES with increasing OMC, w_n, sand and silt, respectively. These results are consistent with the R-value reflected by the given matrix in Fig. 11, as well as they are in good agreement with the findings of Jalal et al.³¹. However, the decrease in the P_s-ES at higher water content is associated with larger values of P_s-ES with LL (Fig. 26b), which is reflected by the Δw = 0.6(PI/LL)¹⁰⁸. On the contrary, the forecasted UCS-ES elevated with increasing PI, MDD, OMC, and SP, as shown in Fig. 26m–q, while, it lowered down in the case of G_s and silt content, Fig. 26n,t, respectively, which are are in good agreement with the GEP parametric study results of Jalal et al.³¹. But, for OMC and SP, the P_s-ES was observed to follow an increasing trend after some time since the OMC and MDD are significantly influenced by the particle size being fine. It is stated that the greater impact is by the content of CF¹⁰⁹. Figure 26k–l shows that the UCS-ES lowered down with the increase in CF and LL of the original ES¹¹⁰. Figure 26r showed that with the increase in soil water content, the UCS-ES was observed to decrease¹¹¹. Also, the trends between swell-strength characteristics and the nine aforementioned input parameters attained in the level-2 validation stage are in good agreement with the behaviour of the actual dataset (as shown in Figs. 7 and 8, respectively), which verifies the robustness of the proposed model.

Sensitivity analysis

Sensitivity analysis (SA) evaluates the impact on the output of a formulated model with changing input parameters. It gives an idea about the most significant input parameters, and as a result, by eliminating the relatively trivial parameters, the number of inputs could be lessened, thereby lowering the perplexity of the model alongside the time required for training a specific model. To conduct the SA for the current study on P_sUCS-ES, the generally employed cosine amplitude technique (referred to as, CAM) was incorporated wherein the data pairs assist in the construction of data array, = [x₁,x₂,x₃,…, x_i,…,x_n], such that the variable x_i in the array, X, refers to the length vector of m in the form of:

$$ x_{i} = [x_{i1} ,x_{i2} ,x_{i3} ,...,x_{im} ] $$

(60)

The association among A_ij (strength of the relation) versus the datasets of x_i as well as x_j is determined with the help of Eq. (61):

$$ A_{ij} = \frac{{\sum\nolimits_{k = 1}^{m} {x_{ik} x_{jk} } }}{{\sum\nolimits_{k = 1}^{m} {x^{2} ik\sum\nolimits_{k = 1}^{m} {x_{ik}^{2} } } }} $$

(61)

The A_ij values for P_sUCS-ES versus the input parameters are depicted in Fig. 27. In the TrD of P_s-ES, the CF and MDD are the governing parameters whose effect exceeds 0.90 whereas the w_n, sand and silt have the lowest impact on the P_s-ES. The results of ANN-SMA and ANN-PSO are higher than those of ANN-GWO for all studied input parameters except PI and SP. On the contrary, in the TrD, TsD, and VdD of UCS-ES, the MDD and sand appear to largely govern the strength of the ES. In the TrD of UCS-ES, the effect of PI, OMC, and w_n is recorded to be the least, respectively. Furthermore, the efficiency of results with various algorithms in the TrD of UCS-ES case follows the order: of ANN-SMA > ANN-GWO > ANN-PSO. Similarly, in the case of TsD and VdD of P_s-ES, CF and PI are the most significant input parameters whereas w_n, sand and silt are the least significant parameters. Interestingly, the efficiency of results is higher for ANN-SMA and ANN-PSO in the case of TsD (P_sUCS-ES). However, ANN-PSO and ANN-SMA yield the lowest results for P_sUCS-ES in the case of VdD. Hence, ANN-SMA exhibits the most reliable results for P_s-ES while ANN-SMA and ANN-PSO are equally efficient algorithms in the case of UCS-ES.

Summary and conclusions

In various civil engineering projects, the swell-strength properties of expansive soils (ES) are crucial for evaluating the design of structures resting on the ES. Usually, laboratory tests are conducted for computing the swell pressure as well as the unconfined compression strength of the ES (referred to as ‘P_sUCS-ES’) which are not only time-consuming but also expensive. Thus, this study aims to find a robust and efficacious alternative to conduct the actual laboratory tests with efficient AI-based models. This would help to estimate the P_sUCS-ES based on available experimental databases from the past literature. This study concentrates on the formulation of metaheuristics by deploying PSO, GWO, SMA, and MPA for the evaluation of the P_sUCS-ES obtained from ANN modelling. A database of 168 P_s and 145 UCS observations was considered by consulting 61 and 99 internationally published papers, respectively, after a detailed literature search. 70% of the dataset was selected randomly as the TrD, whereas the rest of the unused dataset was deployed to test and validate the developed models. Based on the aforementioned modelling, the following conclusions are drawn:1. All the models were trained using the best hyperparameters of the ANN model resulting from PSO, GWO, SMA, and MPA. In the case of P_s-ES modelling, the fixation of several neurons in the hidden layer is purely a trial-and-error method. Furthermore, the ANN models of P_sUCS-ES using PSO were uniformly optimized with inertial weights equalling 0.3, social coefficient of unity, and acceleration coefficient of 2. The ANN-GWO metaheuristic (189.87 s) exhibited superior performance from the standpoint of computational cost, whereas PSO (192.71 s) surpassed in the case of the UCS-ES models.

UCS2. Validation of the ANN-based P_sUCS-ES models using wide statistical indices (such as MAE, NS, ρ, R², RMSE, RSR, VAF, WI, and WMAPE) was performed. It was recorded that all the developed models for P_s-ES exhibited R significantly exceeding 0.8 for the TrD, TsD, and VdD. However, ANN-MPA excelled in yielding high R values and exhibited the lowest absolute error for all these three distinct.

3. The results of UCS-ES models performance revealed that R only exceeded 0.9 in the case of TrD, but, not for TsD and VdD. Also, the ANN-MPA model yielded higher R values (0.89, 0.93, and 0.94), and comparatively low MAE values (5.11%, 5.67, and 3.61%) in the case of PSO, GWO, and SMA, respectively. UCSUCS.

4. All the ANN-base models were also tested using the a-20 index. For all the formulated models, maximum points were recorded to lie within ± 20% error. In addition, the ANN-SMA interpreted higher accuracy in terms of the a-20 index, and its superiority was also supported by the results depicted in Taylor’s diagram and the WCB values.

5. The uncertainty analysis UA for P_s-ES models showed that the ANN-MPA is observed to be the most accurate model followed by ANN-GWO, ANN-SMA, and ANN-PSO for the TrD. This type of trend was also recorded for the TsD and VdD except that ANN-PSO outperformed ANN-SMA. On the other hand, in the case of UCS-ES models, the ANN-MPA exhibited the highest accuracy followed by ANN-GWO, ANN-PSO, and ANN-SMA, for TrD. The parameter and sensitivity analyses of ANN-based P_sUCS-ES models also revealed coherent variation of the considered input parameters with the outputs.

This study is limited to the range of the parameters mentioned in the available dataset considered in this paper. Also, the inherent time and cost attributed to the initial creation of the aforementioned experimental database are still challenging. The models formulated here are based on specific soil characteristics and environmental conditions. In addition, the presence of biases or inaccuracies in this database could affect the robustness of the developed models. The validation of these models is also limited to the existing database. Moreover, trial and error in model optimization, overfitting issues, and computational costs are other noteworthy limitations while developing models. It is suggested to evaluate other optimization techniques including random forest and support vector machines in future research.

Data availability

The data used in the manuscript may be provided upon requesting Fazal E Jalal (jalal@szu.edu.cn).

Change history

06 November 2024
A Correction to this paper has been published: https://doi.org/10.1038/s41598-024-76107-4

References

Behnood, A. Soil and clay stabilization with calcium-and non-calcium-based additives: A state-of-the-art review of challenges, approaches and techniques. Transp. Geotech. 17, 14–32 (2018).
Google Scholar
Li, T., Hou, R., Xu, C., Liu, B. & Qian, X. Experimental study on structural stability of expansive soil-anchor cable system under dry–wet cycle effect. Arab. J. Sci. Eng. 47(10), 12901–12914 (2022).
CAS Google Scholar
Waheed, M. A., Al-Amoudi, O. S. B. & Al-Osta, M. A. Molecular-level behavior induction in the constitutive modeling of swelling clayey soils: A review. Transp. Geotech. 39, 100947 (2023).
Google Scholar
Sharmila, B., Bhuvaneshwari, S. & Landlin, G. Application of lignosulphonate: A sustainable approach towards strength improvement and swell management of expansive soils. Bull. Eng. Geol. Environ. 80, 6395–6413 (2021).
Google Scholar
Christopher, I. C. & Chimobi, N. D. Emerging trends in expansive soil stabilisation: A review. J. Rock Mech. Geotech. Eng. 11, 423–440 (2019).
Google Scholar
Khennouf, A. & Baheddi, M. Heave analysis of shallow foundations founded in swelling clayey soil at N’Gaous city in Algeria. Stud. Geotech. Mech. 42(3), 210–221 (2020).
ADS Google Scholar
Du, J. et al. Characterization of controlled low-strength materials from waste expansive soils. Constr. Build. Mater. 411, 134690 (2024).
Google Scholar
He, H., Wang, S., Shen, W. & Zhang, W. The influence of pipe-jacking tunneling on deformation of existing tunnels in soft soils and the effectiveness of protection measures. Transp. Geotech. 42, 101061 (2023).
Google Scholar
Cantillo, V., Mercado, V. & Pájaro, C. Empirical correlations for the swelling pressure of expansive clays in the city of Barranquilla, Colombia. Earth Sci. Res. J. 21(1), 45–49 (2017).
Google Scholar
Pang, B. et al. Inner superhydrophobic materials based on waste fly ash: Microstructural morphology of microetching effects. Composites B 268, 111089 (2024).
CAS Google Scholar
Meshram, K., Singh, N. & Jain, P. Estimation of swelling characteristics of expansive soils with influence of clay mineralogy. Acta Agric. Scand. Sect. B 71(3), 202–207 (2021).
CAS Google Scholar
Du, J., Zhou, A., Lin, X., Bu, Y. & Kodikara, J. Prediction of swelling pressure of expansive soil using an improved molecular dynamics approach combining diffuse double layer theory. Appl. Clay Sci. 203, 105998 (2021).
CAS Google Scholar
Yin, P., Vanapalli, S. K. & Yu, S.-M. Morphological characteristics of desiccation-induced cracks in cohesive soils: A critical review. Bull. Eng. Geol. Environ. 81, 503 (2022).
Google Scholar
Ikechukwu, A. F. & Onyeka, N. Validation of semi-empirical models for the prediction of swelling stress for compacted unsaturated expansive soils. Civ. Eng. Archit. 9(5), 1640–1658 (2021).
Google Scholar
Driss, A.A.-E., Harichane, K., Ghrici, M. & Gadouri, H. Assessing the effect of moulding water content on the behaviour of lime-stabilised an expansive soil. Geomech. Geoeng. 2021, 1–13 (2021).
Google Scholar
Jalal, F. E. & Iqbal, M. Unconfined compression strength modelling of expansive soils for sustainable construction: GEP vs MEP. Environ. Earth Sci. 82(14), 364 (2023).
ADS Google Scholar
Lu, D., Ma, C., Du, X., Jin, L. & Gong, Q. Development of a new nonlinear unified strength theory for geomaterials based on the characteristic stress concept. Int. J. Geomech. 17(2), 04016058 (2017).
Google Scholar
Tiwari, N., Satyam, N. & Puppala, A. J. Strength and durability assessment of expansive soil stabilized with recycled ash and natural fibers. Transp. Geotech. 29, 100556 (2021).
Google Scholar
Nnabuihe, I., Okeke, O., Opara, A., Amadi, C. & Ehujuo, N. Effects of Coal Fly Ash and Rice-Husk Ash Admixtures on Lime Stabilization of Expansive Soils from Lokpaukwu and Awgu, Southeastern Nigeria (2021).
Parihar, N. S. & Gupta, A. K. Improvement of engineering properties of expansive soil using liming leather waste ash. Bull. Eng. Geol. Environ. 80, 2509–2522 (2021).
Google Scholar
Wu, Y. et al. Experimental study on strength characteristics of expansive soil improved by steel slag powder and cement under dry–wet cycles. Iran. J. Sci. Technol. Trans. Civ. Eng. 45(2), 941–952 (2021).
Google Scholar
Yilmaz, I. & Kaynar, O. Multiple regression, ANN (RBF, MLP) and ANFIS models for prediction of swell potential of clayey soils. Expert Syst. Appl. 38(5), 5958–5966 (2011).
Google Scholar
Alavi, A. H., Gandomi, A. H., Nejad, H. C., Mollahasani, A. & Rashed, A. Design equations for prediction of pressuremeter soil deformation moduli utilizing expression programming systems. Neural Comput. Appl. 23(6), 1771–1786 (2013).
Google Scholar
Abdollahi, M. & Vahedifard, F. Prediction of Lateral Swelling Pressure in Expansive Soils, Geo-Congress 2020: Geo-Systems, Sustainability, Geoenvironmental Engineering, and Unsaturated Soil Mechanics 367–376 (American Society of Civil Engineers, 2020).
Google Scholar
Dafalla, M., Mutaz, E. & Al-Shamrani, M. Compressive strength variations of lime-treated expansive soils. International Foundations Congress and Equipment Expo 1402–1409 (2015).
Mittal, M. et al. Prediction of coefficient of consolidation in soil using machine learning techniques. Microprocess. Microsyst. 82, 103830 (2021).
Google Scholar
Zhao, N., Li, D.-Q., Gu, S.-X. & Du, W. Analytical fragility relation for buried cast iron pipelines with lead-caulked joints based on machine learning algorithms. Earthq. Spectra 40(1), 566–583 (2024).
Google Scholar
Sun, W., Zhang, W. & Han, L. Determination of groundwater buoyancy reduction coefficient in clay: Model tests, numerical simulations and machine learning methods. Undergr. Space 13, 228–240 (2023).
Google Scholar
Wang, Y. et al. A comparative study of regional landslide susceptibility mapping with multiple machine learning models. Geol. J. https://doi.org/10.1002/gj.4902 (2023).
Article Google Scholar
Biswas, R. et al. A novel integrated approach of RUNge Kutta optimizer and ANN for estimating compressive strength of self-compacting concrete. Case Stud. Constr. Mater. 18, e02163 (2023).
Google Scholar
Jalal, F. E., Xu, Y., Iqbal, M., Javed, M. F. & Jamhiri, B. Predictive modeling of swell-strength of expansive soils using artificial intelligence approaches: ANN, ANFIS and GEP. J. Environ. Manag. 289, 112420 (2021).
Google Scholar
Kumar, M., Samui, P., Kumar, D. R. & Asteris, P. G. State-of-the-art XGBoost, RF and DNN based soft-computing models for PGPN piles. Geomech. Geoeng. 2024, 1–16 (2024).
Google Scholar
Shi, C. & Wang, Y. Development of subsurface geological cross-section from limited site-specific boreholes and prior geological knowledge using iterative convolution XGBoost. J. Geotechn. Geoenviron. Eng. 147(9), 04021082 (2021).
Google Scholar
Arthur, C. K., Temeng, V. A. & Ziggah, Y. Y. Multivariate Adaptive Regression Splines (MARS) approach to blast-induced ground vibration prediction. Int. J. Min. Reclam. Environ. 34(3), 198–222 (2020).
Google Scholar
Sujatha, M. & Jaidhar, C. Machine learning-based approaches to enhance the soil fertility: A review. Expert Syst. Appl. 240, 122557 (2023).
Google Scholar
Shi, M. et al. Ensemble regression based on polynomial regression-based decision tree and its application in the in-situ data of tunnel boring machine. Mech. Syst. Signal Process. 188, 110022 (2023).
Google Scholar
Giustolisi, O., Doglioni, A., Savic, D. A. & Webb, B. A multi-model approach to analysis of environmental phenomena. Environ. Model. Softw. 22(5), 674–682 (2007).
Google Scholar
Mohammadzadeh, S., Kazemi, S.-F., Mosavi, A., Nasseralshariati, E. & Tah, J. H. Prediction of compression index of fine-grained soils using a gene expression programming model. Infrastructures 4(2), 26 (2019).
Google Scholar
Sun, W., Hu, P., Lei, F., Zhu, N. & Jiang, Z. Case study of performance evaluation of ground source heat pump system based on ANN and ANFIS models. Appl. Therm. Eng. 87, 586–594 (2015).
Google Scholar
Shariati, M. et al. Application of a hybrid artificial neural network-particle swarm optimization (ANN-PSO) model in behavior prediction of channel shear connectors embedded in normal and high-strength concrete. Appl. Sci. 9(24), 5534 (2019).
CAS Google Scholar
Das, S. K. Artificial neural networks in geotechnical engineering: Modeling and application issues. Metaheuristics Water Geotech. Transp. Eng. 45, 231–267 (2013).
Google Scholar
Tang, H., Sun, W., Lin, A., Xue, M. & Zhang, X. A GWO-based multi-robot cooperation method for target searching in unknown environments. Expert Syst. Appl. 186, 115795 (2021).
Google Scholar
Faramarzi, A., Heidarinejad, M., Mirjalili, S. & Gandomi, A. H. Marine Predators Algorithm: A nature-inspired metaheuristic. Expert Syst. Appl. 152, 113377 (2020).
Google Scholar
Kaveh, A., Talatahari, S. & Khodadadi, N. Stochastic paint optimizer: Theory and application in civil engineering. Eng. Comput. 2020, 1–32 (2020).
Google Scholar
Venkatesh, K. & Bind, Y. K. ANN and neuro-fuzzy modeling for shear strength characterization of soils. Proc. Natl. Acad. Sci. India Sect. A 92, 243–249 (2020).
Google Scholar
Fabani, M. P. et al. Producing non-traditional flour from watermelon rind pomace: Artificial neural network (ANN) modeling of the drying process. J. Environ. Manag. 281, 111915 (2021).
CAS Google Scholar
Gandomi, A. H. & Roke, D. A. Assessment of artificial neural network and genetic programming as predictive tools. Adv. Eng. Softw. 88, 63–72 (2015).
Google Scholar
Yaman, M. A., Abd Elaty, M. & Taman, M. Predicting the ingredients of self compacting concrete using artificial neural network. Alexandr. Eng. J. 56(4), 523–532 (2017).
Google Scholar
Garg, A., Wani, I., Zhu, H. & Kushvaha, V. Exploring efficiency of biochar in enhancing water retention in soils with varying grain size distributions using ANN technique. Acta Geotech. 17(4), 1315–1326 (2022).
Google Scholar
Das, S., Samui, P., Khan, S. & Sivakugan, N. Machine learning techniques applied to prediction of residual strength of clay. Open Geosci. 3(4), 449–461 (2011).
ADS Google Scholar
Wang, W., Lv, B., Zhang, C., Li, N. & Pu, S. Mechanical and micro-structure characteristics of cement-treated expansive soil admixed with nano-MgO. Bull. Eng. Geol. Environ. 82, 1–11 (2023).
Google Scholar
Sharif, M., Amin, J., Raza, M., Yasmin, M. & Satapathy, S. C. An integrated design of particle swarm optimization (PSO) with fusion of features for detection of brain tumor. Pattern Recogn. Lett. 129, 150–157 (2020).
ADS Google Scholar
Tikhamarine, Y., Souag-Gamane, D., Ahmed, A. N., Kisi, O. & El-Shafie, A. Improving artificial intelligence models accuracy for monthly streamflow forecasting using grey Wolf optimization (GWO) algorithm. J. Hydrol. 582, 124435 (2020).
Google Scholar
Li, S., Chen, H., Wang, M., Heidari, A. A. & Mirjalili, S. Slime mould algorithm: A new method for stochastic optimization. Future Gen. Comput. Syst. 111, 300–323 (2020).
Google Scholar
Ikizler, S. B., Vekli, M., Dogan, E., Aytekin, M. & Kocabas, F. Prediction of swelling pressures of expansive soils using soft computing methods. Neural Comput. Appl. 24(2), 473–485 (2014).
Google Scholar
Kumar, M., Kumar, V., Rajagopal, B. G., Samui, P. & Burman, A. State of art soft computing based simulation models for bearing capacity of pile foundation: A comparative study of hybrid ANNs and conventional models. Model. Earth Syst. Environ. 9(2), 2533–2551 (2023).
Google Scholar
Li, K., Nowamooz, H., Chazallon, C. & Migualt, B. Mechanical behaviour of densely compacted expansive soils during wetting and drying cycles: An analytical model based on shakedown concept. Eur. J. Environ. Civ. Eng. 25(6), 1065–1079 (2021).
Google Scholar
Tiwari, N. & Satyam, N. Coupling effect of pond ash and polypropylene fiber on strength and durability of expansive soil subgrades: An integrated experimental and machine learning approach. J. Rock Mech. Geotech. Eng. 13, 1101–1112 (2021).
Google Scholar
Bardhan, A. Probabilistic assessment of heavy-haul railway track using multi-gene genetic programming. Appl. Math. Model. 125, 687–720 (2024).
MathSciNet Google Scholar
Bardhan, A. et al. A hybrid approach of ANN and improved PSO for estimating soaked CBR of subgrade soils of heavy-haul railway corridor. Int. J. Pavement Eng. 24(1), 2176494 (2023).
Google Scholar
Jumaa, G. B. & Yousif, A. R. Predicting shear capacity of FRP-reinforced concrete beams without stirrups by artificial neural networks, gene expression programming, and regression analysis. Adv. Civ. Eng. 2018, 1–16 (2018).
Google Scholar
Das, S. K., Samui, P., Sabat, A. K. & Sitharam, T. Prediction of swelling pressure of soil using artificial intelligence techniques. Environ. Earth Sci. 61(2), 393–403 (2010).
ADS Google Scholar
Das, S. K., Samui, P. & Sabat, A. K. Application of artificial intelligence to maximum dry density and unconfined compressive strength of cement stabilized soil. Geotech. Geol. Eng. 29(3), 329–342 (2011).
Google Scholar
Mozumder, R. A. & Laskar, A. I. Prediction of unconfined compressive strength of geopolymer stabilized clayey soil using artificial neural network. Comput. Geotech. 69, 291–300 (2015).
Google Scholar
Liu, S. et al. Physics-informed optimization for a data-driven approach in landslide susceptibility evaluation. J. Rock Mech. Geotech. Eng. https://doi.org/10.1016/j.jrmge.2023.11.039 (2024).
Article Google Scholar
Shahmansouri, A. A. et al. Artificial neural network model to predict the compressive strength of eco-friendly geopolymer concrete incorporating silica fume and natural zeolite. J. Clean. Prod. 279, 123697 (2021).
CAS Google Scholar
Kennedy, J. & Eberhart, R. Particle swarm optimization. Proceedings of ICNN'95-International Conference on Neural Networks 1942–1948 (IEEE, 1995).
Eberhart, R. & Kennedy, J. A new optimizer using particle swarm theory, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science 39–43 (Ieee, 1995).
Yatim, H., Dams, I. Z. M. & Hadi, M. S. Particle swarm optimization for identification of a flexible manipulator system. 2013 IEEE Symposium on Computers & Informatics (ISCI) 112-117 (IEEE, 2013).
Babanezhad, M. et al. Investigation on performance of particle swarm optimization (PSO) algorithm based fuzzy inference system (PSOFIS) in a combination of CFD modeling for prediction of fluid flow. Sci. Rep. 11(1), 1505 (2021).
ADS CAS PubMed PubMed Central Google Scholar
Celtek, S. A., Durdu, A. & Alı, M. E. M. Real-time traffic signal control with swarm optimization methods. Measurement 166, 108206 (2020).
Google Scholar
Kashani, A. R., Chiong, R., Mirjalili, S. & Gandomi, A. H. Particle swarm optimization variants for solving geotechnical problems: Review and comparative analysis. Arch. Comput. Methods Eng. 28(3), 1871–1927 (2021).
MathSciNet Google Scholar
Jahandideh-Tehrani, M., Bozorg-Haddad, O. & Loáiciga, H. A. Application of particle swarm optimization to water management: An introduction and overview. Environ. Monit. Assess. 192(5), 1–18 (2020).
Google Scholar
Mirjalili, S., Mirjalili, S. M. & Lewis, A. Grey wolf optimizer. Adv. Engi. Softw. 69, 46–61 (2014).
Google Scholar
Behnood, A. & Golafshani, E. M. Predicting the compressive strength of silica fume concrete using hybrid artificial neural network with multi-objective grey wolves. J. Clean. Prod. 202, 54–64 (2018).
CAS Google Scholar
Shabbar, R., Kasasbeh, A. & Ahmed, M. M. Charging station allocation for electric vehicle network using stochastic modeling and grey wolf optimization. Sustainability 13(6), 3314 (2021).
Google Scholar
Li, Q. et al. An enhanced grey wolf optimization based feature selection wrapped kernel extreme learning machine for medical diagnosis. Comput. Math. Methods Med. 2017, 1–15 (2017).
ADS CAS Google Scholar
Faris, H., Aljarah, I., Al-Betar, M. A. & Mirjalili, S. Grey wolf optimizer: A review of recent variants and applications. Neural Comput. Appl. 30(2), 413–435 (2018).
Google Scholar
Chen, W. et al. Spatial prediction of landslide susceptibility using gis-based data mining techniques of anfis with whale optimization algorithm (woa) and grey wolf optimizer (gwo). Appl. Sci. 9(18), 3755 (2019).
Google Scholar
Himanshu, N., Kumar, V., Burman, A., Maity, D. & Gordan, B. Grey wolf optimization approach for searching critical failure surface in soil slopes. Eng. Comput. 37(3), 2059–2072 (2021).
Google Scholar
Menad, N. A., Noureddine, Z., Hemmati-Sarapardeh, A. & Shamshirband, S. Modeling temperature-based oil-water relative permeability by integrating advanced intelligent models with grey wolf optimization: Application to thermal enhanced oil recovery processes. Fuel 242, 649–663 (2019).
CAS Google Scholar
Miao, Z. et al. Grey wolf optimizer with an enhanced hierarchy and its application to the wireless sensor network coverage optimization problem. Appl. Soft Comput. 96, 106602 (2020).
Google Scholar
Mostafa, M., Rezk, H., Aly, M. & Ahmed, E. M. A new strategy based on slime mould algorithm to extract the optimal model parameters of solar PV panel. Sustain. Energy Technol. Assess. 42, 100849 (2020).
Google Scholar
Hoang, N.-D. & Tran, X.-L. Remote sensing-based urban green space detection using marine predators algorithm optimized machine learning approach. Math. Probl. Eng. 2021, 1–22 (2021).
Google Scholar
Liu, B. & Pouramini, S. Multi-objective optimization for thermal comfort enhancement and greenhouse gas emission reduction in residential buildings applying retrofitting measures by an Enhanced Water Strider Optimization Algorithm: A case study. Energy Rep. 7, 1915–1929 (2021).
Google Scholar
Jain, M., Singh, V. & Rani, A. A novel nature-inspired algorithm for optimization: Squirrel search algorithm. Swarm Evol. Comput. 44, 148–175 (2019).
Google Scholar
Filmalter, J. D., Dagorn, L., Cowley, P. D. & Taquet, M. First descriptions of the behavior of silky sharks, Carcharhinus falciformis, around drifting fish aggregating devices in the Indian Ocean. Bull. Mar. Sci. 87(3), 325–337 (2011).
Google Scholar
Yousri, D., Hasanien, H. M. & Fathy, A. Parameters identification of solid oxide fuel cell for static and dynamic simulation using comprehensive learning dynamic multi-swarm marine predators algorithm. Energy Convers. Manag. 228, 113692 (2021).
Google Scholar
Parouha, R. P. & Das, K. N. A memory based differential evolution algorithm for unconstrained optimization. Appl. Soft Comput. 38, 501–517 (2016).
Google Scholar
Abd Elaziz, M. et al. Utilization of random vector functional link integrated with marine predators algorithm for tensile behavior prediction of dissimilar friction stir welded aluminum alloy joints. J. Mater. Res. Technol. 9(5), 11370–11381 (2020).
CAS Google Scholar
Said, Z. et al. Optimizing density, dynamic viscosity, thermal conductivity and specific heat of a hybrid nanofluid obtained experimentally via ANFIS-based model and modern optimization. J. Mol. Liq. 321, 114287 (2021).
CAS Google Scholar
Bardhan, A. & Asteris, P. G. Application of hybrid ANN paradigms built with nature inspired meta-heuristics for modelling soil compaction parameters. Transport. Geotech. 41, 100995 (2023).
Google Scholar
Akan, R. & Keskin, S. N. The effect of data size of ANFIS and MLR models on prediction of unconfined compression strength of clayey soils. SN Appl. Sci. 1(8), 843 (2019).
Google Scholar
Patel, A. Geotechnical Investigations and Improvement of Ground Conditions (Woodhead Publishing, 2019).
Google Scholar
Yunlong, L. & Vanapalli, S. Pile behavior modeling in unsaturated expansive soils. In Modeling in Geotechnical Engineering 393–427 (Elsevier, 2021).
Google Scholar
Puth, M.-T., Neuhäuser, M. & Ruxton, G. D. Effective use of Pearson’s product–moment correlation coefficient. Anim. Behav. 93, 183–189 (2014).
Google Scholar
Taherdangkoo, R. et al. An efficient neural network model to determine maximum swelling pressure of clayey soils. Comput. Geotech. 162, 105693 (2023).
Google Scholar
Narmandakh, D. et al. The use of feed-forward and cascade-forward neural networks to determine swelling potential of clayey soils. Comput. Geotech. 157, 105319 (2023).
Google Scholar
Teodosio, B. et al. Shrink–swell index prediction through deep learning. Neural Comput. Appl. 35(6), 4569–4586 (2023).
Google Scholar
Bardhan, A. et al. A comparative analysis of hybrid computational models constructed with swarm intelligence algorithms for estimating soil compression index. Arch. Comput. Methods Eng. 29(7), 4735–4773 (2022).
Google Scholar
Skentou, A. D. et al. Closed-form equation for estimating unconfined compressive strength of granite from three non-destructive tests using soft computing models. Rock Mech. Rock Eng. 56(1), 487–514 (2023).
ADS Google Scholar
Kumar, D. R., Wipulanusat, W., Kumar, M., Keawsawasvong, S. & Samui, P. Optimized neural network-based state-of-the-art soft computing models for the bearing capacity of strip footings subjected to inclined loading. Intell. Syst. Appl. 21, 200314 (2024).
Google Scholar
Bardhan, A. et al. A novel integrated approach of augmented grey wolf optimizer and ANN for estimating axial load carrying-capacity of concrete-filled steel tube columns. Constr. Build. Mater. 337, 127454 (2022).
Google Scholar
Bardhan, A., Samui, P., Ghosh, K., Gandomi, A. H. & Bhattacharyya, S. ELM-based adaptive neuro swarm intelligence techniques for predicting the California bearing ratio of soils in soaked conditions. Appl. Soft Comput. 110, 107595 (2021).
Google Scholar
Iqbal, M., Zhang, D., Jalal, F. E. & Faisal Javed, M. Computational AI prediction models for residual tensile strength of GFRP bars aged in the alkaline concrete environment. Ocean Eng. 232, 109134 (2021).
Google Scholar
Lu, D., Liang, J., Du, X., Ma, C. & Gao, Z. Fractional elastoplastic constitutive model for soils based on a novel 3D fractional plastic flow rule. Comput. Geotechn. 105, 277–290 (2019).
Google Scholar
Zhang, X. et al. Assessing the impact of inertial load on the buckling behavior of piles with large slenderness ratios in liquefiable deposits. Soil Dyn. Earthq. Eng. 176, 108322 (2024).
Google Scholar
Briaud, J.-L., Zhang, X. & Moon, S. Shrink test–water content method for shrink and swell predictions. J. Geotechn. Geoenviron. Eng. 129(7), 590–600 (2003).
Google Scholar
Yusoff, S. A. N. M. et al. The effects of different compaction energy on geotechnical properties of kaolin and laterite. In AIP Conference Proceedings 030009 (AIP Publishing LLC, 2017).
Google Scholar
Bui Truong, S., Nguyen Thi, N. & Nguyen Thanh, D. An Experimental study on unconfined compressive strength of soft soil-cement mixtures with or without GGBFS in the coastal area of Vietnam. Adv. Civ. Eng. 2020, 1–12 (2020).
Google Scholar
Mousavi, F., Abdi, E., Ghalandarayeshi, S. & Page-Dumroese, D. S. Modeling unconfined compressive strength of fine-grained soils: Application of pocket penetrometer for predicting soil strength. Catena 196, 104890 (2021).
Google Scholar

Download references

Funding

Natural Science Foundation of China (Grant No. 52090084).

Author information

Authors and Affiliations

State Key Laboratory of Intelligent Geotechnics and Tunnelling, College of Civil and Transportation Engineering, Shenzhen University, Shenzhen, 518060, Guangdong, China
Fazal E. Jalal
Key Laboratory of Coastal Urban Resilient Infrastructures (Shenzhen University), Ministry of Education, Shenzhen, China
Fazal E. Jalal
Department of Civil Engineering, University of Engineering and Technology Peshawar, Peshawar, Pakistan
Mudassir Iqbal
Department of Civil Engineering, University of Louisiana at Lafayette, Lafayette, LA, 70503, USA
Waseem Akhtar Khan
Department of Civil Engineering, College of Engineering, Qassim University, Buraydah, 51452, Saudi Arabia
Arshad Jamal
Department of Civil Engineering, Kampala International University, Kampala, Uganda
Kennedy Onyelowe
Department of Computer Engineering and Applications, GLA University, Mathura, 281406, India
Lekhraj

Authors

Fazal E. Jalal
View author publications
You can also search for this author in PubMed Google Scholar
Mudassir Iqbal
View author publications
You can also search for this author in PubMed Google Scholar
Waseem Akhtar Khan
View author publications
You can also search for this author in PubMed Google Scholar
Arshad Jamal
View author publications
You can also search for this author in PubMed Google Scholar
Kennedy Onyelowe
View author publications
You can also search for this author in PubMed Google Scholar
Lekhraj
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

FE Jalal contributed to the research project by conceptualizing the study, designing the methodology, and data collection process, conducting data analysis, and experimental design and implementation. He also played a significant role in interpreting the results and drafting the manuscript. Mudassir Iqbal provided critical insights during the analysis, and interpretation of the findings, and provided valuable feedback for improvement. His expertise significantly enhanced the quality and rigour of the research. He offered guidance and advice throughout the study. Waseem Akhtar Khan made substantial contributions to the literature review, gathering relevant research articles, and synthesizing the information. Arshad Jamal played a crucial role in the literature review section by gathering relevant research articles and contributing to an overview of the optimization algorithms. Kennedy Onyelowe played a significant role in interpreting the results and drafting the manuscript. Lekhraj contributed in reviewing, editing and writing the manuscript.

Corresponding authors

Correspondence to Fazal E. Jalal, Mudassir Iqbal or Kennedy Onyelowe.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this Article was revised: In the original version of this Article author Arshad Jamal was incorrectly affiliated with ‘Department of Civil and Environmental Engineering, King Fahd University of Petroleum and Minerals, KFUPM, Box 5055, 31261, Dhahran, Saudi Arabia’. The correct affiliation is ‘Department of Civil Engineering, College of Engineering, Qassim University, Buraydah 51452, Saudi Arabia’ and author Mudassir Iqbal was omitted as a corresponding author. Correspondence and requests for materials should also be addressed to mudassiriqbal@uetpeshawar.edu.pk.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jalal, F.E., Iqbal, M., Khan, W.A. et al. ANN-based swarm intelligence for predicting expansive soil swell pressure and compression strength. Sci Rep 14, 14597 (2024). https://doi.org/10.1038/s41598-024-65547-7

Download citation

Received: 24 April 2024
Accepted: 20 June 2024
Published: 25 June 2024
DOI: https://doi.org/10.1038/s41598-024-65547-7
Springer Nature Limited

Keywords

This article is cited by

Application of tuned random forests model on cement paste including fly ash and MgO expansive additive
- Dongxia Liu
Multiscale and Multidisciplinary Modeling, Experiments and Design (2025)
Soft computing models for prediction of bentonite plastic concrete strength
- Waleed Bin Inqiad
- Muhammad Faisal Javed
- Fahid Aslam
Scientific Reports (2024)
Experimental analysis and gene expression programming optimization of sustainable concrete containing mineral fillers
- Ayesha Rauf
- Usama Asif
- Hisham Alabduljabbar
Scientific Reports (2024)
Development of machine learning models for forecasting the strength of resilient modulus of subgrade soil: genetic and artificial neural network approaches
- Laiba Khawaja
- Usama Asif
- Hisham Alabduljabbar
Scientific Reports (2024)
Investigation of drying shrinkage response in fibrillated microfiber reinforced preplaced concrete: experimental analysis and prediction models
- Muhammad Saqib
- Muhammad Faisal Javed
- M. Ijaz Khan
Innovative Infrastructure Solutions (2024)

ANN-based swarm intelligence for predicting expansive soil swell pressure and compression strength

Abstract

Similar content being viewed by others

Introduction

Methodology

ANN

PSO

GWO

SMA

1st Phase (Searching and approaching food)

2nd Phase (Wrapping food as per quality)

3rd Phase (waving and oscillation)

Total net level of complexity of SMA:

MPA

1st Phase (exploration with high velocity)

2nd Phase (evolution from exploration to exploitation with unit velocity)

3rd Phase (exploitation with low velocity)

Eddy's formation with possible impact

Marine memory

Data processing and analysis

Data preprocessing

Descriptive statistics and statistical visualization

AI-based analysis

Results and discussion

Configuration of ANN hybrid models

Performance evaluation of the formulated models

Validation of the developed models

First level validation

Uncertainty and statistical testing

Second level validation

Second-level validation

Parametric analysis

Sensitivity analysis

Summary and conclusions

Data availability

Change history

06 November 2024

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Search

Navigation