<p>Flowchart of activities to obtain the most important characteristics in the classification.</p> Full article ">Figure 2
<p>Number of samples existing in the database before applying the undersampling balancing technique.</p> Full article ">Figure 3
<p>Training curves for different models: (<b>a</b>) Training curve for the DT model; (<b>b</b>) Training curve for the RF model; and (<b>c</b>) Training curve for the XGB model.</p> Full article ">Figure 4
<p>SHAP summary plot for the decision tree multiclass classification model. The plot displays the contributions of features (genes) to the prediction of cancer types: breast cancer (BRCA), lung adenocarcinoma (LUAD), thyroid cancer (THCA), ovarian cancer (OV), and colon adenocarcinoma (COAD). Features are ranked by maximum average SHAP values, highlighting the most important genes for distinguishing between the classes.</p> Full article ">Figure 5
<p>SHAP summary plot for the random forest multiclass classification model. The plot displays the contributions of features (genes) to the prediction of cancer types: breast cancer (BRCA), lung adenocarcinoma (LUAD), thyroid cancer (THCA), ovarian cancer (OV), and colon adenocarcinoma (COAD). Features are ranked by maximum average SHAP values, highlighting the most important genes for distinguishing between the classes.</p> Full article ">Figure 6
<p>SHAP summary plot for the XGBoost multiclass classification model. The plot displays the contributions of features (genes) to the prediction of cancer types: breast cancer (BRCA), lung adenocarcinoma (LUAD), thyroid cancer (THCA), ovarian cancer (OV), and colon adenocarcinoma (COAD). Features are ranked by maximum average SHAP values, highlighting the most important genes for distinguishing between the classes.</p> Full article ">