Optimizing Automated Hematoma Expansion Classification from Baseline and Follow-Up Head Computed Tomography
<p>An example of an HE classification workflow with a high-sensitivity (95%) threshold classification. Combined segmentation and classification pipeline identifies the majority of subjects with HE (141 out of 148, 95.2%), and expert review of automated segmentations is limited to 35.5% of the subjects, correcting false-positive cases. This process results in 99.21% accurate HE classification in the whole dataset, with a final 0.7% false-negative rate. Notably, expert reviewers spend only a third of the time required for examining segmentations in the entire dataset, by focusing on test positive subjects, significantly improving efficiency. The approach is practical and efficient for generating ground truth annotations of HE in large ICH datasets.</p> "> Figure 2
<p>The pipeline for HE classification. Head CT scans were preprocessed for skull stripping, adjusting the intensities to the brain window/level, and resampling and registering to a common size space. The segmentation masks, along with the baseline and follow-up CTs, were used as input for a classification CNN to predict HE. The classifier outputs probability scores for each subject. Then, from the threshold array, sensitivity array, specificity array, and f1 score array, one can choose the optimal threshold; for example, a threshold based on the maximum F1 score [<a href="#B33-applsci-15-00111" class="html-bibr">33</a>]. After that, we can create the confusion matrix elements at a given threshold. Using ROC analysis of the final prediction probabilities [<a href="#B34-applsci-15-00111" class="html-bibr">34</a>], we established the 100%, 95%, and 90% sensitivity and specificity thresholds in the internal test cohort and evaluated them in the external validation cohort.</p> "> Figure 3
<p>Classification of ≥3 mL HE using CNN model and thresholds for 100%, 95%, and 90% sensitivity and specificity, as well as the highest accuracy threshold, in the internal test cohort (ATACH-2). These thresholds were then applied to the external validation cohort (Charité). The solid and dashed lines in the ROC curve refer to same-color sensitivity/specificity thresholds (as color coded in table cell) in the internal and external validation cohorts, respectively.</p> "> Figure 4
<p>Classification of ≥6 mL HE using CNN model and thresholds for 100%, 95%, and 90% sensitivity and specificity, as well as the highest accuracy threshold, in the internal test cohort (ATACH-2). These thresholds were then applied to the external validation cohort (Charité). The solid and dashed lines in the ROC curve refer to same-color sensitivity/specificity thresholds (as color coded in table cell) in the internal and external validation cohorts, respectively.</p> "> Figure 5
<p>Classification of ≥9 mL HE using CNN model and thresholds for 100%, 95%, and 90% sensitivity and specificity, as well as the highest accuracy threshold, in the internal test cohort (ATACH-2). These thresholds were then applied to the external validation cohort (Charité). The solid and dashed lines in the ROC curve refer to same-color sensitivity/specificity thresholds (as color coded in table cell) in the internal and external validation cohorts, respectively.</p> "> Figure 6
<p>Classification of ≥12.5 mL HE using CNN model and thresholds for 100%, 95%, and 90% sensitivity and specificity, as well as the highest accuracy threshold, in the internal test cohort (ATACH-2). These thresholds were then applied to the external validation cohort (Charité). The solid and dashed lines in the ROC curve refer to same-color sensitivity/specificity thresholds (as color coded in table cell) in the internal and external validation cohorts, respectively.</p> ">
Abstract
:1. Introduction
2. Materials and Methods
2.1. Patients’ Cohorts
2.2. Manual Segmentation of Hematomas on Head CTs as the Ground Truth
2.3. Automated Hematoma Segmentation Model
- Training framework and dataset: using the Yale cohort dataset, we applied a 5-fold cross-validation framework [24,25,26] to train and optimize the CNN model for automated segmentation of hematomas on non-contrast head CTs. Then, optimized hyperparameters were used to train the model on the whole Yale dataset;
- Preprocessing of head CT: in the preprocessing step, we removed the skull, adjusted CT scan intensities to the brain window/level, and registered all scans to a common space. For the skull stripping process, we first applied a 0-to-200 HU intensity threshold to remove bony structures followed by a mathematical operation based on head morphology [27] to eliminate any remaining soft tissues using dilation-erosion operations—including binary_erosion(), remove_small_objects(), and binary_dilation()—and removing the boundary with the findContours() function, as described previously [28]. Then, we applied the brain window/level (center = 40 HU, and width = 80 HU), which provides optimal contrast for visualization of brain parenchymal pathologies. Finally, we registered all brain scans to a template, homogenizing the voxel spacing and size (128 × 128 × 128) [28];
- Segmentation model training: we used nnU-Net (no-new-Net) [29] as the backbone for our segmentation model. The nnU-Net (no-new-Net) model is a self-adaptive deep-learning framework designed for medical image segmentation. The model was developed to self-adapt new neural network architecture for each medical image segmentation task, a process that is often time-consuming and requires a grid search strategy. nnU-Net automatically configures itself based on the properties of the provided dataset, adjusting its architecture, pre-processing, and post-processing accordingly. The model has gained considerable popularity in the medical imaging community due to its high performance, ease of use, and proven applicability in a variety of imaging modalities and segmentation tasks. The model was trained in a 5-fold cross-validation process with data augmentation, weight_decay = 3 × 10−5, initial_lr = 1 × 10−2, num_epochs = 100, PolyLRScheduler, optimizer = SGD, and image input size = (128,128,128);
- Metrics of segmentation performance: we used two metrics to evaluate the hematoma segmentation model performance: the Dice similarity coefficient and the Hausdorff distance. The Dice coefficient [30] measures the volumetric overlap between segmentation results and the ground truth. The Hausdorff distance (HD) [31] measures the surface distance as the maximum distance of a segmentation mask to the nearest point in the ground truth. Details are described in the Supplementary Material.
2.4. Optimizing HE Classification Model
- Definition of HE: to facilitate patient risk stratification, HE is typically defined as a binary outcome based on an absolute (e.g., ≥6 mL) or relative (e.g., ≥33%) volume increase. However, since relative volume increases are highly sensitive to potential errors in baseline hematoma quantification and can easily vary for small baseline ICHs, we adopted a series of absolute volume increases for binary definition of HE as proposed previously—i.e., ≥3, ≥6, ≥9, and ≥12.5 mL [13]. Using the manual segmentation of hematomas to generate the baseline (Vbaseline) and follow-up (Vfollowup) volumes, the absolute HE was calculated as follows:
- Using the , we generated binary ground truth classifications for ≥3, ≥6, ≥9, and ≥12.5 mL expansions;
- Direct HE classification based on automated hematoma segmentation results: using volumes from automated hematoma segmentation by the nn-UNET model, we generated a binary HE classification based on absolute volume increases from baseline to follow-up scans;
- Training HE classification models to improve subjects’ annotation (Figure 2): as mentioned in the Introduction section, direct HE classification based on hematoma volumes from automated segmentation is susceptible to cumulative errors from the segmentation model in both baseline and follow-up scans. To address this, we applied CNN models [32] specialized for medical image classification to predict HE annotation with inputs from both admission and follow-up scans as well as segmentation masks. A benefit of such a model is creating a probability range to identify subjects that require further review and potential correction of segmentations based on desired sensitivity or specificity. We applied stratified 5-fold cross-validation, maintaining the ratio of class labels (positive/negative) across each fold. This ensures each fold represents the overall distribution of classes, which is very important when the data are imbalanced. A total of 684 cases from Yale were randomly split (4-to−1) into 548 training cases (136 with HE ≥ 3 mL, 105 with HE ≥ 6 mL, 82 with HE ≥ 9 mL, and 68 with HE ≥ 12.5 mL) and 136 validation cases (35 with HE ≥ 3 mL, 24 with HE ≥ 6 mL, 20 with HE ≥ 9 mL, and 16 with HE ≥ 12.5 mL).
- We modified a DenseNet-121 model [32] to accept four inputs, baseline and follow-up CT scans, along with their respective hematoma segmentation masks, for HE prediction (Figure 2, Figures S1 and S2). The model was trained with loss_function = BCEWithLogitsLoss(), learning rate = 0.0001, weight_decay = 1 × 10−5, optimizer = Adam, scheduler = ReduceLROnPlateau, the number of epochs = 100, early stopping, dropout probability = 0.1, and image size = (128,128,128). After cross-validation, the optimized hyperparameters were used to train the final model on the whole dataset.
- Combining CNN with Support Vector Machines (SVMs): the CNN performs two primary tasks: feature extraction using convolution, pooling, and dropout layers, and classification through fully connected and linear layers. Many studies have demonstrated that combining CNNs with SVMs [35] yields better performance than using linear layers in a CNN alone [36,37]. We used the features from the CNN’s final layer as input for an SVM for HE classification. We used C-Support Vector Classification from Sklearn with a Radial Basic Function Kernel and the default configuration [38]. The inputs for the SVM were 4 × 1024 features from the last layer of the DenseNet extracted from baseline and follow-up head CTs and corresponding automated hematoma segmentation masks.
2.5. Identifying Subjects for Expert Review of ICH Segmentations, Minimizing HE Misclassification
2.6. Statistical Analysis
- The confusion matrix [25,39] provides a summary of the prediction results on classification and the true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN) for each class. The metrics obtained from the confusion matrix include accuracy, sensitivity, specificity, and F1 Score [40,41,42].
- The ROC curve [34] plots the true positive rate (sensitivity/recall) versus the false positive rate (1—specificity) for different threshold values. The AUC (Area Under the Curve) [43] is the area under the ROC curve and provides a measure of the aggregate performance across all classification thresholds. An AUC of 0.5 indicates random prediction, while a value close to 1 indicates strong discrimination;
- The confidence interval (CI) for a performance metric [44] provides a range within which the metric falls with a specified probability (for example, 95% CI);
- Cross-validation (e.g., stratified k-fold cross-validation) splits the data into multiple training and validation sets to obtain a reliable estimate of performance and reduce the risk of overfitting;
- The Chi-Square test [45] is used to examine the relationship between categorical variables or to test the degree of fit between observed data and an expected distribution;
- The independent samples t-test [46] is a statistical hypothesis test used to determine whether there is a significant difference between the means of two groups, assuming the data follow a normal distribution,
3. Results
3.1. Patients’ Characteristics
3.2. Automated Hematoma Segmentation
3.3. HE Classification Directly Based on Automated Segmentation Volumes
3.4. HE Classification Based on CNN Classification Model
3.5. HE Classification Based on CNN + SVM Classification Model
3.6. Model Execution
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Broderick, J.P.; Grotta, J.C.; Naidech, A.M.; Steiner, T.; Sprigg, N.; Toyoda, K.; Dowlatshahi, D.; Demchuk, A.M.; Selim, M.; Mocco, J.; et al. The Story of Intracerebral Hemorrhage: From Recalcitrant to Treatable Disease. Stroke 2021, 52, 1905–1914. [Google Scholar] [CrossRef] [PubMed]
- Li, Z.; You, M.; Long, C.; Bi, R.; Xu, H.; He, Q.; Hu, B. Hematoma Expansion in Intracerebral Hemorrhage: An Update on Prediction and Treatment. Front. Neurol. 2020, 11, 702. [Google Scholar] [CrossRef] [PubMed]
- Abou Karam, G.; Chen, M.C.; Zeevi, D.; Harms, B.C.; Torres-Lopez, V.M.; Rivier, C.A.; Malhotra, A.; de Havenon, A.; Falcone, G.J.; Sheth, K.N.; et al. Time-Dependent Changes in Hematoma Expansion Rate after Supratentorial Intracerebral Hemorrhage and Its Relationship with Neurological Deterioration and Functional Outcome. Diagnostics 2024, 14, 308. [Google Scholar] [CrossRef] [PubMed]
- Bakar, B.; Akkaya, S.; Say, B.; Yuksel, U.; Alhan, A.; Turgut, E.; Ogden, M.; Ergun, U. In spontaneous intracerebral hematoma patients, prediction of the hematoma expansion risk and mortality risk using radiological and clinical markers and a newly developed scale. Neurol. Res. 2021, 43, 482–495. [Google Scholar] [CrossRef]
- Ovesen, C.; Christensen, A.F.; Krieger, D.W.; Rosenbaum, S.; Havsteen, I.; Christensen, H. Time course of early postadmission hematoma expansion in spontaneous intracerebral hemorrhage. Stroke 2014, 45, 994–999. [Google Scholar] [CrossRef]
- Roh, D.; Boehme, A.; Young, C.; Roth, W.; Gutierrez, J.; Flaherty, M.; Rosand, J.; Testai, F.; Woo, D.; Elkind, M.S.V. Hematoma expansion is more frequent in deep than lobar intracerebral hemorrhage. Neurology 2020, 95, e3386–e3393. [Google Scholar] [CrossRef]
- Yogendrakumar, V.; Demchuk, A.M.; Aviv, R.I.; Rodriguez-Luna, D.; Molina, C.A.; Blas, Y.S.; Dzialowski, I.; Kobayashi, A.; Boulanger, J.M.; Lum, C.; et al. Location of intracerebral haemorrhage predicts haematoma expansion. Eur. Stroke J. 2017, 2, 257–263. [Google Scholar] [CrossRef]
- Zaman, S.; Dierksen, F.; Knapp, A.; Haider, S.P.; Abou Karam, G.; Qureshi, A.I.; Falcone, G.J.; Sheth, K.N.; Payabvash, S. Radiomic Features of Acute Cerebral Hemorrhage on Non-Contrast CT Associated with Patient Survival. Diagnostics 2024, 14, 944. [Google Scholar] [CrossRef]
- Gladstone, D.J.; Aviv, R.I.; Demchuk, A.M.; Hill, M.D.; Thorpe, K.E.; Khoury, J.C.; Sucharew, H.J.; Al-Ajlan, F.; Butcher, K.; Dowlatshahi, D.; et al. Effect of Recombinant Activated Coagulation Factor VII on Hemorrhage Expansion Among Patients With Spot Sign-Positive Acute Intracerebral Hemorrhage: The SPOTLIGHT and STOP-IT Randomized Clinical Trials. JAMA Neurol. 2019, 76, 1493–1501. [Google Scholar] [CrossRef]
- Sprigg, N.; Flaherty, K.; Appleton, J.P.; Al-Shahi Salman, R.; Bereczki, D.; Beridze, M.; Christensen, H.; Ciccone, A.; Collins, R.; Czlonkowska, A.; et al. Tranexamic acid for hyperacute primary IntraCerebral Haemorrhage (TICH-2): An international randomised, placebo-controlled, phase 3 superiority trial. Lancet 2018, 391, 2107–2115. [Google Scholar] [CrossRef]
- Avadhani, R.; Ziai, W.C.; Thompson, R.E.; Mould, W.A.; Lane, K.; Nanni, A.; Iacobelli, M.; Sharrock, M.F.; Sansing, L.H.; Van Eldik, L.J.; et al. Clinical Trial Protocol for BEACH: A Phase 2a Study of MW189 in Patients with Acute Nontraumatic Intracerebral Hemorrhage. Neurocrit Care 2024, 40, 807–815. [Google Scholar] [CrossRef] [PubMed]
- Broderick, J.P.; Brott, T.G.; Duldner, J.E.; Tomsick, T.; Huster, G. Volume of intracerebral hemorrhage. A powerful and easy-to-use predictor of 30-day mortality. Stroke 1993, 24, 987–993. [Google Scholar] [CrossRef] [PubMed]
- Dowlatshahi, D.; Demchuk, A.M.; Flaherty, M.L.; Ali, M.; Lyden, P.L.; Smith, E.E.; on behalf of the VISTA Collaboration. Defining hematoma expansion in intracerebral hemorrhage: Relationship with patient outcomes. Neurology 2011, 76, 1238–1244. [Google Scholar] [CrossRef] [PubMed]
- Arab, A.; Chinda, B.; Medvedev, G.; Siu, W.; Guo, H.; Gu, T.; Moreno, S.; Hamarneh, G.; Ester, M.; Song, X. A fast and fully-automated deep-learning approach for accurate hemorrhage segmentation and volume quantification in non-contrast whole-head CT. Sci. Rep. 2020, 10, 19389. [Google Scholar] [CrossRef]
- Ironside, N.; Chen, C.J.; Mutasa, S.; Sim, J.L.; Marfatia, S.; Roh, D.; Ding, D.; Mayer, S.A.; Lignelli, A.; Connolly, E.S. Fully Automated Segmentation Algorithm for Hematoma Volumetric Analysis in Spontaneous Intracerebral Hemorrhage. Stroke 2019, 50, 3416–3423. [Google Scholar] [CrossRef]
- Cao, H.; Morotti, A.; Mazzacane, F.; Desser, D.; Schlunk, F.; Guttler, C.; Kniep, H.; Penzkofer, T.; Fiehler, J.; Hanning, U.; et al. External Validation and Retraining of DeepBleed: The First Open-Source 3D Deep Learning Network for the Segmentation of Spontaneous Intracerebral and Intraventricular Hemorrhage. J. Clin. Med. 2023, 12, 4005. [Google Scholar] [CrossRef]
- Kok, Y.E.; Pszczolkowski, S.; Law, Z.K.; Ali, A.; Krishnan, K.; Bath, P.M.; Sprigg, N.; Dineen, R.A.; French, A.P. Semantic Segmentation of Spontaneous Intracerebral Hemorrhage, Intraventricular Hemorrhage, and Associated Edema on CT Images Using Deep Learning. Radiol. Artif. Intell. 2022, 4, e220096. [Google Scholar] [CrossRef]
- Peng, Q.; Chen, X.; Zhang, C.; Li, W.; Liu, J.; Shi, T.; Wu, Y.; Feng, H.; Nian, Y.; Hu, R. Deep learning-based computed tomography image segmentation and volume measurement of intracerebral hemorrhage. Front. Neurosci. 2022, 16, 965680. [Google Scholar] [CrossRef]
- Piao, Z.; Gu, Y.H.; Jin, H.; Yoo, S.J. Intracerebral hemorrhage CT scan image segmentation with HarDNet based transformer. Sci. Rep. 2023, 13, 7208. [Google Scholar] [CrossRef]
- Yu, N.; Yu, H.; Li, H.; Ma, N.; Hu, C.; Wang, J. A Robust Deep Learning Segmentation Method for Hematoma Volumetric Detection in Intracerebral Hemorrhage. Stroke 2022, 53, 167–176. [Google Scholar] [CrossRef]
- Qureshi, A.I.; Palesch, Y.Y.; Barsan, W.G.; Hanley, D.F.; Hsu, C.Y.; Martin, R.L.; Moy, C.S.; Silbergleit, R.; Steiner, T.; Suarez, J.I.; et al. Intensive Blood-Pressure Lowering in Patients with Acute Cerebral Hemorrhage. N. Engl. J. Med. 2016, 375, 1033–1043. [Google Scholar] [CrossRef] [PubMed]
- Haider, S.P.; Qureshi, A.I.; Jain, A.; Tharmaseelan, H.; Berson, E.R.; Majidi, S.; Filippi, C.G.; Mak, A.; Werring, D.J.; Acosta, J.N.; et al. The coronal plane maximum diameter of deep intracerebral hemorrhage predicts functional outcome more accurately than hematoma volume. Int. J. Stroke 2022, 17, 777–784. [Google Scholar] [CrossRef] [PubMed]
- Vogt, E.; Vu, L.H.; Cao, H.; Speth, A.; Desser, D.; Schlunk, F.; Dell’Orco, A.; Nawabi, J. Multilesion Segmentations in Patients with Intracerebral Hemorrhage: Reliability of ICH, IVH and PHE Masks. Tomography 2023, 9, 89–97. [Google Scholar] [CrossRef] [PubMed]
- Kohavi, R. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of the IJCAI’95: Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, QC, Canada, 20–25 August 1995; pp. 1137–1143. [Google Scholar]
- Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer: London, UK, 2009. [Google Scholar]
- Sun, H.; Chen, X.; Shi, Q.; Hong, M.; Fu, X.; Sidiropoulos, N.D. Learning to Optimize: Training Deep Neural Networks for Interference Management. IEEE Trans. Signal Process. 2018, 66, 5438–5453. [Google Scholar] [CrossRef]
- Rafael Gonzalez, R.W. Digital Image Processing; Pearson: London, UK, 2017. [Google Scholar]
- Tran, A.T.; Zeevi, T.; Haider, S.P.; Abou Karam, G.; Berson, E.R.; Tharmaseelan, H.; Qureshi, A.I.; Sanelli, P.C.; Werring, D.J.; Malhotra, A.; et al. Uncertainty-aware deep-learning model for prediction of supratentorial hematoma expansion from admission non-contrast head computed tomography scan. NPJ Digit. Med. 2024, 7, 26. [Google Scholar] [CrossRef]
- Isensee, F.; Jaeger, P.F.; Kohl, S.A.; Petersen, J.; Maier-Hein, K.H. nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 2021, 18, 203–211. [Google Scholar] [CrossRef]
- Dice, L.R. Measures of the Amount of Ecologic Association Between Species. Ecology 1945, 26, 297–302. [Google Scholar] [CrossRef]
- Rockafellar, R.T.W.; Roger, J.-B. Variational Analysis; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
- Huang, G.; Liu, Z.; Van Der Maaten, L.; Weinberger, K.Q. Densely Connected Convolutional Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 4700–4708. [Google Scholar]
- Saito, T.; Rehmsmeier, M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE 2015, 10, e0118432. [Google Scholar] [CrossRef]
- Hanley, J.A.; McNeil, B.J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982, 143, 29–36. [Google Scholar] [CrossRef]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Tiwari, P.; Upadhyay, D.; Pant, B.; Mohd, N. Multiclass Classification of Disease Using CNN and SVM of Medical Imaging. In Proceedings of the Advances in Computing and Data Sciences. ICACDS 2022. Communications in Computer and Information Science, Kurnool, India, 22–23 April 2022. [Google Scholar]
- Sarvamangala, D.R.; Kulkarni, R.V. Convolutional neural networks in medical image understanding: A survey. Evol. Intell. 2022, 15, 1–22. [Google Scholar] [CrossRef] [PubMed]
- Buitinck, L.; Louppe, G.; Blondel, M.; Pedregosa, F.; Mueller, A.; Grisel, O.; Niculae, V.; Prettenhofer, P.; Gramfort, A.; Grobler, J.; et al. API design for machine learning software: Experiences from the scikit-learn project. In Proceedings of the ECML PKDD Workshop: Languages for Data Mining and Machine Learning, Prague, Czech Republic, 23–27 September 2013. [Google Scholar]
- Powers, D.M.W. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation. Int. J. Mach. Learn. Technol. 2011, 2, 37–63. [Google Scholar] [CrossRef]
- Murphy, K.P. Machine Learning: A Probabilistic Perspective; The MIT Press: Cambridge, MA, USA; London, UK, 2012. [Google Scholar]
- Rainio, O.; Teuho, J.; Klen, R. Evaluation metrics and statistical tests for machine learning. Sci. Rep. 2024, 14, 6086. [Google Scholar] [CrossRef] [PubMed]
- Christen, P.; Hand, D.J.; Kirielle, N. A Review of the F-Measure: Its History, Properties, Criticism, and Alternatives. ACM Comput. Surv. 2023, 56, 73. [Google Scholar] [CrossRef]
- Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
- Wald, A. Tests of statistical hypotheses concerning several parameters when the number of observations is large. Trans. Am. Math. Soc. 1943, 54, 426–482. [Google Scholar] [CrossRef]
- Agresti, A. An Introduction to Categorical Data Analysis, 2nd ed.; Wiley-Interscience: Hoboken, NJ, USA, 2007. [Google Scholar]
- George, W.; Snecdecor, W.G.C. Statistical Methods, 8th ed.; Iowa State University Press: Ames, IA, USA, 1989. [Google Scholar]
- Liew, S.L.; Zavaliangos-Petropulu, A.; Jahanshad, N.; Lang, C.E.; Hayward, K.S.; Lohse, K.R.; Juliano, J.M.; Assogna, F.; Baugh, L.A.; Bhattacharya, A.K.; et al. The ENIGMA Stroke Recovery Working Group: Big data neuroimaging to study brain-behavior relationships after stroke. Hum. Brain Mapp. 2022, 43, 129–148. [Google Scholar] [CrossRef]
- Berson, E.R.; Aboian, M.S.; Malhotra, A.; Payabvash, S. Artificial Intelligence for Neuroimaging and Musculoskeletal Radiology: Overview of Current Commercial Algorithms. Semin. Roentgenol. 2023, 58, 178–183. [Google Scholar] [CrossRef]
- Naidech, A.M.; Grotta, J.; Elm, J.; Janis, S.; Dowlatshahi, D.; Toyoda, K.; Steiner, T.; Mayer, S.A.; Khanolkar, P.; Denlinger, J.; et al. Recombinant factor VIIa for hemorrhagic stroke treatment at earliest possible time (FASTEST): Protocol for a phase III, double-blind, randomized, placebo-controlled trial. Int. J. Stroke 2022, 17, 806–809. [Google Scholar] [CrossRef]
- Meretoja, A.; Yassi, N.; Wu, T.Y.; Churilov, L.; Sibolt, G.; Jeng, J.S.; Kleinig, T.; Spratt, N.J.; Thijs, V.; Wijeratne, T.; et al. Tranexamic acid in patients with intracerebral haemorrhage (STOP-AUST): A multicentre, randomised, placebo-controlled, phase 2 trial. Lancet Neurol. 2020, 19, 980–987. [Google Scholar] [CrossRef]
- Jain, A.; Malhotra, A.; Payabvash, S. Imaging of Spontaneous Intracerebral Hemorrhage. Neuroimaging Clin. N. Am. 2021, 31, 193–203. [Google Scholar] [CrossRef] [PubMed]
- Tanioka, S.; Aydin, O.U.; Hilbert, A.; Ishida, F.; Tsuda, K.; Araki, T.; Nakatsuka, Y.; Yago, T.; Kishimoto, T.; Ikezawa, M.; et al. Prediction of hematoma expansion in spontaneous intracerebral hemorrhage using a multimodal neural network. Sci. Rep. 2024, 14, 16465. [Google Scholar] [CrossRef] [PubMed]
- Ko, D.R.; Na, H.; Jung, S.; Lee, S.; Jeon, J.; Ahn, S.J. Hematoma expansion prediction in patients with intracerebral hemorrhage using a deep learning approach. J. Med. Artif. Intell. 2024, 7, 10. [Google Scholar] [CrossRef]
- Yalcin, C.; Abramova, V.; Terceno, M.; Oliver, A.; Silva, Y.; Llado, X. Hematoma expansion prediction in intracerebral hemorrhage patients by using synthesized CT images in an end-to-end deep learning framework. Comput. Med. Imaging Graph. 2024, 117, 102430. [Google Scholar] [CrossRef] [PubMed]
- Haider, S.P.; Qureshi, A.I.; Jain, A.; Tharmaseelan, H.; Berson, E.R.; Zeevi, T.; Werring, D.J.; Gross, M.; Mak, A.; Malhotra, A.; et al. Radiomic markers of intracerebral hemorrhage expansion on non-contrast CT: Independent validation and comparison with visual markers. Front. Neurosci. 2023, 17, 1225342. [Google Scholar] [CrossRef]
- Teng, L.; Ren, Q.; Zhang, P.; Wu, Z.; Guo, W.; Ren, T. Artificial Intelligence Can Effectively Predict Early Hematoma Expansion of Intracerebral Hemorrhage Analyzing Noncontrast Computed Tomography Image. Front. Aging Neurosci. 2021, 13, 632138. [Google Scholar] [CrossRef]
- Bardera, A.; Boada, I.; Feixas, M.; Remollo, S.; Blasco, G.; Silva, Y.; Pedraza, S. Semi-automated method for brain hematoma and edema quantification using computed tomography. Comput. Med. Imaging Graph. 2009, 33, 304–311. [Google Scholar] [CrossRef]
- Prakash, K.N.; Zhou, S.; Morgan, T.C.; Hanley, D.F.; Nowinski, W.L. Segmentation and quantification of intra-ventricular/cerebral hemorrhage in CT scans by modified distance regularized level set evolution technique. Int. J. Comput. Assist. Radiol. Surg. 2012, 7, 785–798. [Google Scholar] [CrossRef]
- Gong, S.; Zhong, Y.; Gong, Y.; Chan, N.Y.; Ma, W.; Mak, C.H.K.; Abrigo, J.; Dou, Q. Segmentation of Tiny Intracranial Hemorrhage Via Learning-to-Rank Local Feature Enhancement. In Proceedings of the 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, 27–30 May 2024; pp. 1–5. [Google Scholar]
- Toikkanen, M.; Kwon, D.; Lee, M. ReSGAN: Intracranial Hemorrhage Segmentation with Residuals of Synthetic Brain CT Scans; Springer: Cham, Switzerland, 2021; pp. 400–409. [Google Scholar]
- Zhang, G.; Chen, K.; Xu, S.; Cho, P.C.; Nan, Y.; Zhou, X.; Lv, C.; Li, C.; Xie, G. Lesion synthesis to improve intracranial hemorrhage detection and classification for CT images. Comput. Med. Imaging Graph. 2021, 90, 101929. [Google Scholar] [CrossRef]
- Agarap, A.F. An Architecture Combining Convolutional Neural Network (CNN) and Support Vector Machine (SVM) for Image Classification. arXiv 2017. [Google Scholar] [CrossRef]
- Khairandish, M.O.; Sharma, M.; Jain, V.; Chatterjee, J.M.; Jhanjhi, N.Z. A Hybrid CNN-SVM Threshold Segmentation Approach for Tumor Detection and Classification of MRI Brain Images. IRBM 2022, 43, 290–299. [Google Scholar] [CrossRef]
- Khoramipour, S.; Gandomkar, M.; Shakiba, M. Enhancement of brain tumor classification from MRI images using multi-path convolutional neural network with SVM classifier. Biomed. Signal Process. Control 2024, 93, 106117. [Google Scholar] [CrossRef]
- Liu, T.; Ye, X.; Sun, B. Combining Convolutional Neural Network and Support Vector Machine for Gait-based Gender Recognition. 2018 Chinese Automation Congress (CAC), Xi'an, China; 2018; pp. 3477–3481. [Google Scholar] [CrossRef]
- Niu, X.-X.; Suen, C.Y. A novel hybrid CNN–SVM classifier for recognizing handwritten digits. Pattern Recognit. 2012, 45, 1318–1325. [Google Scholar] [CrossRef]
Yale * (n = 684) | ATACH-2 * (n = 890) | Charité * (n = 687) | p Value | |
---|---|---|---|---|
Male | 358 (54.8%) | 543 (60.80%) | 530 (56,36%) | 0.345 |
Age [years] | 69.7 ± 14.3 | 62.2 ± 13.1 | 69.4 ± 14.2 | <0.001 |
Baseline NIH Stroke Scale | 8 (2 – 17) | 11 (6–16) | 6 (1–12) | <0.001 |
Baseline ICH volume [mL] | 18.7 ± 20.6 | 12.9 ± 12.5 | 30.2 ± 30.7 | <0.001 |
Follow-up ICH volume [mL] | 22.9 ± 25.8 | 15.6 ± 16.6 | 31.9 ± 34.5 | <0.001 |
≥3 mL HE | 171 (25%) | 226 (25.4%) | 180 (26.2%) | 0.872 |
≥6 mL HE | 129 (18.9%) | 148 (16.6%) | 128 (18,6%) | 0437 |
≥9 mL HE | 102 (14.9%) | 110 (12.4%) | 104 (15.1%) | 0.201 |
≥12.5 mL HE | 84 (12.3%) | 69 (7.7%) | 83 (12.1%) | 0.006 |
CT scan characteristics In-plane pixel spacing [mm] Slice thickness [mm] Min axial image matrix [n × n] Max axial matrix [n × n] Number of slices | [0.46 × 0.46] 4.81 ± 0.70 [472 × 472] [1024 × 1024] 35.9 ± 11.6 | [0.46 × 0.46] 5.29 ± 1.81 [512 × 512] [512 × 734] 30.9 ± 17.6 | [0.46 × 0.46] 4.54 ± 0.66 [434 × 434] [512 × 671] 30.26 ± 7.8 | <0.001 <0.001 |
Internal Independent Test (ATACH-2, n = 890) | External Independent Test (Charité, n = 687) | |||||||
---|---|---|---|---|---|---|---|---|
AUC | Accuracy | Sensitivity | Specificity | AUC | Accuracy | Sensitivity | Specificity | |
≥3 mL HE | 0.86 (0.84–0.88) | 0.90 (0.88–0.92) | 0.76 (0.70–0.81) | 0.95 (0.93–0.97) | 0.82 (0.79–0.85) | 0.87 (0.84–0.90) | 0.71 (0.64–0.78) | 0.93 (0.91–0.95) |
≥6 mL HE | 0.89 (0.87–0.91) | 0.95 (0.93–0.96) | 0.82 (0.76–0.88) | 0.97 (0.96–0.98) | 0.88 (0.86–0.90) | 0.94 (0.92–0.96) | 0.79 (0.72–0.86) | 0.97 (0.96–0.98) |
≥9 mL HE | 0.89 (0.87–0.91) | 0.96 (0.95–0.97) | 0.80 (0.73–0.87) | 0.98 (0.97–0.99) | 0.88 (0.86–0.90) | 0.95 (0.93–0.97) | 0.78 (0.70–0.86) | 0.98 (0.97–0.99) |
≥12.5 mL HE | 0.92 (0.90–0.94) | 0.98 (0.97–0.99) | 0.86 (0.77–0.94) | 0.99 (0.98–1.00) | 0.88 (0.85–0.99) | 0.95 (0.93–0.97) | 0.76 (0.66–0.84) | 0.98 (0.97–0.99) |
Internal Independent Test (ATACH-2, n = 890) | External Independent Test (Charité, n = 687) | |||||
---|---|---|---|---|---|---|
Classification by CNN Model | Classification Based on Segmentation Volumes | p Value | Classification by CNN Model | Classification Based on Segmentation Volumes | p Value | |
≥3 mL HE | 0.93 (0.90–0.95) | 0.86 (0.84–0.88) | 0.005 | 0.9 (0.87–0.92) | 0.82 (0.79–0.85) | 0.018 |
≥6 mL HE | 0.95 (0.94–0.97) | 0.89 (0.87–0.91) | 0.175 | 0.93 (0.90–0.95) | 0.88 (0.86–0.90) | 0.520 |
≥9 mL HE | 0.97 (0.95–0.98) | 0.89 (0.87–0.91) | 0.022 | 0.93 (0.91–0.95) | 0.88 (0.86–0.92) | 0.525 |
≥12.5 mL HE | 0.98 (0.98–0.99) | 0.92 (0.90–0.94) | 0.037 | 0.95 (0.93–0.98) | 0.87 (0.85–0.99) | 0.882 |
Internal Independent Test (ATACH-2, n = 890) | External Independent Test (Charité, n = 687) | |||||
---|---|---|---|---|---|---|
Prediction Model | Segmentation Volumes | p Value | Prediction Model | Segmentation Volumes | p Value | |
≥3 mL HE | 0.91 (0.89–0.93) | 0.86 (0.83–0.88) | 0.006 | 0.88 (0.84–0.91) | 0.82 (0.78–0.85) | 0.019 |
≥6 mL HE | 0.92 (0.89–0.95) | 0.90 (0.86–0.92) | 0.286 | 0.9 (0.86–0.94) | 0.88 (0.85–0.92) | 0.529 |
≥9 mL HE | 0.94 (0.92–0.97) | 0.89 (0.85–0.93) | 0.033 | 0.9 (0.85–0.94) | 0.88 (0.84–0.92) | 0.552 |
≥12.5 mL HE | 0.98 (0.95–1) | 0.92 (0.88–0.96) | 0.038 | 0.87 (0.82–0.93) | 0.87 (0.82–09.3) | 0.928 |
False-Negative Misclassification Rate with Each Model | 100% Sensitivity | 95% Sensitivity | 90% Sensitivity | Direct from Segmentation | ||||
---|---|---|---|---|---|---|---|---|
CNN Only | CNN/SVM | CNN Only | CNN/SVM | CNN Only | CNN/SVM | |||
ATACH-2 | ≥3 mL HE | 0 (0.0%) | 0 (0.0%) | 11 (1.2%) | 11 (1.2%) | 22 (2.5%) | 22 (2.5%) | 53(5.9%) |
≥6 mL HE | 0 (0.0%) | 0 (0.0%) | 7 (0.7%) | 7 (0.7%) | 14 (1.6%) | 14 (1.6%) | 23(2.6%) | |
≥9 mL HE | 0 (0.0%) | 0 (0.0%) | 5 (0.5%) | 5 (0.5%) | 11 (1.2%) | 11 (1.2%) | 19(2.1%) | |
≥12.5 mL HE | 0 (0.0%) | 0 (0.0%) | 3 (0.3%) | 3 (0.3%) | 6 (0.6%) | 6 (0.6%) | 9(1.0%) | |
Charité | ≥3 mL HE | 3 (0.4%) | 0 (0.0%) | 11 (1.6%) | 5 (0.7%) | 26 (3.8%) | 14 (2.0%) | 46(6.7%) |
≥6 mL HE | 5 (0.7%) | 0 (0.0%) | 12 (1.7%) | 7 (1.0%) | 21 (3.1%) | 12 (1.7%) | 22(3.2%) | |
≥9 mL HE | 3 (0.4%) | 0 (0.0%) | 11 (1.6%) | 2 (0.3%) | 20 (2.9%) | 14 (2.0%) | 21(3.1%) | |
≥12.5 mL HE | 5 (0.7%) | 2 (0.3%) | 14 (2.0%) | 16 (2.3%) | 16 (2.3%) | 19 (2.7%) | 17(2.5%) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Tran, A.T.; Desser, D.; Zeevi, T.; Abou Karam, G.; Zietz, J.; Dell’Orco, A.; Chen, M.-C.; Malhotra, A.; Qureshi, A.I.; Murthy, S.B.; et al. Optimizing Automated Hematoma Expansion Classification from Baseline and Follow-Up Head Computed Tomography. Appl. Sci. 2025, 15, 111. https://doi.org/10.3390/app15010111
Tran AT, Desser D, Zeevi T, Abou Karam G, Zietz J, Dell’Orco A, Chen M-C, Malhotra A, Qureshi AI, Murthy SB, et al. Optimizing Automated Hematoma Expansion Classification from Baseline and Follow-Up Head Computed Tomography. Applied Sciences. 2025; 15(1):111. https://doi.org/10.3390/app15010111
Chicago/Turabian StyleTran, Anh T., Dmitriy Desser, Tal Zeevi, Gaby Abou Karam, Julia Zietz, Andrea Dell’Orco, Min-Chiun Chen, Ajay Malhotra, Adnan I. Qureshi, Santosh B. Murthy, and et al. 2025. "Optimizing Automated Hematoma Expansion Classification from Baseline and Follow-Up Head Computed Tomography" Applied Sciences 15, no. 1: 111. https://doi.org/10.3390/app15010111
APA StyleTran, A. T., Desser, D., Zeevi, T., Abou Karam, G., Zietz, J., Dell’Orco, A., Chen, M. -C., Malhotra, A., Qureshi, A. I., Murthy, S. B., Majidi, S., Falcone, G. J., Sheth, K. N., Nawabi, J., & Payabvash, S. (2025). Optimizing Automated Hematoma Expansion Classification from Baseline and Follow-Up Head Computed Tomography. Applied Sciences, 15(1), 111. https://doi.org/10.3390/app15010111