MPB-UNet: Multi-Parallel Blocks UNet for MRI Automated Brain Tumor Segmentation
<p>Global architecture for brain tumor classification and segmentation.</p> "> Figure 2
<p>UNet architecture [<a href="#B15-electronics-14-00040" class="html-bibr">15</a>].</p> "> Figure 3
<p>Multi-parallel blocks UNet architecture.</p> "> Figure 4
<p>Definition of blocks.</p> "> Figure 5
<p>Network architecture with integrated spatial pyramid pooling for multi-scale feature extraction.</p> "> Figure 6
<p>Examples of images and their corresponding masks from the LGG Dataset [<a href="#B34-electronics-14-00040" class="html-bibr">34</a>].</p> "> Figure 7
<p>Sample segmentation results on test data. From left to right: original MRI image, ground truth, Classical UNet, Model 01, Model 02, Model 04, and MPB-UNet.</p> "> Figure 8
<p>Confusion matrices of the models (Classical UNet, Model 01, Model 02, Model 04, and MPB-UNet).</p> ">
Abstract
:1. Introduction
- A novel multi-parallel blocks UNet architecture that enhances feature extraction through parallel processing paths, inspired by the multi-scale processing capabilities of the human visual system. This design enables comprehensive feature analysis at multiple levels, leading to more robust tumor boundary detection and improved segmentation accuracy.
- Integration of Atrous Spatial Pyramid Pooling (ASPP) to enhance multi-scale feature extraction. ASPP utilizes parallel atrous convolutions with varying dilation rates, enabling the simultaneous processing of features at multiple scales. This approach improves segmentation accuracy for brain tumors of diverse sizes and shapes in MRI images.
2. Related Work
3. Methodology
3.1. Segmentation Module
3.1.1. UNet
3.1.2. MPB-UNet Architecture
3.1.3. Atrous Spatial Pyramid Pooling
4. Results and Discussion
4.1. Experimental Dataset
- Split the dataset into training, validation, and test sets, with 80% of the data used for training and validation and 20% for testing. This division ensures a robust evaluation of our model’s performance.
- Implemented comprehensive data augmentation techniques during the training phase, including random rotation (20 degrees), horizontal and vertical flips, random brightness adjustments (5%), random contrast variations (5%), and random cropping. These augmentation strategies were carefully selected to reflect realistic variations in medical imaging while preserving the critical diagnostic features of the brain images.
4.2. Evaluation Metrics
- True positive (TP) represents the correctly identified tumor areas, where the predicted tumor region matches the actual tumor region in the ground truth.
- True negative (TN) indicates the correctly identified non-tumor areas, where both the prediction and the ground truth agree on the absence of a tumor.
- False positive (FP) occurs when the model incorrectly predicts a tumor in an area where no actual tumor is present.
- False negative (FN) represents instances where the model fails to detect an actual tumor, misclassifying it as non-tumor tissue.
4.3. Discussions and Comparisons
4.3.1. Training Phase Analysis
- Model 01: Incorporates Blocks A1, B1, and C1 in the encoder, with a single decoder block in the decoder path.The model has a total of 190,290 parameters (743.32 KB), of which 189,618 (740.70 KB) are trainable and 672 (2.62 KB) are non-trainable parameters.
- Model 02: Expands on Model 01 by including Blocks A2, B2, and C2, along with a concatenation operation in the encoder, while retaining two decoder blocks in the decoder path. The model has a total of 3,617,594 parameters (13.80 MB), of which 3,615,306 (13.79 MB) are trainable and 2288 (8.94 KB) are non-trainable parameters.
- Model MPB-UNet (Model 03): Incorporates three stages in the encoder (from A1, B1, C1 to A3, B3, C3) as represented in Figure 3, with three corresponding decoder blocks and skip connections. The model has a total of 12,167,474 parameters (46.42 MB), of which 12,162,514 (46.40 MB) are trainable and 4960 (19.38 KB) are non-trainable parameters.
- Model 04: Represents our full proposed architecture, encompassing all blocks from A1 to A4 (and corresponding B and C blocks) in the encoder, coupled with four Deblocker modules in the decoder. The model has a total of 21,170,610 parameters (80.76 MB), of which 21,162,322 (80.73 MB) are trainable and 8288 (32.38 KB) are non-trainable parameters.
- Classical UNet: A baseline model implementing the standard U-Net architecture with four levels.The model has a total of 5,027,010 parameters (19.18 MB), of which 5,022,402 (19.16 MB) are trainable and 4608 (18.00 KB) are non-trainable parameters.
4.3.2. Testing Phase Analysis
4.3.3. Analysis of MPB-UNet Performance
- Optimal Architecture Depth: The MPB-UNet, with its three encoder and decoder stages, strikes an ideal balance between model complexity and feature extraction capability. Unlike shallower models, which may struggle to capture complex features, or deeper models, which might be prone to overfitting, the MPB-UNet achieves the optimal depth for this task.
- Efficient Feature Extraction: The multi-parallel block structure in the MPB-UNet allows for the efficient extraction of features at multiple scales. This is particularly beneficial for brain tumor segmentation, where tumors can vary significantly in size and shape.
- Reduced Overfitting: Compared with more complex models, the MPB-UNet has fewer parameters, which helps in reducing overfitting, especially given the limited size of most medical imaging datasets. This is evidenced by its consistent performance across both training and test sets.
4.3.4. Comparative Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Vankdothu, R.; Hameed, M.A. Brain tumor segmentation of MR images using SVM and fuzzy classifier in machine learning. Meas. Sensors 2022, 24, 100440. [Google Scholar] [CrossRef]
- Dang, K.; Vo, T.; Ngo, L.; Ha, H. A deep learning framework integrating MRI image preprocessing methods for brain tumor segmentation and classification. IBRO Neurosci. Rep. 2022, 13, 523–532. [Google Scholar] [CrossRef] [PubMed]
- Jain, D.; Pandey, A.K.; Chauhan, A.S.; Kushwah, J.S.; Saxena, N.; Sharma, R.; Sambrow, V.D.P. ASA-LSTM-based brain tumor segmentation and classification in MRI images. Int. J. Adv. Technol. Eng. Explor. 2024, 11, 838–851. [Google Scholar]
- Taphoorn, M.J.; Klein, M. Cognitive deficits in adult patients with brain tumours. Lancet Neurol. 2004, 3, 159–168. [Google Scholar] [CrossRef]
- Mandle, A.K.; Sahu, S.P.; Gupta, G. Brain tumor segmentation and classification in MRI using clustering and kernel-based SVM. Biomed. Pharmacol. J. 2022, 15, 699–716. [Google Scholar] [CrossRef]
- Agrawal, P.; Katal, N.; Hooda, N. Segmentation and classification of brain tumor using 3D-UNet deep neural networks. Int. J. Cogn. Comput. Eng. 2022, 3, 199–210. [Google Scholar] [CrossRef]
- The Brain Tumour Charity. Brain Tumour Statistics June 2023. 2023. Available online: https://assets.thebraintumourcharity.org/live/uploads/2023/06/Source-K_-Brain-Tumour-Statistics-June-2023-Final-Version.pdf (accessed on 1 August 2024).
- Ostrom, Q.T.; Gittleman, H.; Fulop, J.; Liu, M.; Blanda, R.; Kromer, C.; Wolinsky, Y.; Kruchko, C.; Barnholtz-Sloan, J.S. CBTRUS statistical report: Primary brain and central nervous system tumors diagnosed in the United States in 2008–2012. Neuro-Oncology 2015, 17, iv1–iv62. [Google Scholar] [CrossRef]
- Solanki, S.; Singh, U.P.; Chouhan, S.S.; Jain, S. Brain tumor detection and classification using intelligence techniques: An overview. IEEE Access 2023, 11, 12870–12886. [Google Scholar] [CrossRef]
- Soltaninejad, M.; Yang, G.; Lambrou, T.; Allinson, N.; Jones, T.L.; Barrick, T.R.; Howe, F.A.; Ye, X. Automated brain tumour detection and segmentation using superpixel-based extremely randomized trees in FLAIR MRI. Int. J. Comput. Assist. Radiol. Surg. 2017, 12, 183–203. [Google Scholar] [CrossRef]
- Ahuja, S.; Panigrahi, B.K.; Gandhi, T.K. Enhanced performance of Dark-Nets for brain tumor classification and segmentation using colormap-based superpixel techniques. Mach. Learn. Appl. 2022, 7, 100212. [Google Scholar] [CrossRef]
- Alwakid, G.; Gouda, W.; Humayun, M.; Jhanjhi, N. Diagnosing melanomas in Dermoscopy images using deep learning. Diagnostics 2023, 13, 1815. [Google Scholar] [CrossRef] [PubMed]
- Khan, A.; Bukhari, J.; Bangash, J.I.; Khan, A.; Imran, M.; Asim, M.; Ishaq, M.; Khan, A. Optimizing connection weights of functional link neural network using APSO algorithm for medical data classification. J. King Saud-Univ.-Comput. Inf. Sci. 2022, 34, 2551–2561. [Google Scholar] [CrossRef]
- Chen, L.C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 40, 834–848. [Google Scholar] [CrossRef] [PubMed]
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, 5–9 October 2015; proceedings, part III 18. Springer: Berlin/Heidelberg, Germany, 2015; pp. 234–241. [Google Scholar]
- Díaz-Pernas, F.J.; Martínez-Zarzuela, M.; Antón-Rodríguez, M.; González-Ortega, D. A deep learning approach for brain tumor classification and segmentation using a multiscale convolutional neural network. Healthcare 2021, 9, 153. [Google Scholar] [CrossRef]
- Sajid, S.; Hussain, S.; Sarwar, A. Brain tumor detection and segmentation in MR images using deep learning. Arab. J. Sci. Eng. 2019, 44, 9249–9261. [Google Scholar] [CrossRef]
- Farajzadeh, N.; Sadeghzadeh, N.; Hashemzadeh, M. Brain tumor segmentation and classification on MRI via deep hybrid representation learning. Expert Syst. Appl. 2023, 224, 119963. [Google Scholar] [CrossRef]
- Sangeeta, K.; Rahul, M. A Novel Deep Learning Approach for Brain Tumor Segmentation and Classification. J. Electr. Syst. 2024, 20, 700–710. [Google Scholar]
- Pereira, S.; Pinto, A.; Alves, V.; Silva, C.A. Brain tumor segmentation using convolutional neural networks in MRI images. IEEE Trans. Med. Imaging 2016, 35, 1240–1251. [Google Scholar] [CrossRef]
- Ingle, A.; Roja, M.; Sankhe, M.; Patkar, D. Efficient segmentation and classification of the tumor using improved encoder-decoder architecture in brain MRI images. Int. J. Electr. Comput. Eng. Syst. 2022, 13, 643–651. [Google Scholar] [CrossRef]
- Naser, M.A.; Deen, M.J. Brain tumor segmentation and grading of lower-grade glioma using deep learning in MRI images. Comput. Biol. Med. 2020, 121, 103758. [Google Scholar] [CrossRef]
- Sujatha, T.S.; Sreenivasulu, R. U-Net Segmentation for CNN Classification to Improve the Accuracy of Automatic Prediction of Brain Tumors Using MRI Images. J. Electr. Syst. 2024, 20, 12. [Google Scholar] [CrossRef]
- Xu, D.; Zhou, X.; Niu, X.; Wang, J. Automatic segmentation of low-grade glioma in MRI image based on UNet++ model. In Proceedings of the 2020 3rd International Conference on Computer Information Science and Artificial Intelligence (CISAI) 2020, Inner Mongolia, China, 25–27 September 2020; p. 7. [Google Scholar]
- Bedi, P.; Ningshen, N.; Rani, S.; Gole, P.; Bhasin, V. CT-γ-Net: A Hybrid Model Based on Convolutional Encoder-Decoder and Transformer Encoder for Brain Tumor Localization. J. Data Sci. Intell. Syst. 2024, 15. [Google Scholar] [CrossRef]
- Kunjumon, A.; Jacob, C.; Resmi, R. An Efficient U-Net based Model for Low Grade Glioma Segmentation in MRI Images. In Proceedings of the 2024 Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE), Vellore, India, 22–23 February 2024; pp. 1–5. [Google Scholar]
- Chahbar, F.; Merati, M.; Mahmoudi, S.; Baghdadi, M.; Lebani, Z.A. Brain Tumor Classification in Magnetic Resonance Imaging using Convolutional Neural Networks and Transfer Learning. In Proceedings of the IAM 2023: The 6th International Hybrid Conference on Informatics and Applied Mathematics, Guelma, Algeria, 6–7 December 2023; Volume 6. [Google Scholar]
- Ibtehaz, N.; Rahman, M.S. MultiResUNet: Rethinking the U-Net architecture for multimodal biomedical image segmentation. Neural Netw. 2020, 121, 74–87. [Google Scholar] [CrossRef]
- Yu, F. Multi-scale context aggregation by dilated convolutions. arXiv 2015, arXiv:1511.07122. [Google Scholar]
- Chen, L.C.; Zhu, Y.; Papandreou, G.; Schroff, F.; Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 801–818. [Google Scholar]
- Li, X.; Chen, H.; Qi, X.; Dou, Q.; Fu, C.W.; Heng, P.A. H-DenseUNet: Hybrid densely connected UNet for liver and tumor segmentation from CT volumes. IEEE Trans. Med. Imaging 2018, 37, 2663–2674. [Google Scholar] [CrossRef]
- Al-Dhabyani, W.; Gomaa, M.; Khaled, H.; Aly, F. Deep learning approaches for data augmentation and classification of breast masses using ultrasound images. Int. J. Adv. Comput. Sci. Appl. 2019, 10, 1–11. [Google Scholar] [CrossRef]
- Yang, M.; Yu, K.; Zhang, C.; Li, Z.; Yang, K. Denseaspp for semantic segmentation in street scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 3684–3692. [Google Scholar]
- Buda, M.; Saha, A.; Mazurowski, M.A. LGG Segmentation Dataset. 2019. Available online: https://www.kaggle.com/datasets/mateuszbuda/lgg-mri-segmentation (accessed on 18 April 2024).
Author | CNN Architecture | Dataset | Performance |
---|---|---|---|
Francisco Javier et al. (2021) [16] | Three-path output concatenation CNN | Nanfang Hospital | DSC: 82.8% |
Sajid et al. (2019) [17] | hybrid architecture of CNN | BraTS 2013 | DSC: 86% |
Farajzadeh Nacer et al. (2023) [18] | Deep CNN+ SVM-RBF | Figshare dataset | Accuracy: 98% |
Sangeeta Kakarwal and Rahul Mapari. (2024) [19] | BW-DCNN | Private data | Accuracy: 92% |
Sergio Pereira et al. (2016) [20] | Fully CNN-based UNet | BraTS 2013 | DSC: 85% |
Archana Ingle et al. (2022) [21] | ResUNet2 | Nanfang Hospital | DSC: 83.69% |
Mohamed Naser et al. (2020) [22] | UNet | LGG dataset (Kaggle) | DSC: 84% |
Sujatha et al. (2024) [23] | UNet | LGG dataset (Kaggle) | DSC: 90% |
Dan Xu et al. (2020) [24] | UNet++ | LGG dataset (Kaggle) | DSC: 89.1% |
Punam Bedi et al. (2024) [25] | CT-γ-Net | LGG dataset (Kaggle) | Accuracy: 99.24% |
Anila Kunjumon et al. (2024) [26] | An efficient U-Net | LGG dataset (Kaggle) | Accuracy: 99.8%, DSC: 82% |
Block | Number of Filters |
---|---|
A1 | 16 |
B1 | 16 |
C1 | 32 |
A2 | 32 |
B2 | 32 |
C2 | 64 |
A3 | 64 |
B3 | 64 |
C3 | 128 |
D | 1024 |
Deblockers | 512, 256, 128 |
Model | Accuracy (%) | Precision (%) | Sensitivity (%) | Specificity (%) | DSC (%) | Loss | Computing Time (Min) |
---|---|---|---|---|---|---|---|
Classical UNet | 99.73 | 99.73 | 99.73 | 99.73 | 99.64 | 0.0059 | 63.16 |
Model 01 | 99.39 | 99.39 | 99.39 | 99.39 | 98.99 | 0.022 | 22.32 |
Model 02 | 99.81 | 99.81 | 99.81 | 99.81 | 99.72 | 0.01 | 30.75 |
Model 04 | 99.75 | 99.75 | 99.75 | 99.75 | 99.69 | 0.01 | 83.99 |
MPB-UNet | 99.86 | 99.86 | 99.86 | 99.86 | 99.80 | 0.0033 | 66.59 |
Model | Accuracy (%) | Precision (%) | Sensitivity (%) | Specificity (%) | DSC (%) | Loss |
---|---|---|---|---|---|---|
Classical UNet | 99.53 | 99.53 | 99.53 | 99.53 | 99.41 | 0.012 |
Model 01 | 99.14 | 99.14 | 99.14 | 99.14 | 98.63 | 0.026 |
Model 02 | 99.67 | 99.67 | 99.67 | 99.67 | 99.58 | 0.011 |
Model 04 | 99.71 | 99.71 | 99.71 | 99.71 | 99.62 | 0.01 |
MPB-UNet | 99.82 | 99.82 | 99.82 | 99.82 | 99.74 | 0.004 |
Model | Average IoU (%) | Average DSC (%) |
---|---|---|
Classical UNet | 82.44 | 85.44 |
Model 01 | 58.42 | 61.86 |
Model 02 | 84.37 | 87.05 |
Model 04 | 87.40 | 90.14 |
MPB-UNet | 92.17 | 94.43 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chahbar, F.; Merati, M.; Mahmoudi, S. MPB-UNet: Multi-Parallel Blocks UNet for MRI Automated Brain Tumor Segmentation. Electronics 2025, 14, 40. https://doi.org/10.3390/electronics14010040
Chahbar F, Merati M, Mahmoudi S. MPB-UNet: Multi-Parallel Blocks UNet for MRI Automated Brain Tumor Segmentation. Electronics. 2025; 14(1):40. https://doi.org/10.3390/electronics14010040
Chicago/Turabian StyleChahbar, Fatma, Medjeded Merati, and Saïd Mahmoudi. 2025. "MPB-UNet: Multi-Parallel Blocks UNet for MRI Automated Brain Tumor Segmentation" Electronics 14, no. 1: 40. https://doi.org/10.3390/electronics14010040
APA StyleChahbar, F., Merati, M., & Mahmoudi, S. (2025). MPB-UNet: Multi-Parallel Blocks UNet for MRI Automated Brain Tumor Segmentation. Electronics, 14(1), 40. https://doi.org/10.3390/electronics14010040