Abstract
Early diagnosis of patients’ disease is crucial since it helps doctors and patients devise a treatment plan. Therefore, recognizing medical images using Artificial intelligence-based deep learning techniques has recently increased. Capsule Network (CapsNet) has promising methods in visual tasks due to its ability to keep a high relationship of spatial information compared to convolutional neural networks (CNNs). However, CapsNet faces a critical problem with a complex image background that limits its performance. The traditional CapsNet adopts a standalone convolution (SC) as a feature extractor, Softmax function for normalization of coupling coefficient, and dynamic routing procedure to allow active capsules to perform predictions leading to activation of high-level capsules. The SC is not an effective feature extractor, and SoftMax impedes capsules from distributing optimal coupling coefficient during routing. This paper proposes a CapsNet architecture called SqueezeCapsNet that integrates SqueezeNet and CapsNet to achieve effective feature extraction and fewer parameters. A new squash function named parametric squash function (PSF) was proposed to reduce non-informative capsules and promote discriminative capsules. To the best of our knowledge in literature, we are the first to integrate SqueezeNet into CapsNet. We evaluate our framework on two medical image datasets; Brain tumor and Lung & Colon cancer datasets. Additionally, datasets with varied backgrounds; MNIST, fashion-MNIST, CIFAR-10 were used to evaluate the robustness and generalizability of the model. The SqueezeCapsNet produces 94.85%, 99.76%, 99.87%, 93.49%, and 82.45% on Brain tumor, Lung & Colon Cancer, MNIST, fashion-MNIST, and CIFAR-10 datasets, respectively. Experimental results show that the proposed architecture’s compression techniques significantly provide fewer parameters while enhancing stability and accuracy across all the evaluation metrics. Our results show that our method improves CapsNet and can be adopted as a computer-aided diagnostic method to support the diagnosis of medical image tasks.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
The MNIST dataset that supports this study is publicly available at http://yann.lecun.com/exdb/mnist/[11]. The MNIST dataset that supports this study is publicly available at https://github.com/zalandoresearch/fashion-mnist [42]. The MNIST dataset that supports this study is publicly available at https://www.cs.toronto.edu/~kriz/cifar.html [20]. Finally, the lung and colon cancer histopathological images (LC25000) data that support the findings of this study are openly available at https://github.com/tampapath/lung_colon_image_set/blob/master/README.md[5].
References
Afshar P, Mohammadi A, Plataniotis KN (2018) Brain tumor type classification via capsule networks. Proc - Int Conf Image Process ICIP: 3129–3133. https://doi.org/10.1109/ICIP.2018.8451379
Ahmed K, Torresani L (2019) STAR-CAPS: capsule networks with straight-through attentive routing. In: Advances in neural information processing systems
Amer M, Maul T (2020) Path capsule networks. Neural Process Lett. https://doi.org/10.1007/s11063-020-10273-0
Bhamidi SBS, El-Sharkawy M (2019) Residual capsule network. In: 2019 IEEE 10th annual ubiquitous computing, electronics and mobile communication conference, UEMCON 2019. https://doi.org/10.1109/UEMCON47517.2019.8993019
Borkowski AA, Bui MM, Thomas LB, Wilson CP, DeLand LA, Mastorides SM (2019) Lung and colon cancer histopathological image dataset (LC25000), pp 1–2
Cao S, Yao Y, An G (2011) E2-capsule neural networks for facial expression recognition using AU-aware attention, vol 00, no 00, pp 1–2
Chang S, Liu J (2020) Multi-lane capsule network for classifying images with complex background. IEEE Access, https://doi.org/10.1109/ACCESS.2020.2990700
Cheng J, Huang W, Cao S, Ru Y, Yang W, Yun Z, Wang Z, Feng Q (2015) Enhanced performance of brain tumor classification via tumor region augmentation and partition, PloS One
Cheng J, Yang W, Huang M, Huang W, Jiang J, Zhou Y, Yang R, Zhao J, Feng Y, Feng Q, Chen W (2016) Retrieval of brain tumors by adaptive spatial pooling and fisher vector representation. PloS One
Deborshi R, Sun G (2019) Application of capsule networks for image classification on complex datasets
Deng L (2012) The MNIST database of handwritten digit images for machine learning research. IEEE Signal Process Mag, https://doi.org/10.1109/MSP.2012.2211477
Hahn T, Pyeon M, Kim G (2019) Self-routing capsule networks. In: Advances in neural information processing systems
He JSK, Zhang X, Ren S (2016) Deep residual learning for image recognition. Proc IEEE Conf Comput Vis Pattern Recognit: 770–778. https://doi.org/10.3389/fpsyg.2013.00124
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2016.90
Hinton GE, Krizhevsky A, Wang SD (2011) Transforming auto-encoders. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). https://doi.org/10.1007/978-3-642-21735-7_6
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings - 30th IEEE conference on computer vision and pattern recognition, CVPR 2017. https://doi.org/10.1109/CVPR.2017.243
Iandola FN, Moskewicz MW, Ashraf K, Han S, Dally WJ, Keutzer K (2016) Squeezenet, arXiv
Jia B, Huang Q (2020) DE-CapsNet: a diverse enhanced capsule network with disperse dynamic routing. Appl Sci, https://doi.org/10.3390/app10030884
Khan A, Sohail A, Zahoora U, Qureshi AS (2020) A survey of the recent architectures of deep convolutional neural networks. Artif Intell Rev, https://doi.org/10.1007/s10462-020-09825-6
Krizhevsky A (2009) Learning Multiple Layers of Features from Tiny Images. Sci Dep Univ, Toronto, Tech, https://doi.org/10.1.1.222.9220
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems
Kumar P, Grewal M, Srivastava MM (2018) Boosted cascaded convnets for multilabel classification of thoracic diseases in chest radiographs. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). https://doi.org/10.1007/978-3-319-93000-8_62
Larsson G, Maire M, Shakhnarovich G (2016) FractalNet: ultra-deep neural networks without residuals. pp 1–11
Li H, Guo X, Dai B, Ouyang W, Wang X (2018) Neural network encapsulation. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). https://doi.org/10.1007/978-3-030-01252-6_16
Mandal B, Ghosh S, Sarkhel R, Das N, Nasipuri M (2019) Using dynamic routing to extract intermediate features for developing scalable capsule networks. In: 2019 2nd International conference on advanced computational and communication paradigms, ICACCP 2019. https://doi.org/10.1109/ICACCP.2019.8883020
Mensah Kwabena P , Weyori BA, Abra Mighty A (2020) Exploring the performance of LBP-capsule networks with K-Means routing on complex images. J King Saud Univ - Comput Inf Sci, https://doi.org/10.1016/j.jksuci.2020.10.006
Nguyen HP, Ribeiro B (2019) Advanced capsule networks via context awareness. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). https://doi.org/10.1007/978-3-030-30487-4_14
Paik I, Kwak T, Kim I (2019) Capsule networks need an improved routing algorithm, arXiv
Phaye SSR, Sikka A, Dhall A, Bathula DR (2019) Multi-level dense capsule networks, vol 11365 LNCS springer international publishing
Rajasegaran J, Jayasundara V, Jayasekara S, Jayasekara H, Seneviratne S, Rodrigo R (2019) Deepcaps: going deeper with capsule networks. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2019.01098
Rajpurkar P et al (2017) cheXNet: radiologist-level pneumonia detection on chest X-Rays with deep learning, pp 3–9
Ren H, Su J, Lu H (2019) Evaluating generalization ability of convolutional neural networks and capsule networks for image classification via top-2 classification. arXiv
Rosario VMD, Breternitz M, Borin E (2019) Efficiency and scalability of multi-lane capsule networks (MLCN). In: Proceedings - symposium on computer architecture and high performance computing. https://doi.org/10.1109/SBAC-PAD.2019.00034
Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. In: Advances in neural information processing systems
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd International conference on learning representations ICLR 2015 - conference track proceedings
Szegedy C et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2015.7298594
Van Der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res
Vimal Kurup R, Sowmya V, Soman KP (2020) Effect of Data Pre-processing on Brain Tumor Classification Using Capsulenet. In: ICICCT 2019 – system reliability, quality control, safety, maintenance and management
Wang X, Peng Y, Lu L, Lu Z, Bagheri M, Summers RM (2017) Chestx-ray8: hospital-scale chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In: Proceedings - 30th IEEE conference on computer vision and pattern recognition, CVPR 2017. https://doi.org/10.1109/CVPR.2017.369
Xi E, Bing S, Jin Y (2017) Capsule network performance on complex data, vol 10707, no. Fall, pp 1–7
Xiang C, Zhang L, Tang Y, Zou W, Xu C (2018) MS-CapsNet: a novel multi-scale capsule network. IEEE Signal Process Lett, https://doi.org/10.1109/LSP.2018.2873892
Xiao H, Rasul K, Vollgraf R (2017) Fashion-mniST: a novel image dataset for benchmarking machine learning algorithms, arXiv
Xiong Y, Su G, Ye S, Sun Y, Sun Y (2019) Deeper capsule network for complex data. In: Proceedings of the international joint conference on neural networks. https://doi.org/10.1109/IJCNN.2019.8852020
Yang Z, Wang X (2019) Reducing the dilution: an analysis of the information sensitiveness of capsule network with a practical improvement method
Yao L, Poblenz E, Dagunts D, Covington B, Bernard D, Lyman K (2017) Learning to diagnose from scratch by exploiting dependencies among labels, pp 1–12
Zhao Z, Kleinhans A, Sandhu G, Patel I, Unnikrishnan K.P. (2019) Capsule networks with max-min normalization, arXiv:1903.09662
Acknowledgements
The authors would like to thank the editor and the reviewers for their helpful suggestions and valuable comments.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Adu, K., Walker, J., Mensah, P.K. et al. SqueezeCapsNet: enhancing capsule networks with squeezenet for holistic medical and complex images. Multimed Tools Appl 83, 2823–2852 (2024). https://doi.org/10.1007/s11042-023-15089-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15089-3