[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ Skip to main content

Advertisement

Log in

WLKA-RVS: a retinal vessel segmentation method using weighted large kernel attention

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Retinal vessel segmentation is an important task in medical image analysis and has a wide range of applications in the diagnosis and treatment of retinal diseases. However, existing segmentation methods still have some shortcomings in accurately segmenting thin vessels. Based on this observation, we propose a Retinal Vessel Segmentation method based on Weighted Large Kernel Attention (WLKA-RVS), which aims to improve the accuracy of retinal vessel segmentation to better assist physicians in clinical diagnosis and treatment. Our method consists of an encoder and a decoder. In the encoder, a convolution stem first reduces the dimension of the input image. Then, feature extraction is performed by four stages of Swin Transformer modules, each stage with a downsampling layer. In the decoder, there are four different stages of Weighted Large Kernel Attention Block (WLKAB) corresponding to the Swin Transformer modules in the encoder. Then WLKA-RVS applies the Patch Expanding module to achieve upsampling. Finally, a linear layer outputs the final results. We have performed extensive experiments comparing several recent advanced models on three public datasets. WLKA-RVS led by 0.32%, 1.24%, and 0.71% in the mAcc metric, respectively. At the same time, the inference speed of WLKA-RVS met the real-time requirements for medical diagnosis. A series of experiments demonstrated the efficiency, robustness, and applicability of WLKA-RVS.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data Availability

The data that support the findings of this study are openly available at https://cecas.clemson.edu/ahoover/stare/, https://www5.cs.fau.de/research/data/fundus-images/, and https://blogs.kingston.ac.uk/retinal/chasedb1/.

References

  1. Lijuan F, Fan Z (2023) A novel feature fusion model based on non-subsampled shear-wave transform for retinal blood vessel segmentation. Comp Sci Inform Syst 20(4)

  2. Boudegga H, Elloumi Y, Akil M, Bedoui MH, Kachouri R, Abdallah AB (2021) Fast and efficient retinal blood vessel segmentation method based on deep learning network. Comput Med Imaging Graph 90:101902

    Article  Google Scholar 

  3. Tan Y, Yang K-F, Zhao S-X, Li Y-J (2022) Retinal vessel segmentation with skeletal prior and contrastive loss. IEEE Trans Med Imaging 41(9):2238–2251

    Article  MATH  Google Scholar 

  4. Lin G, Bai H, Zhao J, Yun Z, Chen Y, Pang S, Feng Q (2022) Improving sensitivity and connectivity of retinal vessel segmentation via error discrimination network. Med Phys 49(7):4494–4507

    Article  Google Scholar 

  5. Ma D, Lu D, Chen S, Heisler M, Dabiri S, Lee S, Lee H, Ding GW, Sarunic MV, Beg MF (2021) Lf-unet-a novel anatomical-aware dual-branch cascaded deep neural network for segmentation of retinal layers and fluid from optical coherence tomography images. Comput Med Imaging Graph 94:101988

    Article  Google Scholar 

  6. Lu Y, Shen Y, Xing X, Ye C, Meng MQ-H (2023) Boundary-enhanced semi-supervised retinal layer segmentation in optical coherence tomography images using fewer labels. Comput Med Imaging Graph 105:102199

    Article  Google Scholar 

  7. Huang C, Wang Z, Yuan G, Xiong Z, Hu J, Tong Y (2024) Pksea-net: A prior knowledge supervised edge-aware multi-task network for retinal arteriolar morphometry. Comput Biol Med 172:108255

    Article  Google Scholar 

  8. Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 3431–3440

  9. Cao H, Wang Y, Chen J, Jiang D, Zhang X, Tian Q, Wang M (2022) Swin-unet: unet-like pure transformer for medical image segmentation. European conference on computer vision. Springer, pp 205–218

  10. Azad R, Niggemeier L, Hüttemann M, Kazerouni A, Aghdam EK, Velichko Y, Bagci U, Merhof D (2024) Beyond self-attention: Deformable large kernel attention for medical image segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision. pp 1287–1297

  11. Li J, Gao G, Yang L, Liu Y (2024) A retinal vessel segmentation network with multiple-dimension attention and adaptive feature fusion. Comput Biol Med 108315

  12. Chen L-C, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587

  13. Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39(12):2481–2495

    Article  MATH  Google Scholar 

  14. Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer, pp. 234–241

  15. Ahmed MR, Ashrafi AF, Ahmed RU, Shatabda S, Islam AM, Islam S (2023) Doubleu-netplus: a novel attention and context-guided dual u-net with multi-scale residual feature fusion network for semantic segmentation of medical images. Neural Comput Appl 35(19):14379–14401

    Article  MATH  Google Scholar 

  16. Liu Y, Shen J, Yang L, Bian G, Yu H (2023) Resdo-unet: A deep residual network for accurate retinal vessel segmentation from fundus images. Biomed Signal Process Control 79:104087

    Article  Google Scholar 

  17. Mou L, Zhao Y, Fu H, Liu Y, Cheng J, Zheng Y, Su P, Yang J, Chen L, Frangi AF et al (2021) Cs2-net: Deep learning segmentation of curvilinear structures in medical imaging. Med Image Anal 67:101874

    Article  Google Scholar 

  18. Li W, Zhang H, Li F, Wang L (2022) Rps-net: an effective retinal image projection segmentation network for retinal vessels and foveal avascular zone based on octa data. Med Phys 49(6):3830–3844

    Article  MATH  Google Scholar 

  19. Jun Guo B, He X, Lei Y, Harms J, Wang T, Curran WJ, Liu T, Jiang Zhang L, Yang X (2020) Automated left ventricular myocardium segmentation using 3d deeply supervised attention u-net for coronary computed tomography angiography; ct myocardium segmentation. Med Phys 47(4):1775–1785

    Article  Google Scholar 

  20. Cui H, Yuwen C, Jiang L, Xia Y, Zhang Y (2021) Multiscale attention guided u-net architecture for cardiac segmentation in short-axis mri images. Comput Methods Programs Biomed 206:106142

    Article  MATH  Google Scholar 

  21. Guo C, Szemenyei M, Yi Y, Wang W, Chen B, Fan C (2021) Sa-unet: Spatial attention u-net for retinal vessel segmentation. In: 2020 25th international conference on pattern recognition. IEEE, pp 1236–1242

  22. Tang X, Zhong B, Peng J, Hao B, Li J (2020) Multi-scale channel importance sorting and spatial attention mechanism for retinal vessels segmentation. Appl Soft Comput 93:106353

    Article  MATH  Google Scholar 

  23. Guo M-H, Lu C-Z, Liu Z-N, Cheng M-M, Hu S-M (2023) Visual attention network. Comput Vis Media 9(4):733–752

    Article  MATH  Google Scholar 

  24. Dai J, Qi H, Xiong Y, Li Y, Zhang G, Hu H, Wei Y (2017) Deformable convolutional networks. In: Proceedings of the IEEE international conference on computer vision. pp 764–773

  25. Gottlieb JP, Kusunoki M, Goldberg ME (1998) The representation of visual salience in monkey parietal cortex. Nature 391(6666):481–484

    Article  MATH  Google Scholar 

  26. Treisman AM, Gelade G (1980) A feature-integration theory of attention. Cogn Psychol 12(1):97–136

    Article  MATH  Google Scholar 

  27. Wolfe JM, Horowitz TS (2004) What attributes guide the deployment of visual attention and how do they do it? Nat Rev Neurosci 5(6):495–501

    Article  Google Scholar 

  28. Tsotsos JK, Culhane SM, Wai WYK, Lai Y, Davis N, Nuflo F (1995) Modeling visual attention via selective tuning. Artif Intell 78(1–2):507–545

    Article  Google Scholar 

  29. Ali A, Touvron H, Caron M, Bojanowski P, Douze M, Joulin A, Laptev I, Neverova N, Synnaeve G, Verbeek J et al (2021) Xcit: Cross-covariance image transformers. Adv Neural Inf Process Syst 34:20014–20027

    Google Scholar 

  30. Contributors M (2020) MMSegmentation: OpenMMLab Semantic Segmentation Toolbox and Benchmark. https://github.com/open-mmlab/mmsegmentation

  31. Hoover A, Kouznetsova V, Goldbaum M (2000) Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response. IEEE Trans Med Imaging 19(3):203–210

    Article  MATH  Google Scholar 

  32. Budai A, Bock R, Maier A, Hornegger J, Michelson G et al (2013) Robust vessel segmentation in fundus images. Int J Biomed Imaging 2013

  33. Owen CG, Rudnicka AR, Nightingale CM, Mullen R, Barman SA, Sattar N, Cook DG, Whincup PH (2011) Retinal arteriolar tortuosity and cardiovascular risk factors in a multi-ethnic population study of 10-year-old children; the child heart and health study in england (chase). Arterioscler Thromb Vasc Biol 31(8):1933–1938

    Article  Google Scholar 

  34. Kang B, Moon S, Cho Y, Yu H, Kang S-J (2024) Metaseg: Metaformer-based global contexts-aware network for efficient semantic segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision. pp 434–443

  35. Cheng B, Misra I, Schwing AG, Kirillov A, Girdhar R (2022) Masked-attention mask transformer for universal image segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 1290–1299

  36. Cheng B, Schwing A, Kirillov A (2021) Per-pixel classification is not all you need for semantic segmentation. Adv Neural Inf Process Syst 34:17864–17875

    Google Scholar 

  37. Zhang W, Pang J, Chen K, Loy CC (2021) K-net: Towards unified image segmentation. Adv Neural Inf Process Syst 34:10326–10338

    Google Scholar 

  38. Xie E, Wang W, Yu Z, Anandkumar A, Alvarez JM, Luo P (2021) Segformer: Simple and efficient design for semantic segmentation with transformers. Adv Neural Inf Process Syst 34:12077–12090

    MATH  Google Scholar 

  39. Strudel R, Garcia R, Laptev I, Schmid C (2021) Segmenter: Transformer for semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 7262–7272

  40. Fan M, Lai S, Huang J, Wei X, Chai Z, Luo J, Wei X (2021) Rethinking bisenet for real-time semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 9716–9725

  41. Pan H, Hong Y, Sun W, Jia Y (2022) Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes. IEEE Trans Intell Transp Syst 24(3):3448–3460

  42. Xu M, Zhang Z, Wei F, Hu H, Bai X (2023) Side adapter network for open-vocabulary semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 2945–2954

Download references

Acknowledgements

The authors wish to thank the editors and anonymous reviewers for their valuable comments and suggestions.

Funding

This work was supported by the Guangdong University of Science and Technology Research Natural Science Project with Grant No. GKY-2023KYYBK-17.

Author information

Authors and Affiliations

Authors

Contributions

Jiayao Li: Conceptualization, Methodology, Software, Validation, Writing - Original Draft, Visualization. Min Zeng: Formal analysis, Writing - review & editing, Supervision. Chenxi Wu: Investigation, Writing - review & editing, Supervision. Qianxiang Cheng: Resources, Data Curation, Writing - review & editing. Qiuyan Guo: Investigation, Writing - review & editing, Supervision. Song Li: Conceptualization, Supervision, Project administration, Funding acquisition.

Corresponding author

Correspondence to Song Li.

Ethics declarations

Competing Interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical and Informed Consent for Data Used

This article does not contain any studies on human participants or animals.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, J., Zeng, M., Wu, C. et al. WLKA-RVS: a retinal vessel segmentation method using weighted large kernel attention. Appl Intell 55, 403 (2025). https://doi.org/10.1007/s10489-025-06309-4

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10489-025-06309-4

Keywords

Navigation