Abstract
Retinal vessel segmentation is an important task in medical image analysis and has a wide range of applications in the diagnosis and treatment of retinal diseases. However, existing segmentation methods still have some shortcomings in accurately segmenting thin vessels. Based on this observation, we propose a Retinal Vessel Segmentation method based on Weighted Large Kernel Attention (WLKA-RVS), which aims to improve the accuracy of retinal vessel segmentation to better assist physicians in clinical diagnosis and treatment. Our method consists of an encoder and a decoder. In the encoder, a convolution stem first reduces the dimension of the input image. Then, feature extraction is performed by four stages of Swin Transformer modules, each stage with a downsampling layer. In the decoder, there are four different stages of Weighted Large Kernel Attention Block (WLKAB) corresponding to the Swin Transformer modules in the encoder. Then WLKA-RVS applies the Patch Expanding module to achieve upsampling. Finally, a linear layer outputs the final results. We have performed extensive experiments comparing several recent advanced models on three public datasets. WLKA-RVS led by 0.32%, 1.24%, and 0.71% in the mAcc metric, respectively. At the same time, the inference speed of WLKA-RVS met the real-time requirements for medical diagnosis. A series of experiments demonstrated the efficiency, robustness, and applicability of WLKA-RVS.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
The data that support the findings of this study are openly available at https://cecas.clemson.edu/ahoover/stare/, https://www5.cs.fau.de/research/data/fundus-images/, and https://blogs.kingston.ac.uk/retinal/chasedb1/.
References
Lijuan F, Fan Z (2023) A novel feature fusion model based on non-subsampled shear-wave transform for retinal blood vessel segmentation. Comp Sci Inform Syst 20(4)
Boudegga H, Elloumi Y, Akil M, Bedoui MH, Kachouri R, Abdallah AB (2021) Fast and efficient retinal blood vessel segmentation method based on deep learning network. Comput Med Imaging Graph 90:101902
Tan Y, Yang K-F, Zhao S-X, Li Y-J (2022) Retinal vessel segmentation with skeletal prior and contrastive loss. IEEE Trans Med Imaging 41(9):2238–2251
Lin G, Bai H, Zhao J, Yun Z, Chen Y, Pang S, Feng Q (2022) Improving sensitivity and connectivity of retinal vessel segmentation via error discrimination network. Med Phys 49(7):4494–4507
Ma D, Lu D, Chen S, Heisler M, Dabiri S, Lee S, Lee H, Ding GW, Sarunic MV, Beg MF (2021) Lf-unet-a novel anatomical-aware dual-branch cascaded deep neural network for segmentation of retinal layers and fluid from optical coherence tomography images. Comput Med Imaging Graph 94:101988
Lu Y, Shen Y, Xing X, Ye C, Meng MQ-H (2023) Boundary-enhanced semi-supervised retinal layer segmentation in optical coherence tomography images using fewer labels. Comput Med Imaging Graph 105:102199
Huang C, Wang Z, Yuan G, Xiong Z, Hu J, Tong Y (2024) Pksea-net: A prior knowledge supervised edge-aware multi-task network for retinal arteriolar morphometry. Comput Biol Med 172:108255
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 3431–3440
Cao H, Wang Y, Chen J, Jiang D, Zhang X, Tian Q, Wang M (2022) Swin-unet: unet-like pure transformer for medical image segmentation. European conference on computer vision. Springer, pp 205–218
Azad R, Niggemeier L, Hüttemann M, Kazerouni A, Aghdam EK, Velichko Y, Bagci U, Merhof D (2024) Beyond self-attention: Deformable large kernel attention for medical image segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision. pp 1287–1297
Li J, Gao G, Yang L, Liu Y (2024) A retinal vessel segmentation network with multiple-dimension attention and adaptive feature fusion. Comput Biol Med 108315
Chen L-C, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587
Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39(12):2481–2495
Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer, pp. 234–241
Ahmed MR, Ashrafi AF, Ahmed RU, Shatabda S, Islam AM, Islam S (2023) Doubleu-netplus: a novel attention and context-guided dual u-net with multi-scale residual feature fusion network for semantic segmentation of medical images. Neural Comput Appl 35(19):14379–14401
Liu Y, Shen J, Yang L, Bian G, Yu H (2023) Resdo-unet: A deep residual network for accurate retinal vessel segmentation from fundus images. Biomed Signal Process Control 79:104087
Mou L, Zhao Y, Fu H, Liu Y, Cheng J, Zheng Y, Su P, Yang J, Chen L, Frangi AF et al (2021) Cs2-net: Deep learning segmentation of curvilinear structures in medical imaging. Med Image Anal 67:101874
Li W, Zhang H, Li F, Wang L (2022) Rps-net: an effective retinal image projection segmentation network for retinal vessels and foveal avascular zone based on octa data. Med Phys 49(6):3830–3844
Jun Guo B, He X, Lei Y, Harms J, Wang T, Curran WJ, Liu T, Jiang Zhang L, Yang X (2020) Automated left ventricular myocardium segmentation using 3d deeply supervised attention u-net for coronary computed tomography angiography; ct myocardium segmentation. Med Phys 47(4):1775–1785
Cui H, Yuwen C, Jiang L, Xia Y, Zhang Y (2021) Multiscale attention guided u-net architecture for cardiac segmentation in short-axis mri images. Comput Methods Programs Biomed 206:106142
Guo C, Szemenyei M, Yi Y, Wang W, Chen B, Fan C (2021) Sa-unet: Spatial attention u-net for retinal vessel segmentation. In: 2020 25th international conference on pattern recognition. IEEE, pp 1236–1242
Tang X, Zhong B, Peng J, Hao B, Li J (2020) Multi-scale channel importance sorting and spatial attention mechanism for retinal vessels segmentation. Appl Soft Comput 93:106353
Guo M-H, Lu C-Z, Liu Z-N, Cheng M-M, Hu S-M (2023) Visual attention network. Comput Vis Media 9(4):733–752
Dai J, Qi H, Xiong Y, Li Y, Zhang G, Hu H, Wei Y (2017) Deformable convolutional networks. In: Proceedings of the IEEE international conference on computer vision. pp 764–773
Gottlieb JP, Kusunoki M, Goldberg ME (1998) The representation of visual salience in monkey parietal cortex. Nature 391(6666):481–484
Treisman AM, Gelade G (1980) A feature-integration theory of attention. Cogn Psychol 12(1):97–136
Wolfe JM, Horowitz TS (2004) What attributes guide the deployment of visual attention and how do they do it? Nat Rev Neurosci 5(6):495–501
Tsotsos JK, Culhane SM, Wai WYK, Lai Y, Davis N, Nuflo F (1995) Modeling visual attention via selective tuning. Artif Intell 78(1–2):507–545
Ali A, Touvron H, Caron M, Bojanowski P, Douze M, Joulin A, Laptev I, Neverova N, Synnaeve G, Verbeek J et al (2021) Xcit: Cross-covariance image transformers. Adv Neural Inf Process Syst 34:20014–20027
Contributors M (2020) MMSegmentation: OpenMMLab Semantic Segmentation Toolbox and Benchmark. https://github.com/open-mmlab/mmsegmentation
Hoover A, Kouznetsova V, Goldbaum M (2000) Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response. IEEE Trans Med Imaging 19(3):203–210
Budai A, Bock R, Maier A, Hornegger J, Michelson G et al (2013) Robust vessel segmentation in fundus images. Int J Biomed Imaging 2013
Owen CG, Rudnicka AR, Nightingale CM, Mullen R, Barman SA, Sattar N, Cook DG, Whincup PH (2011) Retinal arteriolar tortuosity and cardiovascular risk factors in a multi-ethnic population study of 10-year-old children; the child heart and health study in england (chase). Arterioscler Thromb Vasc Biol 31(8):1933–1938
Kang B, Moon S, Cho Y, Yu H, Kang S-J (2024) Metaseg: Metaformer-based global contexts-aware network for efficient semantic segmentation. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision. pp 434–443
Cheng B, Misra I, Schwing AG, Kirillov A, Girdhar R (2022) Masked-attention mask transformer for universal image segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 1290–1299
Cheng B, Schwing A, Kirillov A (2021) Per-pixel classification is not all you need for semantic segmentation. Adv Neural Inf Process Syst 34:17864–17875
Zhang W, Pang J, Chen K, Loy CC (2021) K-net: Towards unified image segmentation. Adv Neural Inf Process Syst 34:10326–10338
Xie E, Wang W, Yu Z, Anandkumar A, Alvarez JM, Luo P (2021) Segformer: Simple and efficient design for semantic segmentation with transformers. Adv Neural Inf Process Syst 34:12077–12090
Strudel R, Garcia R, Laptev I, Schmid C (2021) Segmenter: Transformer for semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision. pp 7262–7272
Fan M, Lai S, Huang J, Wei X, Chai Z, Luo J, Wei X (2021) Rethinking bisenet for real-time semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 9716–9725
Pan H, Hong Y, Sun W, Jia Y (2022) Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes. IEEE Trans Intell Transp Syst 24(3):3448–3460
Xu M, Zhang Z, Wei F, Hu H, Bai X (2023) Side adapter network for open-vocabulary semantic segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 2945–2954
Acknowledgements
The authors wish to thank the editors and anonymous reviewers for their valuable comments and suggestions.
Funding
This work was supported by the Guangdong University of Science and Technology Research Natural Science Project with Grant No. GKY-2023KYYBK-17.
Author information
Authors and Affiliations
Contributions
Jiayao Li: Conceptualization, Methodology, Software, Validation, Writing - Original Draft, Visualization. Min Zeng: Formal analysis, Writing - review & editing, Supervision. Chenxi Wu: Investigation, Writing - review & editing, Supervision. Qianxiang Cheng: Resources, Data Curation, Writing - review & editing. Qiuyan Guo: Investigation, Writing - review & editing, Supervision. Song Li: Conceptualization, Supervision, Project administration, Funding acquisition.
Corresponding author
Ethics declarations
Competing Interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethical and Informed Consent for Data Used
This article does not contain any studies on human participants or animals.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, J., Zeng, M., Wu, C. et al. WLKA-RVS: a retinal vessel segmentation method using weighted large kernel attention. Appl Intell 55, 403 (2025). https://doi.org/10.1007/s10489-025-06309-4
Accepted:
Published:
DOI: https://doi.org/10.1007/s10489-025-06309-4