WLKA-RVS: a retinal vessel segmentation method using weighted large kernel attention

Jiayao Li¹,
Min Zeng²,
Chenxi Wu³,
Qianxiang Cheng¹,
Qiuyan Guo⁴ &
…
Song Li⁵


Abstract

Retinal vessel segmentation is an important task in medical image analysis, with wide application in the diagnosis and treatment of retinal diseases. However, existing segmentation methods still struggle to segment thin vessels accurately. To address this, we propose a Retinal Vessel Segmentation method based on Weighted Large Kernel Attention (WLKA-RVS), which aims to improve segmentation accuracy and thereby better assist physicians in clinical diagnosis and treatment. Our method consists of an encoder and a decoder. In the encoder, a convolution stem first reduces the dimension of the input image; features are then extracted by four stages of Swin Transformer modules, each with a downsampling layer. The decoder contains four stages of Weighted Large Kernel Attention Blocks (WLKAB), corresponding to the Swin Transformer stages in the encoder. WLKA-RVS then applies the Patch Expanding module for upsampling, and a final linear layer produces the output. We performed extensive experiments against several recent advanced models on three public datasets, on which WLKA-RVS led by 0.32%, 1.24%, and 0.71% in the mAcc metric, respectively. The inference speed of WLKA-RVS also met the real-time requirements of medical diagnosis. Together, these experiments demonstrate the efficiency, robustness, and applicability of WLKA-RVS.
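The abstract reports results in mAcc without defining it. The sketch below assumes the definition used by common segmentation toolboxes such as MMSegmentation: the mean, over classes, of the fraction of ground-truth pixels of each class that are predicted correctly. The function name `mean_class_accuracy` and the toy masks are hypothetical and are not taken from the paper.

```python
def mean_class_accuracy(pred, target, num_classes=2):
    """Mean over classes of (correctly predicted pixels of class c) /
    (ground-truth pixels of class c).

    Assumption: this is what "mAcc" denotes; the abstract does not define it.
    `pred` and `target` are 2-D lists of integer class labels.
    """
    correct = [0] * num_classes
    total = [0] * num_classes
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            total[t] += 1
            if p == t:
                correct[t] += 1
    # Skip classes that do not occur in the ground truth.
    per_class = [c / n for c, n in zip(correct, total) if n > 0]
    return sum(per_class) / len(per_class)


# Toy 3x4 masks: 0 = background, 1 = vessel (illustrative data only).
target = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 0],
]
pred = [
    [0, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 1, 1],
]

macc = mean_class_accuracy(pred, target)
pixel_acc = sum(
    p == t for pr, tr in zip(pred, target) for p, t in zip(pr, tr)
) / 12
print(f"mAcc = {macc:.4f}, pixel accuracy = {pixel_acc:.4f}")
```

Under this definition the vessel class carries the same weight as the much larger background class, so missed thin vessels lower mAcc more than they lower plain pixel accuracy.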

Data Availability

The data that support the findings of this study are openly available at https://cecas.clemson.edu/ahoover/stare/, https://www5.cs.fau.de/research/data/fundus-images/, and https://blogs.kingston.ac.uk/retinal/chasedb1/.

Acknowledgements

The authors wish to thank the editors and anonymous reviewers for their valuable comments and suggestions.

Funding

This work was supported by the Guangdong University of Science and Technology Research Natural Science Project with Grant No. GKY-2023KYYBK-17.

Author information

Authors and Affiliations

School of Computer Science and Engineering, Faculty of Innovation Engineering, Macau University of Science and Technology, Taipa, Macau, 999078, China
Jiayao Li & Qianxiang Cheng
College of Computer Science, Guangdong University of Science and Technology, Dongguan, 523083, PR China
Min Zeng
University International College, Macau University of Science and Technology, Taipa, Macau, 999078, China
Chenxi Wu
The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, PR China
Qiuyan Guo
Wangjiahe Street Community Health Service Center, Yueyang, 414000, PR China
Song Li

Contributions

Jiayao Li: Conceptualization, Methodology, Software, Validation, Writing - original draft, Visualization. Min Zeng: Formal analysis, Writing - review & editing, Supervision. Chenxi Wu: Investigation, Writing - review & editing, Supervision. Qianxiang Cheng: Resources, Data curation, Writing - review & editing. Qiuyan Guo: Investigation, Writing - review & editing, Supervision. Song Li: Conceptualization, Supervision, Project administration, Funding acquisition.

Corresponding author

Correspondence to Song Li.

Ethics declarations

Competing Interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical and Informed Consent for Data Used

This article does not contain any studies on human participants or animals.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Li, J., Zeng, M., Wu, C. et al. WLKA-RVS: a retinal vessel segmentation method using weighted large kernel attention. Appl Intell 55, 403 (2025). https://doi.org/10.1007/s10489-025-06309-4

Accepted: 28 January 2025
Published: 04 February 2025
DOI: https://doi.org/10.1007/s10489-025-06309-4
