
CT-Net: Asymmetric compound branch Transformer for medical image segmentation

Published: 12 April 2024

Abstract

The Transformer architecture has been widely applied to image segmentation because of its powerful ability to capture long-range dependencies. However, its ability to capture local features is relatively weak, and it requires large amounts of training data. Medical image segmentation, by contrast, places high demands on local features and typically involves small datasets, so existing Transformer networks suffer a significant performance drop when applied directly to this task. To address these issues, we design a new medical image segmentation architecture called CT-Net. It effectively extracts local and global representations through an asymmetric, asynchronous branch-parallel structure while avoiding unnecessary computational cost. In addition, we propose a high-density information fusion strategy that efficiently fuses the features of the two branches using a fusion module with only 0.05M parameters. This strategy ensures high portability and allows transfer learning to be applied directly, alleviating the Transformer's dependence on large datasets. Finally, we design a parameter-adjustable multi-perceptive loss function for this architecture that optimizes training from both the pixel-level and global perspectives. We evaluate the network on 5 tasks across 9 datasets: compared with SwinUNet, CT-Net improves IoU by 7.3% on GlaS and 1.8% on MoNuSeg, and improves the average DSC on the Synapse dataset by 3.5%.
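
The abstract does not give the exact form of the multi-perceptive loss; as a minimal sketch of a parameter-adjustable loss that mixes a pixel-level term with a global term (here binary cross-entropy and soft Dice, combined by an assumed weight `alpha`; all names are illustrative, not from the paper), consider:

```python
import numpy as np

def bce_loss(pred, target, eps=1e-7):
    # Pixel-level perspective: mean binary cross-entropy over all pixels.
    p = np.clip(pred, eps, 1.0 - eps)
    return float(-(target * np.log(p) + (1 - target) * np.log(1 - p)).mean())

def dice_loss(pred, target, eps=1e-6):
    # Global perspective: soft Dice overlap computed over the whole mask.
    inter = float((pred * target).sum())
    return 1.0 - (2.0 * inter + eps) / (float(pred.sum() + target.sum()) + eps)

def multi_perceptive_loss(pred, target, alpha=0.5):
    # alpha is the adjustable parameter balancing the two perspectives.
    return alpha * bce_loss(pred, target) + (1.0 - alpha) * dice_loss(pred, target)
```

Values of `alpha` closer to 1 emphasize per-pixel accuracy, while smaller values emphasize region-level overlap, which is the usual trade-off such compound losses tune.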

Highlights

An efficient asymmetric CNN and Transformer parallel framework.
A high-density information fusion strategy with a 0.05M-parameter fusion module.
A customized multi-perceptive loss function optimizes the training process from pixel-level and global perspectives.
A low-coupling branch design principle gives the architecture high portability.
Transfer learning support alleviates the data dependency problem of Transformers.
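
For scale, the stated 0.05M parameter budget of the fusion module is on the order of a single 1x1 convolution over concatenated branch features. With hypothetical channel widths (illustrative only, not taken from the paper), a quick count:

```python
def conv_params(c_in, c_out, k=1, bias=True):
    """Parameter count of a k x k convolution layer."""
    return c_in * c_out * k * k + (c_out if bias else 0)

# Hypothetical widths: concatenate a 256-channel CNN feature map with a
# 256-channel Transformer feature map, then fuse with one 1x1 convolution
# down to 96 channels.
params = conv_params(256 + 256, 96, k=1)
print(f"{params / 1e6:.3f}M")  # prints "0.049M", on the order of 0.05M
```

This back-of-envelope arithmetic only shows why such a fusion module can stay tiny compared with the backbone branches; the paper's actual module may differ.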

References

[1]
Bernal J., Sánchez F.J., Fernández-Esparrach G., Gil D., Miguel C.R., Vilariño F., WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs saliency maps from physicians, Computerized Medical Imaging and Graphics 43 (2015) 99–111.
[2]
Cao H., Wang Y., Chen J., Jiang D., Zhang X., Tian Q., et al., Swin-Unet: Unet-like pure transformer for medical image segmentation, 2021, arXiv:2105.05537.
[3]
Chen J.N., Lu Y.Y., Yu Q.H., Luo X.L., Adeli E., Wang W., et al., TransUNet: Transformers make strong encoders for medical image segmentation, 2021, arXiv:2102.04306.
[4]
Cheng J., Tian S., Yu L., Gao C., Kang X., Ma X., et al., ResGANet: Residual group attention network for medical image classification and segmentation, Medical Image Analysis 76 (2022).
[5]
Chu X., Tian Z., Wang Y., Zhang B., Ren H., Wei X., et al., Twins: Revisiting the design of spatial attention in vision transformers, in: NeurIPS, 2021.
[6]
Chu X., Zhang B., Tian Z., Wei X., Xia H., Do we really need explicit position encodings for vision transformers?, 2021, arXiv:2102.10882.
[7]
Çiçek Ö., Abdulkadir A., Lienkamp S.S., Brox T., Ronneberger O., 3D U-Net: Learning dense volumetric segmentation from sparse annotation, in: Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016, Springer, Cham, 2016, pp. 424–432.
[8]
d’Ascoli S., Touvron H., Leavitt M.L., Morcos A.S., Biroli G., Sagun L., ConViT: Improving vision transformers with soft convolutional inductive biases, in: ICML, 2021.
[9]
Dong B., Wang W., Fan D., Li J., Fu H., Shao L., Polyp-PVT: Polyp segmentation with pyramid vision transformers, 2021, arXiv:2108.06932.
[10]
Dosovitskiy A., Beyer L., Kolesnikov A., Weissenborn D., Zhai X., Unterthiner T., et al., An image is worth 16x16 words: Transformers for image recognition at scale, 2021, arXiv:2010.11929.
[11]
Dutta A., Dubey A., Detection of liver cancer using image processing techniques, in: 2019 International Conference on Communication and Signal Processing, IEEE, 2019, pp. 0315–0318.
[12]
Fan D.P., Ji G.P., Zhou T., Chen G., Fu H., Shen J., et al., PraNet: Parallel reverse attention network for polyp segmentation, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, Cham, 2020, pp. 263–273.
[13]
Fang X., Shi Y., Guo Q., Wang L., Liu Z., Sub-band based attention for robust polyp segmentation, in: International Joint Conference on Artificial Intelligence, 2023.
[14]
Feng S., Zhao H., Shi F., Cheng X., Wang M., Ma Y., et al., CPFNet: Context pyramid fusion network for medical image segmentation, IEEE Transactions on Medical Imaging 39 (2020) 3008–3018.
[15]
Fu J., Liu J., Tian H., et al., Dual attention network for scene segmentation, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 3146–3154.
[16]
Fu S., Lu Y., Wang Y., Zhou Y., Shen W., Fishman E.K., et al., Domain adaptive relational reasoning for 3D multi-organ segmentation, 2020, arXiv:2005.09120.
[17]
Gu Z., Cheng J., Fu H., Zhou K., Hao H., Zhao Y., et al., CE-Net: Context encoder network for 2D medical image segmentation, IEEE Transactions on Medical Imaging 38 (2019) 2281–2292.
[18]
Gutman D.A., Codella N.C., Celebi M.E., Helba B., Marchetti M.A., Mishra N.K., et al., Skin lesion analysis toward melanoma detection: A challenge at the 2017 International Symposium on Biomedical Imaging (ISBI), hosted by the International Skin Imaging Collaboration (ISIC), in: 2018 IEEE 15th International Symposium on Biomedical Imaging, 2018, pp. 168–172.
[19]
Heidari M., Kazerouni A., Soltany M., Azad R., Aghdam E.K., Cohen-Adad J., et al., HiFormer: Hierarchical multi-scale representations using Transformers for medical image segmentation, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023, pp. 6202–6212.
[20]
Hu J., Shen L., Sun G., Squeeze-and-excitation networks, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 7132–7141.
[21]
Huang X., Deng Z., Li D., Yuan X., MISSFormer: An effective medical image segmentation transformer, 2021, arXiv:2109.07162.
[22]
Huang H., Meng F., Zhou S., Brain image segmentation based on FCM clustering algorithm and rough set, IEEE Access 7 (2019) 12386–12396.
[23]
Huang C., Wu H., Lin Y.S., HarDNet-MSEG: A simple encoder-decoder polyp segmentation neural network that achieves over 0.9 mean dice and 86 FPS, 2021, arXiv:2101.07172.
[24]
Ibtehaz N., Rahman M.S., MultiResUNet: Rethinking the U-Net architecture for multimodal biomedical image segmentation, Neural Networks 121 (2020) 74–87.
[25]
Isensee F., Jaeger P.F., Kohl S.A., Petersen J., Maier-Hein K.H., nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation, Nature Methods 18 (2) (2021) 203–211.
[26]
Jha D., Riegler M., Johansen D., Halvorsen P., Johansen H.D., DoubleU-Net: A deep convolutional neural network for medical image segmentation, in: 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems, 2020, pp. 558–564.
[27]
Jha D., Smedsrud P.H., Riegler M.A., Halvorsen P., Lange T.D., Johansen D., et al., Kvasir-SEG: A segmented polyp dataset, in: International Conference on Multimedia Modeling, Springer, Cham, 2020, pp. 451–462.
[28]
Jha D., Smedsrud P.H., Riegler M., Johansen D., Lange T.D., Halvorsen P., et al., ResUNet++: An advanced architecture for medical image segmentation, in: 2019 IEEE International Symposium on Multimedia, 2019, pp. 225–2255.
[29]
Ji Y., Zhang R., Wang H., Li Z., Wu L., Zhang S., et al., Multi-compound transformer for accurate biomedical image segmentation, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, Cham, 2021, pp. 326–336.
[30]
Khaniabadi S.M., Ibrahim H., Huqqani I.A., Khaniabadi F.M., Sakim H.A.M., Teoh S.S., Comparative review on traditional and deep learning methods for medical image segmentation, in: 2023 IEEE 14th Control and System Graduate Research Colloquium, IEEE, 2023, pp. 45–50.
[31]
Kim T., Lee H., Kim D., UACANet: Uncertainty augmented context attention for polyp segmentation, in: Proceedings of the 29th ACM International Conference on Multimedia, 2021.
[32]
Kumar N., Verma R., Sharma S., Bhargava S.K., Vahadane A., Sethi A., A dataset and a technique for generalized nuclear segmentation for computational pathology, IEEE Transactions on Medical Imaging 36 (2017) 1550–1560.
[33]
Landman B., Xu Z., Iglesias J.E., Styner M., Langerak T., Klein A., MICCAI multi-atlas labeling beyond the cranial vault – workshop and challenge, in: Proceedings of MICCAI: Multi-Atlas Labeling Beyond Cranial Vault – Workshop and Challenge, 2015.
[34]
Lei T., Sun R., Wang X., Wang Y., He X., Nandi A., CiT-Net: Convolutional neural networks hand in hand with vision transformers for medical image segmentation, 2023, arXiv:2306.03373.
[35]
Li X., Pang S., Zhang R., Zhu J., Fu X., Tian Y., et al., AtTransUNet: An enhanced hybrid transformer architecture for ultrasound and histopathology image segmentation, Computers in Biology and Medicine 152 (2023).
[36]
Lin A., Chen B., Xu J., Zhang Z., Lu G., DS-TransUNet: Dual swin transformer U-Net for medical image segmentation, 2022, arXiv:2106.06716.
[37]
Lin W., Wu Z., Chen J., Huang J., Jin L., Scale-aware modulation meet transformer, 2023, arXiv:2307.08579.
[38]
Liu Z., Lin Y., Cao Y., Hu H., Wei Y., Zhang Z., et al., Swin Transformer: Hierarchical vision transformer using shifted windows, in: 2021 IEEE/CVF International Conference on Computer Vision, 2021, pp. 9992–10002.
[39]
Long J., Shelhamer E., Darrell T., Fully convolutional networks for semantic segmentation, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 3431–3440.
[40]
Luo G., Zhou Y., Sun X., Wang Y., Cao L., Wu Y., et al., Towards lightweight transformer via group-wise transformation for vision-and-language tasks, IEEE Transactions on Image Processing 31 (2022) 3386–3398.
[41]
Milletari F., Navab N., Ahmadi S., V-Net: Fully convolutional neural networks for volumetric medical image segmentation, in: 2016 Fourth International Conference on 3D Vision, 2016, pp. 565–571.
[42]
Oktay O., Schlemper J., Folgoc L.L., Lee M.C.H., Heinrich M.P., Misawa K., Mori K., et al., Attention U-Net: Learning where to look for the pancreas, 2018, arXiv:1804.03999.
[43]
Pan X., Ye T., Xia Z., Song S., Huang G., Slide-Transformer: Hierarchical vision transformer with local self-attention, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 2082–2091.
[44]
Peng Z., Huang W., Gu S., Xie L., Wang Y., Jiao J., et al., Conformer: Local features coupling global representations for visual recognition, in: 2021 IEEE/CVF International Conference on Computer Vision, 2021, pp. 357–366.
[45]
Qiu S., Li C., Feng Y., Zuo S., Liang H., Xu A., GFANet: Gated fusion attention network for skin lesion segmentation, Computers in Biology and Medicine 155 (2023).
[46]
Ronneberger O., Fischer P., Brox T., U-Net: Convolutional networks for biomedical image segmentation, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, Cham, 2015, pp. 234–241.
[47]
Silva J., Histace A., Romain O., Dray X., Granado B., Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer, International Journal of Computer Assisted Radiology and Surgery 9 (2013) 283–293.
[48]
Sirinukunwattana K., Pluim J.P., Chen H., Qi X., Heng P., Guo Y.B., et al., Gland segmentation in colon histology images: The GlaS challenge contest, Medical Image Analysis 35 (2017) 489–502.
[49]
Tajbakhsh N., Gurudu S.R., Liang J., Automated polyp detection in colonoscopy videos using shape and context information, IEEE Transactions on Medical Imaging 35 (2016) 630–644.
[50]
Valanarasu J.M., Oza P., Hacihaliloglu I., Patel V.M., Medical Transformer: Gated axial-attention for medical image segmentation, in: MICCAI, 2021.
[51]
Vaswani A., Shazeer N.M., Parmar N., Uszkoreit J., Jones L., Gomez A.N., et al., Attention is all you need, 2017, arXiv:1706.03762.
[52]
Vázquez D., Bernal J., Sánchez F.J., Fernández-Esparrach G., López A.M., Romero A., et al., A benchmark for endoluminal scene segmentation of colonoscopy images, Journal of Healthcare Engineering 2017 (2017).
[53]
Wang H., Cao P., Wang J., Zaiane O.R., UCTransNet: Rethinking the skip connections in U-Net from a channel-wise perspective with transformer, in: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36, No. 3, 2022, pp. 2441–2449.
[54]
Wang W., Xie E., Li X., Fan D., Song K., Liang D., et al., Pyramid vision transformer: A versatile backbone for dense prediction without convolutions, in: 2021 IEEE/CVF International Conference on Computer Vision, 2021, pp. 548–558.
[55]
Wang H., Xie S., Lin L., Iwamoto Y., Han X., Chen Y., et al., Mixed transformer U-Net for medical image segmentation, in: ICASSP, 2022.
[56]
Woo S., Park J., Lee J.Y., et al., CBAM: Convolutional block attention module, in: Proceedings of the European Conference on Computer Vision, 2018, pp. 3–19.
[57]
Wu H., Chen S., Chen G., Wang W., Lei B., Wen Z., FAT-Net: Feature adaptive Transformers for automated skin lesion segmentation, Medical Image Analysis 76 (2022).
[58]
Wu Y., Liu Y., Zhan X., Cheng M., P2T: Pyramid pooling transformer for scene understanding, IEEE Transactions on Pattern Analysis and Machine Intelligence (2021) 1–12.
[59]
Wu H., Xiao B., Codella N.C., Liu M., Dai X., Yuan L., et al., CvT: Introducing convolutions to vision transformers, in: 2021 IEEE/CVF International Conference on Computer Vision, 2021, pp. 22–31.
[60]
Xie E., Wang W., Yu Z., Anandkumar A., Álvarez J.M., Luo P., SegFormer: Simple and efficient design for semantic segmentation with transformers, in: NeurIPS, 2021.
[61]
Yu Y., Li Y., Wang J., Guan H., Li F., Xiao S., et al., C2-CapsViT: Cross-context and cross-scale capsule vision transformers for remote sensing image scene classification, IEEE Geoscience and Remote Sensing Letters 19 (2022) 1–5.
[62]
Yu Z., Yu L., Zheng W., Wang S., EIU-Net: Enhanced feature extraction and improved skip connections in U-Net for skin lesion segmentation, Computers in Biology and Medicine (2023).
[63]
Zhang Y., Liu H., Hu Q., TransFuse: Fusing transformers and CNNs for medical image segmentation, in: MICCAI, 2021.
[64]
Zhang Z., Liu Q., Wang Y., Road extraction by deep residual U-Net, IEEE Geoscience and Remote Sensing Letters 15 (2018) 749–753.
[65]
Zheng S., Lu J., Zhao H., Zhu X., Luo Z., Wang Y., et al., Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers, in: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 6877–6886.
[66]
Zhou Z., Rahman Siddiquee M.M., Tajbakhsh N., Liang J., UNet++: A nested U-Net architecture for medical image segmentation, in: Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Springer, Cham, 2018, pp. 3–11.


Published In

Neural Networks  Volume 170, Issue C
Feb 2024
662 pages

Publisher

Elsevier Science Ltd.

United Kingdom


Author Tags

  1. Transformer
  2. Medical image segmentation
  3. CNN
  4. Loss function
  5. Feature fusion

Qualifiers

  • Research-article
