Long Bai*, Tong Chen*, Qiaozhi Tan*, Wan Jun Nah, Yanheng Li, Zhicheng He, Sishen Yuan, Zhen Chen, Jinlin Wu, Mobarakol Islam, Zhen Li, Hongbin Liu, Hongliang Ren
[arXiv](https://arxiv.org/abs/2406.13705)
If you find our code, paper, or dataset useful, please cite the paper as:
@article{bai2024endouic,
title={EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy},
author={Bai, Long and Tan, Qiaozhi and Chen, Tong and Nah, Wan Jun and Li, Yanheng and He, Zhicheng and Yuan, Sishen and Chen, Zhen and Wu, Jinlin and Islam, Mobarakol and others},
journal={arXiv preprint arXiv:2406.13705},
year={2024}
}
Wireless Capsule Endoscopy (WCE) is highly valued for its non-invasive and painless approach, though its effectiveness is compromised by uneven illumination caused by hardware constraints and complex internal dynamics, leading to overexposed or underexposed images. While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting different exposure levels remains underexplored. To tackle this, we introduce EndoUIC, a unified illumination correction solution for WCE based on an end-to-end promptable diffusion transformer (DFT) model. In our work, the illumination prompt module guides the model to adapt to different exposure levels and perform targeted image enhancement, while the Adaptive Prompt Integration (API) and Global Prompt Scanner (GPS) modules further strengthen the joint representation learning between the prompt parameters and image features. In addition, the U-shaped restoration DFT model captures long-range dependencies and contextual information for unified illumination restoration. Moreover, we present a novel Capsule-endoscopy Exposure Correction (CEC) dataset, comprising ground-truth and corrupted image pairs annotated by expert photographers. Extensive experiments against a variety of state-of-the-art (SOTA) methods on four datasets demonstrate the effectiveness of our proposed method and its components in WCE illumination restoration, and additional downstream experiments further demonstrate its utility for clinical diagnosis and surgical assistance.
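For readers who want a concrete picture of the prompting idea, here is a minimal, self-contained PyTorch sketch of how a learnable illumination prompt bank can be blended and fused into restoration features. The class name `PromptInjection` and all hyperparameters are illustrative placeholders of ours, not the actual API or GPS implementation in EndoUIC.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PromptInjection(nn.Module):
    """Illustrative prompt block: a bank of learnable prompt tensors is
    blended according to the input feature statistics, then fused back
    into the feature map (in the spirit of PromptIR-style prompting)."""

    def __init__(self, channels: int, num_prompts: int = 4, prompt_size: int = 16):
        super().__init__()
        # Bank of learnable prompts, one per (coarse) illumination condition.
        self.prompts = nn.Parameter(torch.randn(num_prompts, channels, prompt_size, prompt_size))
        # Predicts per-prompt mixing weights from globally pooled features.
        self.weight_head = nn.Linear(channels, num_prompts)
        # Fuses the blended prompt back into the incoming features.
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        # Soft weights over the prompt bank, conditioned on the input.
        weights = F.softmax(self.weight_head(x.mean(dim=(2, 3))), dim=1)  # (b, num_prompts)
        prompt = torch.einsum("bn,nchw->bchw", weights, self.prompts)
        prompt = F.interpolate(prompt, size=(h, w), mode="bilinear", align_corners=False)
        return self.fuse(torch.cat([x, prompt], dim=1))

# Example: inject an illumination prompt into a 64-channel feature map.
feats = torch.randn(2, 64, 32, 32)
out = PromptInjection(channels=64)(feats)
print(out.shape)  # torch.Size([2, 64, 32, 32])
```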
For environment setup, please follow these instructions:
git clone https://github.com/longbai-cuhk/EndoUIC.git
cd EndoUIC
conda create -n EndoUIC python=3.7
conda activate EndoUIC
conda install pytorch==1.7.0 torchvision torchaudio cudatoolkit=11.0 -c pytorch
cd BasicSR-light
pip install -r requirements.txt
BASICSR_EXT=True sudo $(which python) setup.py develop
cd ../EndoUIC
pip install -r requirements.txt
BASICSR_EXT=True sudo $(which python) setup.py develop
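After both `setup.py develop` steps succeed, a quick sanity check (ours, not part of the repository) can confirm that the pinned versions are active; this assumes the BasicSR-light fork exposes the usual `basicsr` package name:

```python
# Run inside the activated conda env to verify the environment above
# (PyTorch 1.7.0 built for CUDA 11.0, plus the BasicSR develop install).
import torch
import torchvision
import basicsr  # installed by the `setup.py develop` step above

print("torch:", torch.__version__)            # expected: 1.7.0
print("torchvision:", torchvision.__version__)
print("basicsr:", basicsr.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("built for CUDA:", torch.version.cuda)  # expected: 11.0
    print("device:", torch.cuda.get_device_name(0))
```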
You also need to download the following datasets (see the loader sketch after the list):
- Kvasir-Capsule Dataset and Red Lesion Endoscopy Dataset
  - The low-light and ground-truth image pairs are released by LLCaps.
- Endo4IE Dataset
- Capsule-endoscopy Exposure Correction (CEC) Dataset
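How the image pairs are organized on disk varies between these datasets. As a reference point, here is a minimal PyTorch loader for corrupted/ground-truth pairs matched by filename; the `input/` and `gt/` folder names are assumptions of ours that you should adapt to each dataset's actual layout:

```python
import os
from PIL import Image
from torch.utils.data import Dataset
import torchvision.transforms.functional as TF

class PairedExposureDataset(Dataset):
    """Minimal paired loader: corrupted inputs in `input/`, references in
    `gt/`, matched by filename. Adjust the folder names as needed."""

    def __init__(self, root: str):
        self.input_dir = os.path.join(root, "input")
        self.gt_dir = os.path.join(root, "gt")
        self.names = sorted(os.listdir(self.input_dir))

    def __len__(self):
        return len(self.names)

    def __getitem__(self, idx):
        name = self.names[idx]
        lq = Image.open(os.path.join(self.input_dir, name)).convert("RGB")
        gt = Image.open(os.path.join(self.gt_dir, name)).convert("RGB")
        return TF.to_tensor(lq), TF.to_tensor(gt)
```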
Train your model with one GPU by running
CUDA_VISIBLE_DEVICES=0 python endouic/train.py -opt options/train_v1.yaml
Training arguments can be modified in 'options/train_v1.yaml'.
Train your model with two or more GPUs by running
CUDA_VISIBLE_DEVICES=0,1,...,n-1 python -m torch.distributed.launch --nproc_per_node=n --master_port=12345 endouic/train.py -opt options/train_v1.yaml --launcher pytorch
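For context, `--launcher pytorch` tells the training script to initialize distributed training from the environment that `torch.distributed.launch` sets up. A stripped-down sketch of that handshake under PyTorch 1.7 (ours, not the repository's code) looks like this:

```python
import argparse
import torch
import torch.distributed as dist

# Under PyTorch 1.7, torch.distributed.launch spawns one process per GPU,
# passes --local_rank to each, and exports RANK / WORLD_SIZE / MASTER_ADDR
# / MASTER_PORT so that env:// initialization works out of the box.
parser = argparse.ArgumentParser()
parser.add_argument("--local_rank", type=int, default=0)
args, _ = parser.parse_known_args()

torch.cuda.set_device(args.local_rank)
dist.init_process_group(backend="nccl")  # reads rank/world size from the env
print(f"rank {dist.get_rank()} / {dist.get_world_size()} ready")
```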
Conduct model inference by running
CUDA_VISIBLE_DEVICES=0 python endouic/train.py -opt options/infer.yaml
Please install the dependencies required for evaluation, then run:
cd evaluation
python evaluation.py -dir_A /[GT_PATH] -dir_B /[GENERATED_IMAGE_PATH]
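The script compares each generated image against its ground-truth counterpart. As an illustration of the kind of full-reference comparison involved, the snippet below computes average PSNR and SSIM over two folders using scikit-image ≥ 0.19 (older versions use `multichannel=True` instead of `channel_axis`); it is a stand-in of ours, not `evaluation.py` itself:

```python
import os
import numpy as np
from PIL import Image
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def folder_metrics(dir_a: str, dir_b: str):
    """Average PSNR/SSIM between same-named images in two folders."""
    psnrs, ssims = [], []
    for name in sorted(os.listdir(dir_a)):
        gt = np.asarray(Image.open(os.path.join(dir_a, name)).convert("RGB"))
        out = np.asarray(Image.open(os.path.join(dir_b, name)).convert("RGB"))
        psnrs.append(peak_signal_noise_ratio(gt, out, data_range=255))
        ssims.append(structural_similarity(gt, out, channel_axis=2, data_range=255))
    return float(np.mean(psnrs)), float(np.mean(ssims))

print(folder_metrics("/path/to/gt", "/path/to/generated"))
```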
The code is partly built on PyDiff, PromptIR, and VMamba.