Multi-Scale Attention Feature Enhancement Network for Single Image Dehazing
<p>The architecture of the MAFE.</p> "> Figure 2
<p>Structure of the DFB module.</p> "> Figure 3
<p>Structure of the SPDC.</p> "> Figure 4
<p>Structure of the MAE module.</p> "> Figure 5
<p>Structure of the AFE module.</p> "> Figure 6
<p>Structure of PA.</p> "> Figure 7
<p>PSNR and SSIM learning curves. The red line and blue line are the change curves of PSNR and SSIM respectively as steps increase.</p> "> Figure 8
<p>Visual results comparison on SOTS-outdoor dataset.</p> "> Figure 9
<p>Visual results comparison on NHHAZE dataset.</p> "> Figure 10
<p>Visual results comparison on RTTS dataset.</p> "> Figure 11
<p>Visual results comparison on real-world images.</p> ">
Abstract
:1. Introduction
- (1)
- We propose an end-to-end Multi-Scale Attention Feature Enhancement Networks for Single Image Dehazing (MAFE). This method has achieved excellent performance in image dehazing. It can adaptively focus on the high-frequency information of the hazy image and retain more detailed information. Since it does not need to rely on the atmospheric scattering model, it is not affected by the estimated atmospheric light value. Additionally, color distortion is avoided.
- (2)
- We propose an attention feature enhancement module, which can adaptively focus on high-frequency information of hazy images, enhance the relevance of contextual information, suppress redundant information, and compensate for the loss of detailed information.
- (3)
- We propose a multi-scale attention enhancement module that builds upon the attention feature enhancement module and incorporates a spatial pyramid of dilated convolutions to fully extract and utilize the multi-scale features of the image. This module expands the receptive field and improves the quality of the dehazed image while preserving more detailed information.
- (4)
- The experimental results on both synthetic and real-world hazy images demonstrate that our proposed method achieves state-of-the-art single image dehazing methods in terms of dehazing performance. It can well preserve details such as color and texture of the image.
2. Related Work
3. Proposed Method
3.1. Haze Density Image Prediction Model
3.2. Network Architecture
3.3. DFB Mathematical Model
3.4. MAE Module Mathematical Model
3.5. AFE Module Mathematical Model
3.6. Context Enhancement Module Mathematical Model
3.7. Loss Function
3.7.1. Loss
3.7.2. Perceptual Loss
4. Experimental Results
4.1. Datasets
4.2. Implementation Details
4.3. Experimental Results on Synthetic Hazy Images
4.4. Experimental Results on Real-World Hazy Images
4.5. Ablation Study
- (1)
- When our proposed method does not include the SPDC and CEM modules, that is, the baseline network, the dehazing results are the worst.
- (2)
- When the SPDC module is added to the Baseline network, compared with the dehazing results of the Baseline network, the PSNR scores is increased by 2.26, the SSIM scores is increased by 0.0152, and the LPIPS distance decreased by 0.0028, which proves the effectiveness of the SPDC module in improving the dehazing performance.
- (3)
- When adding the CEM module on the baseline network, that is, the AFE module we proposed, compared with the dehazing results of the baseline network, the PSNR scores is increased by 2.48, the SSIM scores is increased by 0.0184, and the LPIPS distance decreased by 0.0028, which proves the dehazing performance of the proposed AFE module.
- (4)
- When both SPDC module and CEM module are added to baseline network, that is, the MAE module proposed in this paper, the network model is our proposed method. The PSNR and SSIM scores are the highest and the LPIPS distance is the shortest in Table 3, indicating that our proposed MAE module and method have the best dehazing performance, demonstrating the superiority of our proposed MAE module and method.
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Sharma, T.; Debaque, B.; Duclos, N.; Chehri, A.; Kinder, B.; Fortier, P. Deep Learning-Based Object Detection and Scene Perception Under Bad Weather Conditions. Electronics 2022, 11, 563. [Google Scholar] [CrossRef]
- Wu, W.; Chang, H.; Zheng, Y.; Li, Z.; Chen, Z.; Zhang, Z. Contrastive Learning-Based Robust Object Detection Under Smoky Conditions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 19–24 June 2022; pp. 4295–4302. [Google Scholar]
- Amrani, M.; Bey, A.; Amamra, A. New SAR Target Recognition Based on YOLO and Very Deep Multi-Canonical Correlation Analysis. Int. J. Remote Sens. 2022, 43, 5800–5819. [Google Scholar] [CrossRef]
- Chen, Z.; Wu, R.; Lin, Y.; Li, C.; Chen, S.; Yuan, Z.; Zou, X. Plant Disease Recognition Model Based on Improved YOLOv5. Agronomy 2022, 12, 365. [Google Scholar] [CrossRef]
- Engin, D.; Genç, A.; Kemal Ekenel, H. Cycle-Dehaze: Enhanced Cyclegan for Single Image Dehazing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, UT, USA, 18–22 June 2018; pp. 825–833. [Google Scholar]
- Hodges, C.; Bennamoun, M.; Rahmani, H. Single Image Dehazing Using Deep Neural Networks. Pattern Recogn. Lett. 2019, 128, 70–77. [Google Scholar] [CrossRef]
- Song, Y.; He, Z.; Qian, H.; Du, X. Vision Transformers for Single Image Dehazing. IEEE Trans. Image Process. 2023, 32, 1927–1941. [Google Scholar] [CrossRef]
- Wang, A.; Wang, W.; Liu, J.; Gu, N. AIPNet: Image-to-Image Single Image Dehazing with Atmospheric Illumination Prior. IEEE Trans. Image Process. 2018, 28, 381–393. [Google Scholar] [CrossRef]
- Hide, R. Optics of the atmosphere: Scattering by molecules and particles. Phys. Bull. 1977, 28, 521. [Google Scholar] [CrossRef]
- Narasimhan, S.G.; Nayar, S.K. Chromatic Framework for Vision in Bad Weather. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Hilton Head, SC, USA, 15 June 2000; pp. 598–605. [Google Scholar]
- Narasimhan, S.G.; Nayar, S.K. Contrast Restoration of Weather Degraded Images. IEEE Trans. Pattern Anal. Mach. Intel. 2003, 25, 713–724. [Google Scholar] [CrossRef]
- Zhang, X.; Jiang, R.; Wang, T.; Luo, W. Single image dehazing via dual-path recurrent network. IEEE Trans. Image Process. 2021, 30, 5211–5222. [Google Scholar] [CrossRef]
- Chen, Z.; He, Z.; Lu, Z. DEA-Net: Single image dehazing based on detail-enhanced convolution and content-guided attention. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Vancouver, BC, Canada, 18–22 June 2023. [Google Scholar]
- Wu, H.; Qu, Y.; Lin, S.; Zhou, J.; Qiao, R.; Zhang, Z.; Ma, L. Contrastive learning for compact single image dehazing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Kuala Lumpur, Malaysia, 18–20 December 2021; pp. 10551–10560. [Google Scholar]
- Huang, P.; Zhao, L.; Jiang, R.; Wang, T.; Zhang, X. Self-filtering image dehazing with self-supporting module. Neurocomputing 2021, 432, 57–69. [Google Scholar] [CrossRef]
- He, K.; Sun, J.; Tang, X. Single Image Haze Removal Using Dark Channel Prior. IEEE Trans. Pattern Anal. Mach. Intel. 2011, 33, 2341–2353. [Google Scholar]
- Zhu, Q.; Mai, J.; Shao, L. A Fast Single Image Haze Removal Algorithm Using Color Attenuation Prior. IEEE Trans. Image Process. 2015, 24, 3522–3533. [Google Scholar] [PubMed]
- Berman, D.; Avidan, S. Non-local Image Dehazing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 1674–1682. [Google Scholar]
- Fattal, R. Single Image Dehazing. ACM Trans. Graph. 2008, 27, 1–9. [Google Scholar] [CrossRef]
- Dalborgo, V.; Murari, T.B.; Madureira, V.S.; Moraes, J.G.L.; Bezerra, V.M.O.; Santos, F.Q.; Monteiro, R.L. Traffic Sign Recognition with Deep Learning: Vegetation Occlusion Detection in Brazilian Environments. Sensors 2023, 23, 5919. [Google Scholar] [CrossRef]
- Esteva, A.; Chou, K.; Yeung, S.; Naik, N.; Madani, A.; Mottaghi, A.; Socher, R. Deep Learning-Enabled Medical Computer Vision. NPJ Digit. Med. 2021, 4, 5. [Google Scholar] [CrossRef]
- Ma, W.; Liu, Z.; Kudyshev, Z.A.; Boltasseva, A.; Cai, W.; Liu, Y. Deep Learning for The Design of Photonic Structures. Nat. Photonics 2021, 15, 77–90. [Google Scholar] [CrossRef]
- Tran, K.A.; Kondrashova, O.; Bradley, A.; Williams, E.D.; Pearson, J.V.; Waddell, N. Deep Learning in Cancer Diagnosis, Prognosis and Treatment Selection. Genome Med. 2021, 13, 152. [Google Scholar] [CrossRef]
- Cai, B.; Xu, X.; Jia, K.; Qing, C.; Tao, D. Dehazenet: An End-to-End System for Single Image Haze Removal. IEEE Trans. Image Process. 2016, 25, 5187–5198. [Google Scholar] [CrossRef]
- Ren, W.; Liu, S.; Zhang, H.; Pan, J.; Cao, X.; Yang, M.H. Single image dehazing via multi-scale convolutional neural networks. In Proceedings of the 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Springer: Cham, Switzerland, 2016; pp. 154–169. [Google Scholar]
- Li, B.; Peng, X.; Wang, Z.; Xu, J.; Feng, D. AOD-Net: All-in-One Dehazing Network. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 4780–4788. [Google Scholar]
- Liao, Y.; Su, Z.; Liang, X.; Qiu, B. Hdp-Net: Haze Density Prediction Network for Nighttime Dehazing. In Proceedings of the Pacific Rim Conference on Multimedia, Hefei, China, 21–22 December 2018; Springer: Cham, Switzerland, 2018; pp. 469–480. [Google Scholar]
- Chen, D.; He, M.; Fan, Q.; Liao, J.; Zhang, L.; Hou, D.; Hua, G. Gated context aggregation network for image dehazing and deraining. In Proceedings of the 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 7–11 January 2019; pp. 1375–1383. [Google Scholar]
- Shao, Y.; Li, L.; Ren, W.; Gao, C.; Sang, N. Domain Adaptation for Image Dehazing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 2808–2817. [Google Scholar]
- Qin, X.; Wang, Z.; Bai, Y.; Xie, X.; Jia, H. FFA-Net: Feature Fusion Attention Network for Single Image Dehazing. In Proceedings of the 2020 AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; pp. 11908–11915. [Google Scholar]
- Liu, X.; Ma, Y.; Shi, Z.; Chen, J. Griddehazenet: Attention-based multi-scale network for image dehazing. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27–31 October 2019; pp. 7314–7323. [Google Scholar]
- Wang, N.; Cui, Z.; Su, Y.; He, C.; Li, A. Multiscale supervision-guided context aggregation network for single image dehazing. IEEE Signal Process. Lett. 2021, 29, 70–74. [Google Scholar] [CrossRef]
- Sheng, J.; Lv, G.; Du, G.; Wang, Z.; Feng, Q. Multi-scale residual attention network for single image dehazing. Digit. Signal Process. 2022, 121, 103327. [Google Scholar] [CrossRef]
- Zhou, Y.; Chen, Z.; Sheng, B.; Li, P.; Kim, J.; Wu, E. AFF-Dehazing: Attention-based feature fusion network for low-light image Dehazing. Comput. Animat. Virtual Worlds 2021, 32, 3–4. [Google Scholar] [CrossRef]
- Wang, S.; Zhou, T.; Lu, Y.; Di, H. Contextual Transformation Network for Lightweight Remote-Sensing Image Super-Resolution. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5615313. [Google Scholar] [CrossRef]
- Mehta, S.; Rastegari, M.; Caspi, A.; Shapiro, L.; Hajishirzi, H. Espnet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 552–568. [Google Scholar]
- Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. Cbam: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
- Li, B.; Ren, W.; Fu, D.; Tao, D.; Feng, D.; Zeng, W.; Wang, Z. Benchmarking Single-Image Dehazing and Beyond. IEEE Trans. Image Process. 2019, 1, 492–505. [Google Scholar] [CrossRef] [PubMed]
- Ancuti, C.O.; Ancuti, C.; Timofte, R. NH-HAZE: An Image Dehazing Benchmark with Non-Homogeneous Hazy and Haze-Free Images. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA, 14–19 June 2020; pp. 1798–1805. [Google Scholar]
- He, T.; Zhang, Z.; Zhang, H.; Zhang, Z.; Xie, J.; Li, M. Bag of Tricks for Image Classification with Convolutional Neural Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–21 June 2019; pp. 558–567. [Google Scholar]
- Amyar, A.; Modzelewski, R.; Vera, P.; Morard, V.; Ruan, S. Multi-task multi-scale learning for outcome prediction in 3D PET images. Comput. Biol. Med. 2022, 151, 106208. [Google Scholar] [CrossRef]
Method | SOTS-Outdoor | ||||
---|---|---|---|---|---|
PSNR | SSIM | LPIPS | |||
DCP | 21.81 | 0.8583 | 0.0527 | — | — |
CAP | 22.09 | 0.8829 | 0.0427 | — | — |
AOD-Net | 20.29 | 0.8765 | 0.0880 | 0.002 | 0.101 |
Dehaze-Net | 22.46 | 0.8514 | 0.0390 | 0.008 | 0.450 |
GridDehaze-Net | 30.86 | 0.9819 | 0.0053 | 0.956 | 452.01 |
FFA-Net | 33.38 | 0.9804 | 0.0049 | 4.456 | 1010.86 |
Ours | 33.74 | 0.9843 | 0.0040 | 3.070 | 679.31 |
Image No. | DCP | CAP | Dehaze-Net | AOD-Net | GridDehaze-Net | FFA-Net | Our Method |
---|---|---|---|---|---|---|---|
1 | 13.36/0.4838/0.386 | 13.52/0.4881/0.396 | 12.48/0.4715/0.410 | 13.38/0.4589/0.439 | 12.77/0.4854/0.413 | 18.65/0.6605/0.217 | 21.14/0.6628/0.213 |
2 | 11.83/0.4087/0.492 | 12.62/0.4060/0.498 | 11.16/0.3726/0.534 | 14.56/0.4253/0.537 | 12.13/0.4005/0.497 | 18.86/0.6109/0.288 | 20.88/0.6261/0.282 |
3 | 11.69/0.4977/0.460 | 12.10/0.4994/0.448 | 10.77/0.4882/0.477 | 11.50/0.4613/0.475 | 10.42/0.5044/0.446 | 17.00/0.6621/0.264 | 18.09/0.6647/0.260 |
4 | 13.84/0.5359/0.345 | 14.15/0.5303/0.342 | 12.94/0.5231/0.357 | 14.27/0.4562/0.434 | 13.65/0.5416/0.385 | 19.85/0.7259/0.167 | 22.34/0.7282/0.159 |
Method | PSNR | SSIM | Lpips |
---|---|---|---|
Baseline | 26.71 | 0.9514 | 0.0269 |
Baseline + SPDC | 28.97 | 0.9666 | 0.0241 |
Baseline + CEM | 29.19 | 0.9698 | 0.0241 |
Baseline + SPDC + CEM | 29.50 | 0.9706 | 0.0232 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Dong, W.; Wang, C.; Sun, H.; Teng, Y.; Xu, X. Multi-Scale Attention Feature Enhancement Network for Single Image Dehazing. Sensors 2023, 23, 8102. https://doi.org/10.3390/s23198102
Dong W, Wang C, Sun H, Teng Y, Xu X. Multi-Scale Attention Feature Enhancement Network for Single Image Dehazing. Sensors. 2023; 23(19):8102. https://doi.org/10.3390/s23198102
Chicago/Turabian StyleDong, Weida, Chunyan Wang, Hao Sun, Yunjie Teng, and Xiping Xu. 2023. "Multi-Scale Attention Feature Enhancement Network for Single Image Dehazing" Sensors 23, no. 19: 8102. https://doi.org/10.3390/s23198102
APA StyleDong, W., Wang, C., Sun, H., Teng, Y., & Xu, X. (2023). Multi-Scale Attention Feature Enhancement Network for Single Image Dehazing. Sensors, 23(19), 8102. https://doi.org/10.3390/s23198102