Abstract
The use of machine vision and deep learning for intelligent industrial inspection has become increasingly important in automating the production processes. Despite the fact that machine vision approaches are used for industrial inspection, deep learning-based defect segmentation has not been widely studied. While state-of-the-art segmentation methods are often tuned for a specific purpose, extending them to unknown sets or other datasets, such as defect segmentation datasets, require further analysis. In addition, recent contributions and improvements in image segmentation methods have not been extensively investigated for defect segmentation. To address these problems, we conducted a comparative experimental study on several recent state-of-the-art deep learning-based segmentation methods for steel surface defect segmentation and evaluated them on the basis of segmentation performance, processing time, and computational complexity using two public datasets, NEU-Seg and Severstal Steel Defect Detection (SSDD). In addition we proposed and trained a hybrid transformer-based encoder with CNN-based decoder head and achieved state-of-the-art results, a Dice score of 95.22% (NEU-Seg) and 95.55% (SSDD).
Similar content being viewed by others
Data Availability
The datasets that are used for this experimental study are publicly available steel surface defect segmentation datasets. The NEU-Seg dataset if from [10] and the SDDD dataset can be accessed from [39]. In addition, the specific partition of the dataset used during training and experimentation can be provided up on request.
Code Availability
The implementation code for this study including the trained weights will soon be available at: https://github.com/djene-mengistu/dseg_models
References
Abdou M A (2022) Literature review: efficient deep neural networks techniques for medical image analysis. Neural Comput Appl 34(8):5791–5812
Abu M, Amir A, Lean YH et al (2021) The performance analysis of transfer learning for steel defect detection by using deep learning. In: J Phys: Conf Series. IOP Publishing, p 012041
Ahmed K R (2023) Dsteelnet: a real-time parallel dilated cnn with atrous spatial pyramid pooling for detecting and classifying defects in surface steel strips. Sensors 23(1):544
Badrinarayanan V, Kendall A, Cipolla R (2016) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. arXiv:1511.00561 [cs]
Bozkurt F (2022) Skin lesion classification on dermatoscopic images using effective data augmentation and pre-trained deep learning approach. Multimed Tools Applic, 1–19
Chen L-C, Papandreou G, Schroff F et al (2017) Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587
Chen L-C, Zhu Y, Papandreou G et al (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 833–851, DOI https://doi.org/10.1007/978-3-030-01234-2_49
Damacharla P, Rao A, Ringenberg J et al (2021) Tlu-net: a deep learning approach for automatic steel surface defect detection. In: 2021 International Conference on Applied Artificial Intelligence (ICAPAI). IEEE, pp 1–6
Demir K, Ay M, Cavas M et al (2022) Automated steel surface defect detection and classification using a new deep learning-based approach. Neural Comput Appl, 1–18
Dong H, Song K, He Y et al (2020) Pga-net: pyramid feature fusion and global context attention network for automated surface defect detection. IEEE Trans Industr Inf 16(12):7448–7458. https://doi.org/10.1109/TII.2019.2958826
Dosovitskiy A, Beyer L, Kolesnikov A et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv:2010.11929
Elizar E, Zulkifley M A, Muharar R et al (2022) A review on multiscale-deep-learning applications. Sensors 22(19):7384
Fu G, Sun P, Zhu W et al (2019) A deep-learning-based approach for fast and robust steel surface defects classification. Opt Lasers Eng 121:397–405. https://doi.org/10.1016/j.optlaseng.2019.05.005
Fu J, Liu J, Tian H et al (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3146–3154
Gao Y, Gao L, Li X et al (2020) A semi-supervised convolutional neural network-based method for steel surface defect recognition. Robot Comput-Integr Manuf 61:101825. https://doi.org/10.1016/j.rcim.2019.101825
Hao R, Lu B, Cheng Y et al (2020) A steel surface defect inspection approach towards smart industrial monitoring. J Intell Manuf. https://doi.org/10.1007/s10845-020-01670-2, [Online; accessed 2021-05-17]
He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, Las Vegas, pp 770–778, DOI https://doi.org/10.1109/CVPR.2016.90
He Y, Song K, Meng Q et al (2020) An end-to-end steel surface defect detection approach via fusing multiple hierarchical features. IEEE Trans Instrum Meas 69(4):1493–1504. https://doi.org/10.1109/TIM.2019.2915404
Huang Z, Wang X, Huang L et al (2019) Ccnet: criss-cross attention for semantic segmentation. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 603–612
Huang Y, Qiu C, Wang X et al (2020) A compact convolutional neural network for surface defect inspection. Sensors 20(7):1974. https://doi.org/10.3390/s20071974
Huang Z, Wu J, Xie F (2021) Automatic surface defect segmentation for hot-rolled steel strip using depth-wise separable u-shape network. Mater Lett 301:130271. https://doi.org/10.1016/j.matlet.2021.130271
Kirillov A, Girshick R, He K et al (2019) Panoptic feature pyramid networks. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, Long Beach, pp 6392–6401, DOI https://doi.org/10.1109/CVPR.2019.00656
Liu Z, Yang B, Duan G et al (2020) Visual defect inspection of metal part surface via deformable convolution and concatenate feature pyramid neural networks. IEEE Trans Instrum Meas 69(12):9681–9694. https://doi.org/10.1109/TIM.2020.3001695
Liu Z, Lin Y, Cao Y et al (2021) Swin transformer: hierarchical vision transformer using shifted windows. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE, Montreal, pp 9992–10002, DOI https://doi.org/10.1109/ICCV48922.2021.00986
Liu Z, Mao H, Wu C-Y et al (2022) A convnet for the 2020s. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11976–11986
Liu Z, Zeng Z, Li J, Teng S (2022) Automatic detection and quantification of hot-rolled steel surface defects using deep learning. Arab J Sci Eng, 1–13
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. CVPR2015, 10
Luo Q, Fang X, Su J et al (2020) Automated visual defect classification for flat steel surface: a survey. IEEE Trans Instrum Meas 69(12):9329–9349. https://doi.org/10.1109/TIM.2020.3030167
Luo Q, Fang X, Sun Y et al (2019) Surface defect classification for hot-rolled steel strips by selectively dominant local binary patterns. IEEE Access 7:23488–23499. https://doi.org/10.1109/ACCESS.2019.2898215
Ma J, Wang Y, Shi C, Lu C (2018) Fast surface defect detection using improved gabor filters. In: 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, pp 1508–1512, DOI https://doi.org/10.1109/ICIP.2018.8451351, (to appear in print)
Mehta S, Rastegari M, Caspi A et al (2018) Espnet: efficient spatial pyramid of dilated convolutions for semantic segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV), DOI https://doi.org/10.1007/978-3-030-01249-6_34
Paszke A, Chaurasia A, Kim S et al (2016) Enet: a deep neural network architecture for real-time semantic segmentation. arXiv:1606.02147 [cs]
Peng C, Zhang X, Yu G et al (2017) Large kernel matters – improve semantic segmentation by global convolutional network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), DOI https://doi.org/10.1109/CVPR.2017.189
Prappacher N, Bullmann M, Bohn G et al (2020) Defect detection on rolling element surface scans using neural image segmentation. Appl Sci 10(9):3290. https://doi.org/10.3390/app10093290
Qian K (2019) Automated detection of steel defects via machine learning based on real-time semantic segmentation. In: Proceedings of the 3rd international conference on video and image processing , pp 42–46
Ren Z, Fang F, Yan N et al (2021) State of the art in defect detection based on machine vision. International Journal of Precision Engineering and Manufacturing-Green Technology. https://doi.org/10.1007/s40684-021-00343-6
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: Medical image computing and computer-assisted intervention – MICCAI 2015, vol 9351. Springer International Publishing, pp 234–241, DOI https://doi.org/10.1007/978-3-319-24574-4_28
Sandler M, Howard A, Zhu M et al (June 2018) Mobilenetv2: inverted residuals and linear bottlenecks. In: 2018 IEEE/CVF Conference on computer vision and pattern recognition. IEEE, Salt Lake City, pp 4510–4520, DOI https://doi.org/10.1109/CVPR.2018.00474
Severstal (2019) Severstal: Steel defect detection, kaggle challange 2019. https://www.kaggle.com/c/severstal-steel-defect-detection
Sime D M, Wang G, Zeng Z et al (2022) Semi-supervised defect segmentation with pairwise similarity map consistency and ensemble-based cross-pseudo labels. IEEE Trans Industr Inf. https://doi.org/10.1109/TII.2022.3230785
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Suvdaa B, Ahn J, Ko J (2012) Steel surface defects detection and classification using sift and voting strategy. Int J Softw Eng Applic 6(2):6
Tabernik D, Šela S, Skvarč J et al (2020) Segmentation-based deep-learning approach for surface-defect detection. J Intell Manuf 31 (3):759–776. https://doi.org/10.1007/s10845-019-01476-x
Tan M, Le Q (2019) Efficientnet: rethinking model scaling for convolutional neural networks. In: International conference on machine learning. PMLR, pp 6105–6114
Tang B, Chen L, Sun W et al (2022) Review of surface defect detection of steel products based on machine vision. IET Image Proc
Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. Advances in neural information processing systems, 30
Wang J, Sun K, Cheng T et al (2021) Deep high-resolution representation learning for visual recognition. IEEE Trans Pattern Anal Mach Intell 43 (10):3349–3364. https://doi.org/10.1109/TPAMI.2020.2983686
Wang J, Zhang Q, Liu G (2022) Drcdct-net: a steel surface defect diagnosis method based on a dual-route cross-domain convolution-transformer network. Meas Sci Technol 33(9):095404
Wang W, Xie E, Li X et al (2022) Pvt v2: improved baselines with pyramid vision transformer. Comput Vis Media 8(3):415–424. https://doi.org/10.1007/s41095-022-0274-8
Wanin M (1993) In-line metallurgical process control in the steel industry. Le Journal de Physique IV 03(C7):C7–1101–C7–1107. https://doi.org/10.1051/jp4:19937172
Wu H, Zhang J, Huang K et al (2019) Fastfcn: rethinking dilated convolution in the backbone for semantic segmentation. arXiv:1903.11816 [cs]
Wu T, Tang S, Zhang R et al (2021) Cgnet: a light-weight context guided network for semantic segmentation. IEEE Trans Image Process 30:1169–1179. https://doi.org/10.1109/TIP.2020.3042065
Xiao T, Liu Y, Zhou B et al (2018) Unified perceptual parsing for scene understanding. In: Proceedings of the European conference on computer vision (ECCV), pp 418–434, DOI https://doi.org/10.1007/978-3-030-01228-1_26
Xie S, Girshick R, Dollar P et al (2017) Aggregated residual transformations for deep neural networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, pp 5987–5995, DOI https://doi.org/10.1109/CVPR.2017.634
Xie E, Wang W, Yu Z et al (2021) Segformer: simple and efficient design for semantic segmentation with transformers. In: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), p 14
Xu R, Hao R, Huang B (2022) Efficient surface defect detection using self-supervised learning strategy and segmentation network. Adv Eng Inform 52:101566
Yan H, Zhang C, Wu M (2022) Lawin transformer: improving semantic segmentation transformer with multi-scale representations via large window attention. arXiv:2201.01615
Yu F, Koltun V (2016) Multi-scale context aggregation by dilated convolutions. arXiv:1511.07122 [cs]
Yu C, Wang J, Peng C et al (2018) Bisenet: bilateral segmentation network for real-time semantic segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV)
Yuan Y, Huang L, Guo J et al (2021) Ocnet: object context for semantic segmentation. Int J Comput Vision 129(8):2375–2398. https://doi.org/10.1007/s11263-021-01465-9
Zhang H, Dana K, Shi J et al (2018) Context encoding for semantic segmentation. In: 2018 IEEE/CVF Conference on computer vision and pattern recognition, Salt Lake City, pp 7151–7160, DOI https://doi.org/10.1109/CVPR.2018.00747
Zhang C, Patras P, Haddadi H (2019) Deep learning in mobile and wireless networking: a survey. IEEE Commun Surv Tutor 21(3):2224–2287
Zhang H, Goodfellow I, Metaxas D et al (2019) Self-attention generative adversarial networks. In: Proceedings of the 36th International Conference on Machine Learning, Long Beach, pp 7354–7363
Zhang Q, Yang Y-B (2021) Rest: an efficient transformer for visual recognition. In: Advances in neural information processing systems, vol 34. Curran Associates, Inc., pp 15475–15485
Zhao H, Shi J, Qi X et al (2017) Pyramid scene parsing network. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, Honolulu, pp 6230–6239, DOI https://doi.org/10.1109/CVPR.2017.660
Zhao H, Qi X, Shen X et al (2018) Icnet for real-time semantic segmentation on high-resolution images. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 418–434
Zheng S, Lu J, Zhao H et al (2021) Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, pp 6877–6886, DOI https://doi.org/10.1109/CVPR46437.2021.00681
Zheng X, Zheng S, Kong Y et al (2021) Recent advances in surface defect inspection of industrial products using deep learning techniques. Int J Adv Manuf Technol 113(1-2):35–58. https://doi.org/10.1007/s00170-021-06592-8
Acknowledgments
This work was supported by Natural Science Foundation of China (No. 51975107, No.52175292), Science and Technology Project of Sichuan Province (No. 2020ZDZX0015).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethics approval
Not applicable
Conflict of Interests
The authors declare that they have no known competing interests and all the materials and methods used in the study are duly acknowledged.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sime, D.M., Wang, G., Zeng, Z. et al. Deep learning-based automated steel surface defect segmentation: a comparative experimental study. Multimed Tools Appl 83, 2995–3018 (2024). https://doi.org/10.1007/s11042-023-15307-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15307-y