Study on defect detection of metal castings based on supervised enhancement and attention distillation

Haixia Pan¹, Han Zhao¹, Xingyun Wei¹, Dongdong Zhang¹, Biao Dong¹ & Jiahua Lan¹

Abstract

Metal castings are essential components, and their quality strongly affects the performance of the final product. Current techniques for identifying surface defects in metal castings often suffer from inadequate recognition accuracy. Compared with other models, Deformable DETR performs well at detecting defects in metal castings, but its attention mechanism inherently demands a larger sample size and converges slowly. The metal casting dataset collected for this paper has a limited number of samples and imbalanced data, which limits the overall effectiveness of Deformable DETR on this task. To address this, this paper proposes a supervised enhancement algorithm for one-to-many assignment based on Deformable DETR. The algorithm accelerates convergence and reduces data requirements while eliminating Non-Maximum Suppression post-processing. Applying the supervised enhancement algorithm to DETR on this dataset also improves defect recall. In addition, attention distillation is applied to the deformable attention mechanism to reduce time and space complexity to O(L log L). It creates favourable conditions for generating attention weights, which change from a linear transformation of the query to a query-key interaction, and it emphasises casting defects while preserving global attention. Furthermore, data processing methods such as image slicing and data augmentation are used to enhance casting defect detection. As a result, the recall and precision for metal casting defects improve to 97.7% and 85.0%, respectively.
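As a rough illustration of the image slicing step mentioned in the abstract, the sketch below cuts a large image into overlapping tiles so that small defects occupy a larger share of each input. It is a minimal sketch under stated assumptions, not the authors' procedure: the tile size, the overlap, and the function names `slice_starts`, `slice_boxes` and `to_full_image` are hypothetical, since the preview does not give the slicing parameters.

```python
def slice_starts(length, tile, overlap):
    """Start offsets of tiles along one axis (hypothetical helper).

    Tiles advance by (tile - overlap); the last tile is shifted back so
    that it ends exactly at the image border.
    """
    if not 0 <= overlap < tile:
        raise ValueError("overlap must satisfy 0 <= overlap < tile")
    if length <= tile:
        return [0]
    stride = tile - overlap
    starts = list(range(0, length - tile, stride))
    starts.append(length - tile)
    return starts


def slice_boxes(width, height, tile=512, overlap=64):
    """Return (x0, y0, x1, y1) crop windows covering the whole image.

    tile=512 and overlap=64 are assumed values for illustration only.
    """
    return [
        (x, y, min(x + tile, width), min(y + tile, height))
        for y in slice_starts(height, tile, overlap)
        for x in slice_starts(width, tile, overlap)
    ]


def to_full_image(box, window):
    """Shift a box predicted inside a tile back to full-image coordinates."""
    x0, y0 = window[0], window[1]
    return (box[0] + x0, box[1] + y0, box[2] + x0, box[3] + y0)


# Example: a 1024 x 768 image split into 512 x 512 tiles with 64 px overlap.
windows = slice_boxes(1024, 768)
```

Detections produced on each tile would then be mapped back with `to_full_image` and merged across tiles; how overlapping detections are merged is not described in the preview and is left out here.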

Data Availability

The data will be provided upon request.

Acknowledgements

Thanks to all the authors for their contributions to this article.

Funding

The authors did not receive support from any organization for the submitted work.

Author information

Haixia Pan and Han Zhao contributed equally to this work.

Authors and Affiliations

School of Software, Beihang University, Beijing, 100191, China
Haixia Pan, Han Zhao, Xingyun Wei, Dongdong Zhang, Biao Dong & Jiahua Lan

Contributions

Conceptualization, HZ; methodology, HP, HZ, XW, DZ, BD, JL; software, HZ; validation, HZ; formal analysis, HZ; investigation, HZ; resources, HP, HZ; data curation, HZ; writing - original draft preparation, HZ; writing - review and editing, HP, XW, DZ, BD, JL; visualization, HZ; supervision, HP; project administration, HP. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Haixia Pan.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical approval

This article does not contain any studies with human participants performed by any of the authors.

Informed consent

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Supervised enhancement algorithm pseudocode

The ground-truth labels are replicated n times before Hungarian matching is executed. The pseudocode gives an example with 2x replication (n = 2).
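The replication step described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' pseudocode: the matching cost is a plain L1 distance between boxes (the cost actually used in the paper is not given in this preview), the optimal assignment is found by exhaustive search as a stand-in for the Hungarian algorithm, and all function names are hypothetical.

```python
from itertools import permutations


def replicate_targets(targets, n):
    """Repeat the ground-truth boxes n times, remembering each original index."""
    return [(gt_idx, box) for _ in range(n) for gt_idx, box in enumerate(targets)]


def l1_cost(pred, gt):
    """Assumed matching cost: L1 distance between two (cx, cy, w, h) boxes."""
    return sum(abs(p - g) for p, g in zip(pred, gt))


def optimal_assignment(preds, boxes):
    """Minimum-cost one-to-one matching by exhaustive search.

    Stand-in for the Hungarian algorithm; only practical for tiny inputs.
    Returns a tuple where entry t is the query index matched to boxes[t].
    """
    best_cost, best = float("inf"), None
    for perm in permutations(range(len(preds)), len(boxes)):
        cost = sum(l1_cost(preds[q], boxes[t]) for t, q in enumerate(perm))
        if cost < best_cost:
            best_cost, best = cost, perm
    return best


def one_to_many_match(preds, targets, n=2):
    """Match each ground-truth box to n queries by replicating it n times."""
    if len(preds) < n * len(targets):
        raise ValueError("need at least n * len(targets) queries")
    replicated = replicate_targets(targets, n)
    assignment = optimal_assignment(preds, [box for _, box in replicated])
    matches = {}
    for (gt_idx, _), query_idx in zip(replicated, assignment):
        matches.setdefault(gt_idx, []).append(query_idx)
    return {gt_idx: sorted(qs) for gt_idx, qs in matches.items()}


# Five query predictions and two ground-truth boxes, replicated 2x.
preds = [
    (0.20, 0.20, 0.10, 0.10),
    (0.22, 0.21, 0.10, 0.12),
    (0.70, 0.70, 0.20, 0.20),
    (0.68, 0.72, 0.20, 0.18),
    (0.50, 0.10, 0.05, 0.05),
]
targets = [(0.21, 0.20, 0.10, 0.10), (0.70, 0.71, 0.20, 0.20)]
matches = one_to_many_match(preds, targets, n=2)
```

In this toy example each ground-truth box ends up supervising two queries instead of one, which is the sense in which replication turns one-to-one matching into one-to-many assignment. For realistic numbers of queries the exhaustive search would be replaced by a proper Hungarian solver such as `scipy.optimize.linear_sum_assignment`.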

Appendix B: Attention distillation algorithm pseudocode

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Pan, H., Zhao, H., Wei, X. et al. Study on defect detection of metal castings based on supervised enhancement and attention distillation. Machine Vision and Applications 35, 57 (2024). https://doi.org/10.1007/s00138-024-01536-0

Received: 26 November 2023
Revised: 01 March 2024
Accepted: 18 March 2024
Published: 23 April 2024
DOI: https://doi.org/10.1007/s00138-024-01536-0