Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning

Zhongbin Sun¹⁵,¹⁶, Xiaolong Li¹⁶, Yiran Li¹⁷ & Yue Ma¹⁶

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 15042)

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

Abstract

Unsupervised anomaly detection is a challenging computer vision task. 2D-based anomaly detection methods have been studied extensively, whereas multimodal anomaly detection based on RGB images and 3D point clouds requires further investigation. Existing multimodal methods are mainly inspired by the memory bank based methods commonly used in 2D anomaly detection, which may cost extra memory for storing multimodal features. In the present study, a novel memoryless method, MDSS, is proposed for multimodal anomaly detection. It employs a lightweight student-teacher network and a signed distance function to learn from RGB images and 3D point clouds, respectively, and combines the complementary anomaly information from the two modalities. Specifically, the student-teacher network is trained with a dynamic loss on normal RGB images and masks generated from point clouds, and an anomaly score map is obtained from the discrepancy between the outputs of the student and the teacher. The signed distance function learns from normal point clouds to predict the signed distances between points and the surface, and the predicted signed distances are used to generate a second anomaly score map. The two anomaly score maps are then aligned to produce the final anomaly score map for detection. Experimental results indicate that MDSS is comparable to, but more stable than, the SOTA memory bank based method Shape-guided, and that it performs better than the other baseline methods.
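The scoring pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify the network outputs, the alignment step, or the fusion rule. The function names, the per-pixel mean squared discrepancy, the standardisation with statistics from normal data, and the element-wise maximum are all assumptions made for illustration.

```python
import numpy as np

def discrepancy_map(teacher_feat, student_feat):
    """RGB branch: per-pixel mean squared difference over channels.
    Inputs are assumed to be feature maps shaped (C, H, W)."""
    return np.mean((teacher_feat - student_feat) ** 2, axis=0)

def sdf_map(signed_dist):
    """Point-cloud branch: absolute predicted signed distance per pixel.
    A point far from the learned normal surface gets a high score."""
    return np.abs(signed_dist)

def align(score_map, ref_mean, ref_std, eps=1e-8):
    """Assumed alignment: standardise a map with statistics that would be
    collected on normal training data, so both maps share one scale."""
    return (score_map - ref_mean) / (ref_std + eps)

def fuse(rgb_map, pc_map):
    """Assumed fusion rule: element-wise maximum of the aligned maps."""
    return np.maximum(rgb_map, pc_map)

# Toy 2x2 example with hypothetical values.
teacher = np.zeros((2, 2, 2))
student = np.zeros((2, 2, 2))
student[:, 0, 0] = 2.0            # student disagrees with teacher at pixel (0, 0)
signed = np.zeros((2, 2))
signed[1, 1] = -3.0               # point lies off the surface at pixel (1, 1)

rgb_scores = align(discrepancy_map(teacher, student), ref_mean=0.0, ref_std=1.0)
pc_scores = align(sdf_map(signed), ref_mean=0.0, ref_std=1.0)
final_map = fuse(rgb_scores, pc_scores)
image_score = final_map.max()     # one possible image-level anomaly score
```

In this toy case each modality flags a different pixel, and the fused map keeps both responses, which is the complementary behaviour the abstract attributes to combining the two modalities.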

Supported by the Fundamental Research Funds for the Central Universities under Grant No. 2021QN1075.

Notes

1. Recently accepted in CVPR 2024 (https://cvpr.thecvf.com/Conferences/2024/AcceptedPapers).

Author information

Authors and Affiliations

Mine Digitization Engineering Research Center of the Ministry of Education, Xuzhou, 221116, Jiangsu, China
Zhongbin Sun
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, 221116, Jiangsu, China
Zhongbin Sun, Xiaolong Li & Yue Ma
Sun Yueqi Honors College, China University of Mining and Technology, Xuzhou, 221116, Jiangsu, China
Yiran Li

Corresponding author

Correspondence to Zhongbin Sun.

Editor information

Editors and Affiliations

Peking University, Beijing, China
Zhouchen Lin
Nankai University, Tianjin, China
Ming-Ming Cheng
Chinese Academy of Sciences, Beijing, China
Ran He
Xinjiang University, Ürümqi, Xinjiang, China
Kurban Ubul
Xinjiang University, Ürümqi, China
Wushouer Silamu
Peking University, Beijing, China
Hongbin Zha
Tsinghua University, Beijing, China
Jie Zhou
Chinese Academy of Sciences, Beijing, China
Cheng-Lin Liu

About this paper

Cite this paper

Sun, Z., Li, X., Li, Y., Ma, Y. (2025). Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning. In: Lin, Z., et al. Pattern Recognition and Computer Vision. PRCV 2024. Lecture Notes in Computer Science, vol 15042. Springer, Singapore. https://doi.org/10.1007/978-981-97-8858-3_31

DOI: https://doi.org/10.1007/978-981-97-8858-3_31
Published: 03 November 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-8857-6
Online ISBN: 978-981-97-8858-3
eBook Packages: Computer Science, Computer Science (R0)
