Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.20236 (cs)

[Submitted on 29 Mar 2024]

Title: Long-Tailed Anomaly Detection with Learnable Class Names

Authors: Chih-Hui Ho, Kuan-Chuan Peng, Nuno Vasconcelos

Abstract: Anomaly detection (AD) aims to identify defective images and localize their defects (if any). Ideally, AD models should detect defects across many image classes without relying on hard-coded class names that can be uninformative or inconsistent across datasets; learn without anomaly supervision; and be robust to the long-tailed distributions of real-world applications. To address these challenges, we formulate the problem of long-tailed AD by introducing several datasets with different levels of class imbalance and metrics for performance evaluation. We then propose a novel method, LTAD, to detect defects from multiple and long-tailed classes, without relying on dataset class names. LTAD combines AD by reconstruction and semantic AD modules. AD by reconstruction is implemented with a transformer-based reconstruction module. Semantic AD is implemented with a binary classifier, which relies on learned pseudo-class names and a pretrained foundation model. These modules are learned over two phases. Phase 1 learns the pseudo-class names and a variational autoencoder (VAE) for feature synthesis, which augments the training data to combat long tails. Phase 2 then learns the parameters of the reconstruction and classification modules of LTAD. Extensive experiments using the proposed long-tailed datasets show that LTAD substantially outperforms the state-of-the-art methods for most forms of dataset imbalance. The long-tailed dataset split is available at this https URL.
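
To make "different levels of class imbalance" concrete, here is a minimal sketch of how per-class training-set sizes might be derived for a long-tailed split. The abstract does not specify how the proposed splits are constructed. The exponential-decay profile, the function name `long_tailed_counts`, and the parameters `n_max` and `imbalance` are assumptions chosen for illustration, following a convention common in long-tailed recognition work, and are not the authors' procedure.

```python
from typing import List


def long_tailed_counts(n_max: int, num_classes: int, imbalance: float) -> List[int]:
    """Per-class sample counts under an assumed exponential-decay profile.

    Class 0 (the head) keeps n_max samples; the last class (the tail)
    keeps roughly n_max / imbalance samples. Hypothetical helper, not
    taken from the paper.
    """
    if n_max < 1 or num_classes < 1 or imbalance < 1:
        raise ValueError("n_max and num_classes must be >= 1, imbalance >= 1")
    if num_classes == 1:
        return [n_max]
    counts = []
    for i in range(num_classes):
        # Fraction of the way from the head class (0.0) to the tail class (1.0).
        frac = i / (num_classes - 1)
        # Decay the count geometrically; never drop a class entirely.
        counts.append(max(1, round(n_max * imbalance ** (-frac))))
    return counts


# Example: 5 classes, head class with 200 images, imbalance factor 100.
counts = long_tailed_counts(n_max=200, num_classes=5, imbalance=100)
print(counts)
```

A larger `imbalance` value gives a steeper tail, so sweeping it is one way to produce several imbalance levels from the same source dataset.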

Comments:	This paper is accepted to CVPR 2024. The supplementary material is included. The long-tailed dataset split is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.20236 [cs.CV]
	(or arXiv:2403.20236v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.20236

Submission history

From: Kuan-Chuan Peng
[v1] Fri, 29 Mar 2024 15:26:44 UTC (7,897 KB)
