Computer Science > Computer Vision and Pattern Recognition

arXiv:2302.10719 (cs)

[Submitted on 21 Feb 2023 (v1), last revised 27 Sep 2023 (this version, v2)]

Title: Memory-augmented Online Video Anomaly Detection

Authors: Leonardo Rossi, Vittorio Bernuzzi, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati


Abstract: The ability to understand the surrounding scene is of paramount importance for Autonomous Vehicles (AVs). This paper presents a system that works in an online fashion, responding immediately when anomalies arise around the AV and relying only on video captured by a dash-mounted camera. Our architecture, called MOVAD, relies on two main modules: a Short-Term Memory Module, implemented by a Video Swin Transformer (VST), which extracts information related to the ongoing action, and a Long-Term Memory Module, injected inside the classifier, which also considers remote past information and action context through a Long Short-Term Memory (LSTM) network. The strengths of MOVAD lie not only in its excellent performance but also in its straightforward and modular architecture, trained in an end-to-end fashion on RGB frames only and with as few assumptions as possible, which makes it easy to implement and experiment with. We evaluated our method on the Detection of Traffic Anomaly (DoTA) dataset, a challenging collection of dash-mounted camera videos of accidents. Following an extensive ablation study, MOVAD reaches an AUC score of 82.17%, surpassing the current state of the art by +2.87 AUC. Our code will be available on this https URL
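
For readers unfamiliar with the metric quoted above, the following is a minimal sketch of how an AUC (area under the ROC curve) score can be computed from per-frame anomaly scores and binary ground-truth labels. It is not the authors' evaluation code: the function name `roc_auc`, the frame-level granularity, and the toy data are assumptions made for illustration. It uses the standard rank-sum (Mann-Whitney U) formulation, with tied scores sharing their average rank.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) formulation.

    labels: iterable of 0 (normal) or 1 (anomalous), one per frame (assumed granularity).
    scores: iterable of anomaly scores, higher meaning more anomalous.
    """
    pairs = sorted(zip(scores, labels))
    n_pos = sum(1 for _, y in pairs if y == 1)
    n_neg = len(pairs) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative label")

    rank_sum_pos = 0.0
    i = 0
    while i < len(pairs):
        # Find the run [i, j) of entries sharing the same score.
        j = i
        while j < len(pairs) and pairs[j][0] == pairs[i][0]:
            j += 1
        # 1-based ranks i+1 .. j are shared; each tied entry gets their average.
        avg_rank = (i + 1 + j) / 2.0
        positives_in_run = sum(1 for k in range(i, j) if pairs[k][1] == 1)
        rank_sum_pos += avg_rank * positives_in_run
        i = j

    # U statistic for the positive class, normalised by the number of pos/neg pairs.
    u = rank_sum_pos - n_pos * (n_pos + 1) / 2.0
    return u / (n_pos * n_neg)


# Toy example (hypothetical data): four frames, two of them anomalous.
frame_labels = [0, 0, 1, 1]
frame_scores = [0.1, 0.4, 0.35, 0.8]
auc = roc_auc(frame_labels, frame_scores)
print(f"AUC = {100 * auc:.2f}%")
```

An AUC of 0.5 corresponds to random scoring and 1.0 to a perfect ranking of anomalous frames above normal ones; the abstract reports the value as a percentage.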

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
MSC classes:	68-02, 68-04, 68-06, 68T07, 68T10, 68T45
ACM classes:	F.1.1
Cite as:	arXiv:2302.10719 [cs.CV]
	(or arXiv:2302.10719v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2302.10719

Submission history

From: Leonardo Rossi PhD
[v1] Tue, 21 Feb 2023 15:14:27 UTC (1,428 KB)
[v2] Wed, 27 Sep 2023 13:14:41 UTC (1,526 KB)
