Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.06310 (cs)

[Submitted on 27 Apr 2023 (v1), last revised 18 Nov 2024 (this version, v4)]

Title:SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition

Authors:Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu

Abstract:This paper introduces a novel approach to Social Group Activity Recognition (SoGAR) using Self-supervised Transformers network that can effectively utilize unlabeled video data. To extract spatio-temporal information, we created local and global views with varying frame rates. Our self-supervised objective ensures that features extracted from contrasting views of the same video were consistent across spatio-temporal domains. Our proposed approach is efficient in using transformer-based encoders to alleviate the weakly supervised setting of group activity recognition. By leveraging the benefits of transformer models, our approach can model long-term relationships along spatio-temporal dimensions. Our proposed SoGAR method achieved state-of-the-art results on three group activity recognition benchmarks, namely JRDB-PAR, NBA, and Volleyball datasets, surpassing the current numbers in terms of F1-score, MCA, and MPCA metrics.

Comments:	Under review for IEEE Access journal; 12 pages, 7 figures. arXiv admin note: text overlap with arXiv:2303.12149
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.06310 [cs.CV]
	(or arXiv:2305.06310v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.06310

Submission history

From: Naga Venkata Sai Raviteja Chappa [view email]
[v1] Thu, 27 Apr 2023 03:41:15 UTC (11,864 KB)
[v2] Mon, 15 May 2023 21:29:46 UTC (11,864 KB)
[v3] Mon, 28 Aug 2023 14:18:25 UTC (11,864 KB)
[v4] Mon, 18 Nov 2024 19:03:35 UTC (17,023 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators