Computer Science > Computer Vision and Pattern Recognition

arXiv:2207.09373 (cs)

[Submitted on 19 Jul 2022 (v1), last revised 30 Aug 2022 (this version, v3)]

Title:Multi-Task Learning Framework for Emotion Recognition in-the-wild

Authors:Tenggan Zhang, Chuanhe Liu, Xiaolong Liu, Yuchen Liu, Liyu Meng, Lei Sun, Wenqiang Jiang, Fengyuan Zhang, Jinming Zhao, Qin Jin

View PDF

Abstract:This paper presents our system for the Multi-Task Learning (MTL) Challenge in the 4th Affective Behavior Analysis in-the-wild (ABAW) competition. We explore the research problems of this challenge from three aspects: 1) For obtaining efficient and robust visual feature representations, we propose MAE-based unsupervised representation learning and IResNet/DenseNet-based supervised representation learning methods; 2) Considering the importance of temporal information in videos, we explore three types of sequential encoders to capture the temporal information, including the encoder based on transformer, the encoder based on LSTM, and the encoder based on GRU; 3) For modeling the correlation between these different tasks (i.e., valence, arousal, expression, and AU) for multi-task affective analysis, we first explore the dependency between these different tasks and propose three multi-task learning frameworks to model the correlations effectively. Our system achieves the performance of $1.7607$ on the validation dataset and $1.4361$ on the test dataset, ranking first in the MTL Challenge. The code is available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2207.09373 [cs.CV]
	(or arXiv:2207.09373v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.09373

Submission history

From: Tenggan Zhang [view email]
[v1] Tue, 19 Jul 2022 16:18:53 UTC (67 KB)
[v2] Mon, 25 Jul 2022 02:47:39 UTC (67 KB)
[v3] Tue, 30 Aug 2022 06:54:14 UTC (132 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-Task Learning Framework for Emotion Recognition in-the-wild

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-Task Learning Framework for Emotion Recognition in-the-wild

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators