Computer Science > Computation and Language

arXiv:2312.10493 (cs)

[Submitted on 16 Dec 2023 (v1), last revised 19 Dec 2023 (this version, v2)]

Title:Debiasing Multimodal Sarcasm Detection with Contrastive Learning

Authors:Mengzhao Jia, Can Xie, Liqiang Jing

Abstract:Despite commendable achievements made by existing work, prevailing multimodal sarcasm detection studies rely more on textual content over visual information. It unavoidably induces spurious correlations between textual words and labels, thereby significantly hindering the models' generalization capability. To address this problem, we define the task of out-of-distribution (OOD) multimodal sarcasm detection, which aims to evaluate models' generalizability when the word distribution is different in training and testing settings. Moreover, we propose a novel debiasing multimodal sarcasm detection framework with contrastive learning, which aims to mitigate the harmful effect of biased textual factors for robust OOD generalization. In particular, we first design counterfactual data augmentation to construct the positive samples with dissimilar word biases and negative samples with similar word biases. Subsequently, we devise an adapted debiasing contrastive learning mechanism to empower the model to learn robust task-relevant features and alleviate the adverse effect of biased words. Extensive experiments show the superiority of the proposed framework.

Subjects:	Computation and Language (cs.CL); Multimedia (cs.MM)
Cite as:	arXiv:2312.10493 [cs.CL]
	(or arXiv:2312.10493v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2312.10493

Submission history

From: Mengzhao Jia [view email]
[v1] Sat, 16 Dec 2023 16:14:50 UTC (1,835 KB)
[v2] Tue, 19 Dec 2023 15:55:23 UTC (1,834 KB)

Computer Science > Computation and Language

Title:Debiasing Multimodal Sarcasm Detection with Contrastive Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Debiasing Multimodal Sarcasm Detection with Contrastive Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators