Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.19316 (cs)

[Submitted on 27 Jun 2024 (v1), last revised 21 Jul 2024 (this version, v2)]

Title:Enhanced Data Transfer Cooperating with Artificial Triplets for Scene Graph Generation

Authors:KuanChao Chu, Satoshi Yamazaki, Hideki Nakayama

Abstract:This work focuses on training dataset enhancement of informative relational triplets for Scene Graph Generation (SGG). Due to the lack of effective supervision, the current SGG model predictions perform poorly for informative relational triplets with inadequate training samples. Therefore, we propose two novel training dataset enhancement modules: Feature Space Triplet Augmentation (FSTA) and Soft Transfer. FSTA leverages a feature generator trained to generate representations of an object in relational triplets. The biased prediction based sampling in FSTA efficiently augments artificial triplets focusing on the challenging ones. In addition, we introduce Soft Transfer, which assigns soft predicate labels to general relational triplets to make more supervisions for informative predicate classes effectively. Experimental results show that integrating FSTA and Soft Transfer achieve high levels of both Recall and mean Recall in Visual Genome dataset. The mean of Recall and mean Recall is the highest among all the existing model-agnostic methods.

Comments:	Accepted to IEICE Transactions on Information and Systems in April 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.19316 [cs.CV]
	(or arXiv:2406.19316v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.19316

Submission history

From: KuanChao Chu [view email]
[v1] Thu, 27 Jun 2024 16:52:01 UTC (2,261 KB)
[v2] Sun, 21 Jul 2024 13:01:49 UTC (2,261 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Enhanced Data Transfer Cooperating with Artificial Triplets for Scene Graph Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Enhanced Data Transfer Cooperating with Artificial Triplets for Scene Graph Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators