Computer Science > Computer Vision and Pattern Recognition

arXiv:2205.08515 (cs)

[Submitted on 17 May 2022 (v1), last revised 25 Jul 2022 (this version, v2)]

Title: Unsupervised Segmentation in Real-World Images via Spelke Object Inference

Authors: Honglin Chen, Rahul Venkatesh, Yoni Friedman, Jiajun Wu, Joshua B. Tenenbaum, Daniel L. K. Yamins, Daniel M. Bear

Abstract: Self-supervised, category-agnostic segmentation of real-world images is a challenging open problem in computer vision. Here, we show how to learn static grouping priors from motion self-supervision by building on the cognitive science concept of a Spelke Object: a set of physical stuff that moves together. We introduce the Excitatory-Inhibitory Segment Extraction Network (EISEN), which learns to extract pairwise affinity graphs for static scenes from motion-based training signals. EISEN then produces segments from affinities using a novel graph propagation and competition network. During training, objects that undergo correlated motion (such as robot arms and the objects they move) are decoupled by a bootstrapping process: EISEN explains away the motion of objects it has already learned to segment. We show that EISEN achieves a substantial improvement in the state of the art for self-supervised image segmentation on challenging synthetic and real-world robotics datasets.
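
To make the idea of turning pairwise affinities into segments concrete, the toy sketch below groups nodes by plain min-label propagation over a thresholded affinity graph. It is not EISEN's learned propagation and competition network, which the abstract does not specify in detail. The function name `propagate_labels`, the `threshold` parameter, and the hand-written 4-node affinity matrix are all assumptions made for illustration.

```python
def propagate_labels(affinity, threshold=0.5, max_iters=100):
    """Group nodes by min-label propagation over a thresholded affinity graph.

    Illustrative stand-in only: a simple connected-components routine,
    not the learned network described in the paper.
    """
    n = len(affinity)
    labels = list(range(n))  # every node starts as its own segment
    for _ in range(max_iters):
        new_labels = labels[:]
        for i in range(n):
            for j in range(n):
                # Adopt the smallest label among strongly connected neighbors.
                if affinity[i][j] > threshold and labels[j] < new_labels[i]:
                    new_labels[i] = labels[j]
        if new_labels == labels:  # converged: no label changed this round
            break
        labels = new_labels
    return labels


# Hypothetical affinities: nodes 0-1 are strongly linked, as are nodes 2-3.
affinity = [
    [1.0, 0.9, 0.1, 0.0],
    [0.9, 1.0, 0.2, 0.1],
    [0.1, 0.2, 1.0, 0.8],
    [0.0, 0.1, 0.8, 1.0],
]
segments = propagate_labels(affinity)
print(segments)
```

Nodes that end up sharing a label form one segment. In this toy version the affinities are fixed by hand, whereas the abstract describes affinities that are predicted from a static image and learned from motion-based training signals.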

Comments:	25 pages, 10 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
ACM classes:	I.2.10; I.4.8
Cite as:	arXiv:2205.08515 [cs.CV]
	(or arXiv:2205.08515v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.08515

Submission history

From: Honglin Chen
[v1] Tue, 17 May 2022 17:39:24 UTC (42,959 KB)
[v2] Mon, 25 Jul 2022 16:24:49 UTC (33,344 KB)
