R-SQAIR: Relational Sequential Attend, Infer, Repeat
classification
💻 cs.LG
stat.ML
keywords
relationalsequentialattentioninferr-sqairattendbiascombinatorial
read the original abstract
Traditional sequential multi-object attention models rely on a recurrent mechanism to infer object relations. We propose a relational extension (R-SQAIR) of one such attention model (SQAIR) by endowing it with a module with strong relational inductive bias that computes in parallel pairwise interactions between inferred objects. Two recently proposed relational modules are studied on tasks of unsupervised learning from videos. We demonstrate gains over sequential relational mechanisms, also in terms of combinatorial generalization.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
3D-DLP: Self-Supervised 3D Object-Centric Scene Representation Learning
3D-DLP decomposes 3D scenes into controllable latent particles via self-supervised reconstruction for improved robotic tasks.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.