S-jepa: Towards seamless cross-dataset transfer through dy- namic spatial attention.arXiv preprint arXiv:2403.11772

Pierre Guetschel, Thomas Moreau, Michael Tangermann · arXiv 2403.11772

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

cs.CV · 2025-12-15 · unverdicted · novelty 7.0

RVM uses recurrent computation inside a masked autoencoder to learn video representations that match or exceed prior video and image models on classification, tracking, and dense spatial tasks with up to 30x better parameter efficiency.

citing papers explorer

Showing 1 of 1 citing paper.

Recurrent Video Masked Autoencoders cs.CV · 2025-12-15 · unverdicted · none · ref 33
RVM uses recurrent computation inside a masked autoencoder to learn video representations that match or exceed prior video and image models on classification, tracking, and dense spatial tasks with up to 30x better parameter efficiency.

S-jepa: Towards seamless cross-dataset transfer through dy- namic spatial attention.arXiv preprint arXiv:2403.11772

fields

years

verdicts

representative citing papers

citing papers explorer