Arvideo: Autoregressive pretraining for self-supervised video representation learning. arXiv preprint arXiv:2405.15160, 2024

Ren, S · 2024 · arXiv 2405.15160

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Generative Event Pretraining with Foundation Model Alignment

cs.CV · 2026-03-24 · unverdicted · novelty 6.0

GEP transfers semantic knowledge from image foundation models to event data via alignment and generative pretraining on mixed sequences to create transferable event-based visual models.

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

cs.CV · 2024-12-10 · unverdicted · novelty 6.0

Motion-aware contrastive learning on mask tubes improves temporal panoptic scene graph generation over pooling-based methods on video and 4D datasets.

citing papers explorer

Showing 2 of 2 citing papers.

Generative Event Pretraining with Foundation Model Alignment

cs.CV · 2026-03-24 · unverdicted · none · ref 48

GEP transfers semantic knowledge from image foundation models to event data via alignment and generative pretraining on mixed sequences to create transferable event-based visual models.

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

cs.CV · 2024-12-10 · unverdicted · none · ref 33

Motion-aware contrastive learning on mask tubes improves temporal panoptic scene graph generation over pooling-based methods on video and 4D datasets.
