pith. sign in

Localizing objects with self-supervised transformers and no labels

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

fields

cs.CV 8

years

2026 7 2025 1

representative citing papers

Registers Matter for Pixel-Space Diffusion Transformers

cs.CV · 2026-05-15 · unverdicted · novelty 6.0

Register tokens enhance pixel-space DiT training and output quality via cleaner high-noise feature maps, and a dual-stream design adds further gains with little overhead.

Vision Transformers Need More Than Registers

cs.CV · 2026-02-25 · unverdicted · novelty 6.0

ViTs exhibit lazy aggregation by relying on irrelevant background patches for global semantics, and selectively integrating patch features into the CLS token reduces this effect and improves results across label-, text-, and self-supervision.

citing papers explorer

Showing 8 of 8 citing papers.