Attention sinks in diffusion language models

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1 · baseline: 1 · other: 1

years

2026: 7

representative citing papers

Registers Matter for Pixel-Space Diffusion Transformers

cs.CV · 2026-05-15 · unverdicted · novelty 6.0

Register tokens enhance pixel-space DiT training and output quality via cleaner high-noise feature maps, and a dual-stream design adds further gains with little overhead.
