pith. sign in

Flash A ttention-3: Fast and accurate attention with asynchrony and low-precision

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.CL 1 cs.LG 1

years

2026 2

clear filters

representative citing papers

Why Attend to Everything? Focus is the Key

cs.CL · 2026-03-12 · conditional · novelty 6.0

Focus learns a few centroids to gate long-range token attention, producing sparse attention that matches or beats full attention quality with up to 8.6x speedup at million-token lengths.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.