pith. sign in

hub

FlashAttention-2: Faster attention with better parallelism and work partitioning

10 Pith papers cite this work. Polarity classification is still indexing.

10 Pith papers citing it

hub tools

citation-role summary

method 3 background 1

citation-polarity summary

years

2026 8 2025 2

representative citing papers

OmniDrop: Layer-wise Token Pruning for Omni-modal LLMs via Query-Guidance

cs.AI · 2026-05-14 · unverdicted · novelty 6.0

OmniDrop is a training-free layer-wise token pruning framework for omni-modal LLMs that uses query guidance and temporal diversity to reduce prefill latency by up to 40% and memory by 14.7% while improving benchmark scores by up to 3.58 points.

Step-Audio-R1.5 Technical Report

eess.AS · 2026-04-28 · unverdicted · novelty 4.0

Step-Audio-R1.5 applies RLHF to audio reasoning models to escape the verifiable reward trap of RLVR, preserving analytical ability while restoring prosodic naturalness and immersion in long dialogues.

citing papers explorer

Showing 10 of 10 citing papers.