pith. machine review for the scientific record.

hub

Eagle-3: Scaling up inference acceleration of large language models via training-time test

18 Pith papers cite this work. Polarity classification is still indexing.

hub tools

years: 2026 (18)

verdicts: unverdicted (18)

representative citing papers

Test-Time Speculation

cs.CL · 2026-05-10 · unverdicted · novelty 7.0

Test-Time Speculation adapts draft models online via target-model verifications to sustain high acceptance lengths during long LLM generations.

CASCADE: Context-Aware Relaxation for Speculative Image Decoding

cs.CV · 2026-05-08 · unverdicted · novelty 6.0

CASCADE formalizes semantic interchangeability and convergence in target model representations to enable context-aware acceptance relaxation in tree-based speculative decoding, delivering up to 3.6x speedup on text-to-image models without quality loss.

RACER: Retrieval-Augmented Contextual Rapid Speculative Decoding

cs.CL · 2026-04-16 · unverdicted · novelty 6.0

RACER unifies retrieval of exact matching patterns with logit-driven cues to produce better speculative drafts, achieving more than 2x speedup over autoregressive decoding and outperforming prior training-free speculative decoding methods.
