Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Canonical reference. 80% of citing Pith papers cite this work as background.

16 Pith papers citing it
Background: 80% of classified citations

citation-role summary

background: 3, method: 2

representative citing papers

Tutti: Making SSD-Backed KV Cache Practical for Long-Context LLM Serving

cs.OS · 2026-05-05 · unverdicted · novelty 7.0

Tutti is a GPU-direct SSD-backed KV cache that removes CPU bottlenecks via object abstraction, GPU io_uring, and slack scheduling, delivering near-DRAM performance at 2x higher request rate and 27% lower cost than prior GDS-based systems.
