pith. sign in

Apiserve: Efficient api support for large-language model inferencing

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

background 1 baseline 1

citation-polarity summary

years

2026 5 2023 1

representative citing papers

A Policy-Driven Runtime Layer for Agentic LLM Serving

cs.AI · 2026-05-26 · unverdicted · novelty 7.0

Introduces a three-tier architecture with an agent runtime layer and four primitives for agent-aware policies in LLM serving, validated on KV caching via CacheSage showing 13-37pp hit-rate gains on five workloads.

citing papers explorer

Showing 6 of 6 citing papers.