Title resolution pending

Gopal Kakivaya, Lu Xun, Richard Hasha, Shegufta Bakht Ahsan, Todd Pfleiger, Rishi Sinha, Anurag Gupta, Mihail Tarta, Mark Fussell, Vipul Modi, Mansoor Mohsin, Ray Kong, Anmol Ahuja, Oana Platon, Alex Wun, Matthew Snider, Chacko Daniel, Da · 2018 · arXiv 0508.319054

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Offloading L7 Policies to the Kernel

cs.NI · 2026-05-29 · unverdicted · novelty 6.0

L7FP synthesizes eBPF data planes to enforce the majority of L7 policies in the kernel for service meshes, delivering up to 6x lower median latency and 3x higher throughput than state-of-the-art proxies while falling back for unsupported policies.

BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching

cs.CL · 2024-11-29 · unverdicted · novelty 6.0

BatchLLM achieves 1.3x-10.8x higher throughput than vLLM and SGLang for batched LLM inference with prefix sharing via global prefix identification, decoding-first reordering, and memory-centric token batching.

citing papers explorer

Offloading L7 Policies to the Kernel · cs.NI · 2026-05-29 · unverdicted · none · ref 36
BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching · cs.CL · 2024-11-29 · unverdicted · none · ref 10
