NetKV is a network-aware O(|D|) greedy scheduler for decode instance selection that reduces mean TTFT by up to 21.2% versus round-robin and 17.6% versus cache+load baselines in 64-GPU fat-tree simulations.
Data center tcp (dctcp),
2 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 2representative citing papers
Two-year empirical study of 472 IXPs finds 49.2% global traffic growth, stable utilization rates, regionally distinct patterns, and high self-similarity, establishing IXP statistics as a robust proxy for overall Internet dynamics.
citing papers explorer
-
NetKV: Network-Aware Decode Instance Selection for Disaggregated LLM Inference
NetKV is a network-aware O(|D|) greedy scheduler for decode instance selection that reduces mean TTFT by up to 21.2% versus round-robin and 17.6% versus cache+load baselines in 64-GPU fat-tree simulations.
-
Five Blind Men and the Internet: Towards an Understanding of Internet Traffic
Two-year empirical study of 472 IXPs finds 49.2% global traffic growth, stable utilization rates, regionally distinct patterns, and high self-similarity, establishing IXP statistics as a robust proxy for overall Internet dynamics.