Jenga: Effective memory management for serving LLM with heterogeneity

Chen Zhang, Kuntai Du, Shu Liu, Woosuk Kwon, Xiangxi Mo, Yufeng Wang, Xiaoxuan Liu, Kaichao You, Zhuohan Li, Mingsheng Long, Jidong Zhai, Joseph Gonzalez, Ion Stoica · 2025 · arXiv 1569.376482

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 1

citation-polarity summary: background 1

representative citing papers

TIDAL: Recovering Temporal Phase for Cloud Block Storage Placement from LLM-Derived Semantics

cs.OS · 2026-05-18 · unverdicted · novelty 7.0

TIDAL recovers temporal phase signals from LLM-derived semantics of provisioning metadata to enable complementary CVD placement, reducing overload frequency by 79.1% on production traces.

Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

SAECache uses a multi-queue semantic-aware eviction policy with fully adaptive online learning to improve TTFT by 1.4x-2.7x over LRU-style baselines in LLM prefix caching.

BatchWeave: A Consistent Object-Store-Native Data Plane for Large Foundation Model Training

cs.DC · 2026-05-11 · unverdicted · novelty 7.0 · 2 refs

BatchWeave delivers an object-store-native data plane for distributed large foundation model training via transactional global batches and a decentralized adaptive commit algorithm.

FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning

cs.SE · 2026-04-13 · unverdicted · novelty 7.0 · 2 refs

FM-Agent is the first framework to automate compositional Hoare reasoning for large systems by having LLMs derive natural-language function specs from caller intent and then generate tests that found 522 new bugs in systems up to 143k lines of code.

AnyPoC: Universal Proof-of-Concept Test Generation for Scalable LLM-Based Bug Detection

cs.SE · 2026-04-13 · conditional · novelty 6.0

AnyPoC introduces a multi-agent system for generating and validating PoC tests from LLM bug reports, producing 1.3x more valid PoCs, rejecting 9.8x more false positives, and discovering 122 new bugs across 12 major projects.

Search-Based Software Engineering and AI Foundation Models: Current Landscape and Future Roadmap

cs.SE · 2025-05-26 · unverdicted · novelty 4.0

A research roadmap analyzing the current state of search-based software engineering with foundation models, outlining challenges and directions across three integration aspects.

citing papers explorer

TIDAL: Recovering Temporal Phase for Cloud Block Storage Placement from LLM-Derived Semantics

cs.OS · 2026-05-18 · unverdicted · none · ref 91

Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

cs.LG · 2026-05-12 · unverdicted · none · ref 24

BatchWeave: A Consistent Object-Store-Native Data Plane for Large Foundation Model Training

cs.DC · 2026-05-11 · unverdicted · none · ref 21 · 2 links

FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning

cs.SE · 2026-04-13 · unverdicted · none · ref 14 · 2 links

AnyPoC: Universal Proof-of-Concept Test Generation for Scalable LLM-Based Bug Detection

cs.SE · 2026-04-13 · conditional · none · ref 63

Search-Based Software Engineering and AI Foundation Models: Current Landscape and Future Roadmap

cs.SE · 2025-05-26 · unverdicted · none · ref 225
