InProceedings of the 6th International Conference on Computer Science and Management Technology

Predictive-LoRA: A proactive, fragmentation-aware serverless inference system for LLMs

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

POLAR: Online Learning for LoRA Adapter Caching and Routing in Edge LLM Serving

cs.LG · 2026-04-17 · unverdicted · novelty 7.0

POLAR formulates joint LoRA adapter caching and routing as a two-timescale contextual bandit, achieving sublinear regret bounds and outperforming non-adaptive baselines in experiments with real adapters.

citing papers explorer

Showing 1 of 1 citing paper.

POLAR: Online Learning for LoRA Adapter Caching and Routing in Edge LLM Serving cs.LG · 2026-04-17 · unverdicted · none · ref 14
POLAR formulates joint LoRA adapter caching and routing as a two-timescale contextual bandit, achieving sublinear regret bounds and outperforming non-adaptive baselines in experiments with real adapters.

InProceedings of the 6th International Conference on Computer Science and Management Technology

fields

years

verdicts

representative citing papers

citing papers explorer