
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow

10 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

baseline: 2 · background: 1

citation-polarity summary

years

2026: 9 · 2025: 1

verdicts

unverdicted: 10

representative citing papers

Feedback-Driven Execution for LLM-Based Binary Analysis

cs.CR · 2026-04-16 · unverdicted · novelty 7.0

FORGE uses a reasoning-action-observation loop and a Dynamic Forest of Agents to perform scalable LLM-based binary analysis. It finds 1,274 vulnerabilities across 591 of 3,457 real-world firmware binaries at 72.3% precision, with broader coverage than prior methods.

PALS: Power-Aware LLM Serving for Mixture-of-Experts Models

cs.AI · 2026-05-20 · unverdicted · novelty 6.0

PALS adds dynamic GPU power capping to LLM serving frameworks such as vLLM, tuning the power cap jointly with batch size via offline models and feedback control. It improves energy efficiency by up to 26.3% and cuts QoS violations by 4-7x on dense and MoE models.
