Available from: https://arxiv.org/abs/2301.00407

Huaizheng Zhang, Yuanming Li, Wencong Xiao, Yizheng Huang, Xing Di, Jianxiong Yin, Simon See, Yong Luo, Chiew Tong Lau, Yang You · 2023 · arXiv 2301.00407

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

Energy-Aware Scheduling for Serverless LLM Serving on Shared GPUs

cs.DC · 2026-06-29 · unverdicted · novelty 4.0

Festina reduces energy consumption by up to 56% for serverless LLM inference on shared GPUs while keeping TTFT/TBT SLO attainment within 2% of four state-of-the-art baselines.

SMART-MIG: A Learning Framework for Scalable and Energy-Efficient GPU Scheduling

cs.DC · 2026-06-29 · unverdicted · novelty 3.0

SMART-MIG applies MF-MARL for constant-complexity MIG repartitioning plus heuristics for scheduling, reporting 18% better energy-tardiness efficiency than static partitioning and 27% above a theoretical energy lower bound.

CompPow: A Case for Component-level GPU Power Management

cs.AR · 2026-05-21 · unverdicted · novelty 3.0

CompPow makes the case that component-aware power management inside GPUs can yield 10% higher energy efficiency and 5% better performance for ML workloads.

A comprehensive evaluation of spatial co-execution on GPUs using MPS and MIG technologies

cs.DC · 2026-04-24 · unverdicted · novelty 3.0

MPS can boost performance up to 30% and cut energy 20% with careful provisioning but degrades sharply under memory contention, whereas MIG delivers steadier gains through hardware isolation at the cost of higher overhead and occasional performance losses.

citing papers explorer

Showing 4 of 4 citing papers.

Energy-Aware Scheduling for Serverless LLM Serving on Shared GPUs cs.DC · 2026-06-29 · unverdicted · none · ref 65
Festina reduces energy consumption by up to 56% for serverless LLM inference on shared GPUs while keeping TTFT/TBT SLO attainment within 2% of four state-of-the-art baselines.
SMART-MIG: A Learning Framework for Scalable and Energy-Efficient GPU Scheduling cs.DC · 2026-06-29 · unverdicted · none · ref 25
SMART-MIG applies MF-MARL for constant-complexity MIG repartitioning plus heuristics for scheduling, reporting 18% better energy-tardiness efficiency than static partitioning and 27% above a theoretical energy lower bound.
CompPow: A Case for Component-level GPU Power Management cs.AR · 2026-05-21 · unverdicted · none · ref 38
CompPow makes the case that component-aware power management inside GPUs can yield 10% higher energy efficiency and 5% better performance for ML workloads.
A comprehensive evaluation of spatial co-execution on GPUs using MPS and MIG technologies cs.DC · 2026-04-24 · unverdicted · none · ref 39
MPS can boost performance up to 30% and cut energy 20% with careful provisioning but degrades sharply under memory contention, whereas MIG delivers steadier gains through hardware isolation at the cost of higher overhead and occasional performance losses.

Available from: https://arxiv.org/abs/2301.00407

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer