pith. sign in

Coordinated Power Management on Heterogeneous Systems

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

Performance prediction is essential for energy-efficient computing in heterogeneous computing systems that integrate CPUs and GPUs. However, traditional performance modeling methods often rely on exhaustive offline profiling, which becomes impractical due to the large setting space and the high cost of profiling large-scale applications. In this paper, we present OPEN, a framework consists of offline and online phases. The offline phase involves building a performance predictor and constructing an initial dense matrix. In the online phase, OPEN performs lightweight online profiling, and leverages the performance predictor with collaborative filtering to make performance prediction. We evaluate OPEN on multiple heterogeneous systems, including those equipped with A100 and A30 GPUs. Results show that OPEN achieves prediction accuracy up to 98.29\%. This demonstrates that OPEN effectively reduces profiling cost while maintaining high accuracy, making it practical for power-aware performance modeling in modern HPC environments. Overall, OPEN provides a lightweight solution for performance prediction under power constraints, enabling better runtime decisions in power-aware computing environments.

fields

cs.DC 2

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

Towards Energy Efficient Co-Scheduling in HPC

cs.DC · 2026-04-19 · unverdicted · novelty 5.0

EcoSched jointly selects GPU counts via lightweight profiling and coschedules jobs with a score-based policy plus NUMA placement, delivering up to 14.8% energy savings, 30.1% makespan reduction, and 40.4% EDP improvement on H100/A100/V100 systems.

citing papers explorer

Showing 2 of 2 citing papers.

  • Towards Energy Efficient Co-Scheduling in HPC cs.DC · 2026-04-19 · unverdicted · none · ref 30 · internal anchor

    EcoSched jointly selects GPU counts via lightweight profiling and coschedules jobs with a score-based policy plus NUMA placement, delivering up to 14.8% energy savings, 30.1% makespan reduction, and 40.4% EDP improvement on H100/A100/V100 systems.

  • EcoShift: Performance-Aware Power Management for Power-Constrained Heterogeneous Systems cs.DC · 2026-04-19 · unverdicted · none · ref 42 · internal anchor

    EcoShift uses online performance prediction plus dynamic programming to reallocate reclaimed power in heterogeneous CPU-GPU clusters, delivering up to 6% average performance gain while staying inside the total power limit.