In: 2025 IEEE Inter- national Symposium on Performance Analysis of Systems and Software (ISPASS), pp

Agrawal, A · 2025 · arXiv 4960.2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Resource-aware Computation-Communication Overlap for multi-GPU ML Workloads

cs.DC · 2026-06-08 · unverdicted · novelty 4.0

A method using shared-memory occupancy shaping and elevated communication priority achieves up to 25.5% faster multi-GPU ML execution on NVIDIA and AMD GPUs.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Resource-aware Computation-Communication Overlap for multi-GPU ML Workloads cs.DC · 2026-06-08 · unverdicted · none · ref 8
A method using shared-memory occupancy shaping and elevated communication priority achieves up to 25.5% faster multi-GPU ML execution on NVIDIA and AMD GPUs.

In: 2025 IEEE Inter- national Symposium on Performance Analysis of Systems and Software (ISPASS), pp

fields

years

verdicts

representative citing papers

citing papers explorer