A decentralized virtual processor automatically exploits parallelism in array programs through local cooperative decisions without a central scheduler.
In: 2012 International Conference for High Performance Computing, Networking, Storage and Analysis
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
SET is a new CUDA runtime framework that combines event-chaining, work-stealing, and per-stream buffers in graph-based pipelines to deliver 1.15-1.44X speedups and 18-54% lower scheduling overhead versus prior CUDA graph methods.
citing papers explorer
-
A Virtual Processor brings back the Free Lunch
A decentralized virtual processor automatically exploits parallelism in array programs through local cooperative decisions without a central scheduler.
-
SET: Stream-Event-Triggered Scheduling for Efficient CUDA Graph Pipelines
SET is a new CUDA runtime framework that combines event-chaining, work-stealing, and per-stream buffers in graph-based pipelines to deliver 1.15-1.44X speedups and 18-54% lower scheduling overhead versus prior CUDA graph methods.