Accessed February 2026

Explicit GPU API designed to reduce CPU overhead vs OpenGL · 2026

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three Browsers

cs.LG · 2026-02-09 · unverdicted · novelty 7.0

WebGPU dispatch overhead for batch-1 LLM inference is 24-71 microseconds per operation depending on backend, dominating performance, with a sequential-dispatch method revealing that naive benchmarks overestimate by about 20 times.

citing papers explorer

Showing 1 of 1 citing paper.

Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three Browsers cs.LG · 2026-02-09 · unverdicted · none · ref 2
WebGPU dispatch overhead for batch-1 LLM inference is 24-71 microseconds per operation depending on backend, dominating performance, with a sequential-dispatch method revealing that naive benchmarks overestimate by about 20 times.

Accessed February 2026

fields

years

verdicts

representative citing papers

citing papers explorer