Think fast: A tensor streaming processor (tsp) for accelerating deep learning workloads

· 2020

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

The xPU-athalon: Quantifying the Competition of AI Acceleration

cs.AR · 2026-04-12 · unverdicted · novelty 6.0

Quantitative benchmarks across recent AI accelerators reveal that optimal hardware choice varies with workload parameters and that several platforms incur substantially higher idle power than GPUs.

M100: An Orchestrated Dataflow Architecture Powering General AI Computing

cs.LG · 2026-04-20 · unverdicted · novelty 5.0

M100 is a tensor-based dataflow architecture that eliminates heavy caching through compiler-managed data streams, claiming higher utilization and better performance than GPGPUs for AD and LLM inference tasks.

citing papers explorer

Showing 2 of 2 citing papers.

The xPU-athalon: Quantifying the Competition of AI Acceleration cs.AR · 2026-04-12 · unverdicted · none · ref 2
Quantitative benchmarks across recent AI accelerators reveal that optimal hardware choice varies with workload parameters and that several platforms incur substantially higher idle power than GPUs.
M100: An Orchestrated Dataflow Architecture Powering General AI Computing cs.LG · 2026-04-20 · unverdicted · none · ref 13
M100 is a tensor-based dataflow architecture that eliminates heavy caching through compiler-managed data streams, claiming higher utilization and better performance than GPGPUs for AD and LLM inference tasks.

Think fast: A tensor streaming processor (tsp) for accelerating deep learning workloads

fields

years

verdicts

representative citing papers

citing papers explorer