FTTN: Feature-Targeted Testing for Numerical Properties of NVIDIA & AMD Matrix Accelerators

Wenhai Lin, Yiquan Chen, Jiexiong Xu, Zhen Jin, Peiyu Liu, Shishun Cai, Yuzhong Zhang, Jingchang Qin, Yiquan Lin, Wenzhi Chen · 2024 · arXiv 9990.2024

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

other 1

citation-polarity summary

unclear 1

representative citing papers

Bit-Accurate Modeling of GPU Matrix Multiply-Accumulate Units: Demystifying Numerical Discrepancy and Accuracy

cs.AR · 2025-11-14 · accept · novelty 8.0 · 2 refs

The authors derive the first bit-accurate arithmetic models for matrix multiply-accumulate operations on ten GPU architectures spanning NVIDIA Volta to Blackwell and AMD CDNA1 to CDNA3.

Eidola: Modeling Multi-GPU Network Communication Traffic in Distributed AI Workloads

cs.DC · 2026-06-10 · unverdicted · novelty 5.0

Eidola is a gem5 extension that emulates cycle-level peer-to-peer GPU writes via real-application timing profiles to simulate traffic and synchronization in multi-GPU AI systems.

KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances

cs.DC · 2026-04-27

citing papers explorer

Showing 3 of 3 citing papers.

Bit-Accurate Modeling of GPU Matrix Multiply-Accumulate Units: Demystifying Numerical Discrepancy and Accuracy cs.AR · 2025-11-14 · accept · none · ref 7 · 2 links
The authors derive the first bit-accurate arithmetic models for matrix multiply-accumulate operations on ten GPU architectures spanning NVIDIA Volta to Blackwell and AMD CDNA1 to CDNA3.
Eidola: Modeling Multi-GPU Network Communication Traffic in Distributed AI Workloads cs.DC · 2026-06-10 · unverdicted · none · ref 24
Eidola is a gem5 extension that emulates cycle-level peer-to-peer GPU writes via real-application timing profiles to simulate traffic and synchronization in multi-GPU AI systems.
KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances cs.DC · 2026-04-27 · unreviewed · ref 30

FTTN: Feature-Targeted Testing for Numerical Properties of NVIDIA & AMD Matrix Accelerators

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer