SPEC CPU 2026 presents a new benchmark suite using open-source apps, expanded multithreading, and Rolling-Round-Robin Rate to address gaps in evaluating heterogeneous multiprogrammed CPU performance.
Title resolution pending
5 Pith papers cite this work. Polarity classification is still indexing.
years
2026 5verdicts
UNVERDICTED 5representative citing papers
Packed layouts and extensions to tiling/fusion/vectorization in MLIR/IREE enable VLA ML code generation for SVE, achieving up to 1.45x speedup over NEON and outperforming PyTorch frameworks while scaling with vector length.
A specialized profiling tool using Linux perf_event samples gem5 call-stacks to expose simulated architecture behaviors such as TimingSimpleCPU inefficiencies and cache coherence deadlocks not visible in conventional stats.
Profile-guided opcode labeling removes consistently independent loads from the MDP working set, cutting queries 79%, false dependencies 77%, and raising small-core IPC 1.47% on SPEC2017 intspeed.
Akita is a decoupled simulation engine that lets developers write simple single-threaded cycle-based code while automatically delivering event-driven performance, transparent parallel execution, and built-in tracing for monitoring and visualization.
citing papers explorer
-
SPEC CPU: The Next Generation
SPEC CPU 2026 presents a new benchmark suite using open-source apps, expanded multithreading, and Rolling-Round-Robin Rate to address gaps in evaluating heterogeneous multiprogrammed CPU performance.
-
Scalable Packed Layouts for Vector-Length-Agnostic ML Code Generation
Packed layouts and extensions to tiling/fusion/vectorization in MLIR/IREE enable VLA ML code generation for SVE, achieving up to 1.45x speedup over NEON and outperforming PyTorch frameworks while scaling with vector length.
-
Understanding Simulated Architecture via gem5 Call-Stack Profiling
A specialized profiling tool using Linux perf_event samples gem5 call-stacks to expose simulated architecture behaviors such as TimingSimpleCPU inefficiencies and cache coherence deadlocks not visible in conventional stats.
-
PG-MDP: Profile-Guided Memory Dependence Prediction for Area-Constrained Cores
Profile-guided opcode labeling removes consistently independent loads from the MDP working set, cutting queries 79%, false dependencies 77%, and raising small-core IPC 1.47% on SPEC2017 intspeed.
-
Akita: A High Usability Simulation Framework for Computer Architecture
Akita is a decoupled simulation engine that lets developers write simple single-threaded cycle-based code while automatically delivering event-driven performance, transparent parallel execution, and built-in tracing for monitoring and visualization.