Roofline: an insightful visual performance model for multicore architectures · 2009

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

NasZip: Software and Hardware Co-Design to Accelerate Approximate Nearest Neighbor Search with DIMM-Based Near-Data Processing

cs.AR · 2026-05-21 · conditional · novelty 6.0

NasZip delivers up to 8.4x speedup over CPU baselines and 1.69x over prior NDP accelerators for ANNS by combining near-data processing with statistics-based PCA early exiting, dynamic-float encoding, and data-aware neighbor mapping.

SparKV: Overhead-Aware KV Cache Loading for Efficient On-Device LLM Inference

cs.NI · 2026-04-23 · unverdicted · novelty 6.0

SparKV reduces time-to-first-token by 1.3x-5.1x and energy use by 1.5x-3.3x for on-device LLM inference by adaptively choosing between cloud KV streaming and local computation while overlapping execution and adjusting for runtime conditions.

Sparsity-Aware Roofline Models for Sparse Matrix-Matrix Multiplication

cs.DC · 2026-04-08 · unverdicted · novelty 6.0

Sparsity-aware roofline models are required for accurate SpMM performance prediction because matrix structure alters arithmetic intensity and a single unified model fails across patterns like block, banded, scale-free, and random.

citing papers explorer

Showing 3 of 3 citing papers.

NasZip: Software and Hardware Co-Design to Accelerate Approximate Nearest Neighbor Search with DIMM-Based Near-Data Processing

cs.AR · 2026-05-21 · conditional · none · ref 42

SparKV: Overhead-Aware KV Cache Loading for Efficient On-Device LLM Inference

cs.NI · 2026-04-23 · unverdicted · none · ref 40

Sparsity-Aware Roofline Models for Sparse Matrix-Matrix Multiplication

cs.DC · 2026-04-08 · unverdicted · none · ref 11
