DAE4HLS enables explicit decoupling of access and execute in HLS to unlock memory-level parallelism, delivering 10-79x speedups for complex workloads on commercial and dynamic HLS tools.
Wulf and Sally A
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
LLM inference should be reframed and evaluated as energy-to-token production with a Token Production Function that accounts for power, cooling, and efficiency ceilings.
Modern benchmarks confirm that region-based custom allocators retain locality advantages over state-of-the-art general-purpose allocators, extending the original 2000 conclusions with new applications and fragmentation analysis.
citing papers explorer
-
DAE4HLS: Exposing Memory-Level Parallelism for High-Level Synthesis using Explicit Decoupling
DAE4HLS enables explicit decoupling of access and execute in HLS to unlock memory-level parallelism, delivering 10-79x speedups for complex workloads on commercial and dynamic HLS tools.
-
Position: LLM Inference Should Be Evaluated as Energy-to-Token Production
LLM inference should be reframed and evaluated as energy-to-token production with a Token Production Function that accounts for power, cooling, and efficiency ceilings.
-
Reconsidering "Reconsidering Custom Memory Allocation"
Modern benchmarks confirm that region-based custom allocators retain locality advantages over state-of-the-art general-purpose allocators, extending the original 2000 conclusions with new applications and fragmentation analysis.