LLMCompass: Enabling efficient hardware design for large language model inference

· 2025 · DOI 10.1109/isca59077

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

representative citing papers

CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure

cs.DC · 2026-05-07 · unverdicted · novelty 7.0

CCL-Bench packages traces and metadata to compute detailed compute, memory, and communication efficiency metrics, surfacing performance insights unavailable from end-to-end benchmarks.

ELMoE-3D: Leveraging Intrinsic Elasticity of MoE for Hybrid-Bonding-Enabled Self-Speculative Decoding in On-Premises Serving

cs.LG · 2026-04-16 · unverdicted · novelty 6.0

ELMoE-3D achieves 6.6x average speedup and 4.4x energy efficiency gain for MoE serving on 3D hardware by scaling expert and bit elasticity for elastic self-speculative decoding.

Energy-Aware Computing in the Year 2026

cs.DC · 2026-05-23 · unverdicted · novelty 2.0

The paper reviews energy-aware computing literature and constructs a taxonomy organized by hardware/software aspects, measurement, optimizations, scheduling, scaling, consolidation, federated learning, and cooling.

Crosstalk In Contemporary Quantum Devices

quant-ph · 2026-05-26 · unverdicted · novelty 1.0

Review synthesizing crosstalk mechanisms, mitigation strategies, and security vulnerabilities across major quantum computing platforms from existing literature.

citing papers explorer

Showing 4 of 4 citing papers.

CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure cs.DC · 2026-05-07 · unverdicted · none · ref 80
CCL-Bench packages traces and metadata to compute detailed compute, memory, and communication efficiency metrics, surfacing performance insights unavailable from end-to-end benchmarks.
ELMoE-3D: Leveraging Intrinsic Elasticity of MoE for Hybrid-Bonding-Enabled Self-Speculative Decoding in On-Premises Serving cs.LG · 2026-04-16 · unverdicted · none · ref 74
ELMoE-3D achieves 6.6x average speedup and 4.4x energy efficiency gain for MoE serving on 3D hardware by scaling expert and bit elasticity for elastic self-speculative decoding.
Energy-Aware Computing in the Year 2026 cs.DC · 2026-05-23 · unverdicted · none · ref 280
The paper reviews energy-aware computing literature and constructs a taxonomy organized by hardware/software aspects, measurement, optimizations, scheduling, scaling, consolidation, federated learning, and cooling.
Crosstalk In Contemporary Quantum Devices quant-ph · 2026-05-26 · unverdicted · none · ref 136
Review synthesizing crosstalk mechanisms, mitigation strategies, and security vulnerabilities across major quantum computing platforms from existing literature.

LLMCompass: Enabling efficient hardware design for large language model inference

fields

years

verdicts

representative citing papers

citing papers explorer