Title resolution pending

Lintang Sutawika, Hailey Schoelkopf, Leo Gao, Baber Abbasi, Stella Biderman, Jonathan Tow · 2026 · DOI 10.5281/zenodo.18636344

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

SPEAR: A System for Post-Quantization Error-Adaptive Recovery Enabling Efficient Low-Bit LLM Serving

cs.AR · 2026-06-04 · unverdicted · novelty 6.0

SPEAR places input-dependent error compensators at CKA-selected layers and fuses them into low-bit GEMMs to recover 56-75% of the W4-to-FP16 perplexity gap with <1% memory overhead and near-baseline latency.

Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models

cs.CL · 2026-06-04 · unverdicted · novelty 5.0

VSRAQ is a MoE-specific quantization objective that combines value and structure alignment to preserve expert-selection behavior and reduce quality loss without inference overhead.

citing papers explorer

Showing 2 of 2 citing papers.

SPEAR: A System for Post-Quantization Error-Adaptive Recovery Enabling Efficient Low-Bit LLM Serving cs.AR · 2026-06-04 · unverdicted · none · ref 22
SPEAR places input-dependent error compensators at CKA-selected layers and fuses them into low-bit GEMMs to recover 56-75% of the W4-to-FP16 perplexity gap with <1% memory overhead and near-baseline latency.
Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models cs.CL · 2026-06-04 · unverdicted · none · ref 24
VSRAQ is a MoE-specific quantization objective that combines value and structure alignment to preserve expert-selection behavior and reduce quality loss without inference overhead.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer