In Figure 12, Figure 13, and Figure 14, we report detailed bit allocation results from AlphaQ under a 2-bit budget, with both layer-wise and expert-wise settings

using the EleutherAI LM Harness (Gao et al · 2024 · arXiv 2068.1157

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

AlphaQ: Calibration-Free Bit Allocation for Mixture-of-Experts Quantization

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

AlphaQ performs calibration-free mixed-precision quantization of MoE models by allocating higher bits to experts whose weight spectra exhibit stronger heavy-tailed structure according to HT-SR theory, outperforming calibration-based methods and reaching near full-precision accuracy at 3.5 average bi

