Nielsen, Jacob, and Peter Schneider-Kamp

https://arxiv · arXiv 2203.11086

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Layerwise Progressive Freezing: A Training Scaffold for Depth-Scalable Binary Networks

cs.LG · 2026-06-26 · unverdicted · novelty 7.0

StoMPP progressively binarizes BNN layers layerwise from input to output via stochastic masks, delivering depth-scalable accuracy gains in a fully STE-free regime by controlling activation-induced gradient blockades.

Mapping the Schedule x Bit-Width Boundary in Sub-100M Quantisation-Aware Training

cs.LG · 2026-05-25 · unverdicted · novelty 5.0

Factorial experiments with over 1300 runs falsify the hypothesis that INT6 QAT needs a different LR schedule from higher precision and identify a 50M-parameter boundary for INT4 schedule sensitivity.

citing papers explorer

Showing 2 of 2 citing papers.

Layerwise Progressive Freezing: A Training Scaffold for Depth-Scalable Binary Networks cs.LG · 2026-06-26 · unverdicted · none · ref 41
StoMPP progressively binarizes BNN layers layerwise from input to output via stochastic masks, delivering depth-scalable accuracy gains in a fully STE-free regime by controlling activation-induced gradient blockades.
Mapping the Schedule x Bit-Width Boundary in Sub-100M Quantisation-Aware Training cs.LG · 2026-05-25 · unverdicted · none · ref 17
Factorial experiments with over 1300 runs falsify the hypothesis that INT6 QAT needs a different LR schedule from higher precision and identify a 50M-parameter boundary for INT4 schedule sensitivity.

Nielsen, Jacob, and Peter Schneider-Kamp

fields

years

verdicts

representative citing papers

citing papers explorer