arXiv preprint arXiv:1907.04840 , year=

Sparse networks from scratch: Faster training without losing performance , author= · 1907 · arXiv 1907.04840

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

cs.LG · 2022-08-15 · conditional · novelty 7.0

LLM.int8() performs 8-bit inference for transformers up to 175B parameters with no accuracy loss by combining vector-wise quantization for most features with 16-bit mixed-precision handling of systematic outlier dimensions.

Adaptive Regularization for Sparsity Control in Bregman-Based Optimizers

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

An adaptive regularization update for Bregman optimizers achieves target sparsity levels from 75% to 99% with faster early convergence and performance matching or exceeding oracle-tuned baselines.

On a stochastic column-block bregman method for nonlinear systems

math.NA · 2026-05-08 · unverdicted · novelty 5.0

A stochastic column-block nonlinear Bregman method is introduced for sparse solutions of nonlinear systems, with a proven convergence rate bound under stated assumptions.

citing papers explorer

Showing 3 of 3 citing papers.

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale cs.LG · 2022-08-15 · conditional · none · ref 15
LLM.int8() performs 8-bit inference for transformers up to 175B parameters with no accuracy loss by combining vector-wise quantization for most features with 16-bit mixed-precision handling of systematic outlier dimensions.
Adaptive Regularization for Sparsity Control in Bregman-Based Optimizers cs.LG · 2026-05-08 · unverdicted · none · ref 12
An adaptive regularization update for Bregman optimizers achieves target sparsity levels from 75% to 99% with faster early convergence and performance matching or exceeding oracle-tuned baselines.
On a stochastic column-block bregman method for nonlinear systems math.NA · 2026-05-08 · unverdicted · none · ref 3
A stochastic column-block nonlinear Bregman method is introduced for sparse solutions of nonlinear systems, with a proven convergence rate bound under stated assumptions.

arXiv preprint arXiv:1907.04840 , year=

fields

years

verdicts

representative citing papers

citing papers explorer