Oats: Outlier-aware pruning through sparse and low rank decomposition. arXiv preprint arXiv:2409.13652

12 Stephen Zhang, Vardan Papyan · arXiv 2409.13652

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

ELAS: Efficient Pre-Training of Low-Rank Large Language Models via 2:4 Activation Sparsity

cs.LG · 2026-05-05 · unverdicted · novelty 5.0

ELAS pre-trains low-rank LLMs by applying 2:4 activation sparsity after squared ReLU to cut memory and accelerate training with minimal performance loss.

citing papers explorer

Showing 1 of 1 citing paper.

ELAS: Efficient Pre-Training of Low-Rank Large Language Models via 2:4 Activation Sparsity · cs.LG · 2026-05-05 · unverdicted · none · ref 13
