Training neural networks from scratch with parallel low-rank adapters. arXiv preprint arXiv:2402.16828, 2024

Minyoung Huh, Brian Cheung, Jeremy Bernstein, Phillip Isola, Pulkit Agrawal · 2024 · arXiv 2402.16828

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Learned Subspace Compression for Communication-Efficient Pipeline Parallelism

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

MAPL learns task-specific orthogonal compression subspaces per pipeline stage via manifold-constrained optimization and recovers signals with low-overhead anchors, yielding better compression-performance tradeoffs than fixed projections on LLaMA models up to 1B parameters.

citing papers explorer

Showing 1 of 1 citing paper.

Learned Subspace Compression for Communication-Efficient Pipeline Parallelism · cs.LG · 2026-06-03 · unverdicted · none · ref 18
MAPL learns task-specific orthogonal compression subspaces per pipeline stage via manifold-constrained optimization and recovers signals with low-overhead anchors, yielding better compression-performance tradeoffs than fixed projections on LLaMA models up to 1B parameters.
