Piggyback: Adapting a single network to multiple tasks by learning to mask weights

Mallya, Arun, Davis, Dillon, Lazebnik, Svetlana , editor= · 2018 · DOI 10.1007/978-3-030-01225-0_5

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Layerwise Progressive Freezing: A Training Scaffold for Depth-Scalable Binary Networks

cs.LG · 2026-06-26 · unverdicted · novelty 7.0

StoMPP progressively binarizes BNN layers layerwise from input to output via stochastic masks, delivering depth-scalable accuracy gains in a fully STE-free regime by controlling activation-induced gradient blockades.

Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates

cs.CL · 2025-12-04 · conditional · novelty 6.0

SSU mitigates catastrophic forgetting in low-resource LLM target-language adaptation by scoring and column-wise freezing source-critical parameters, reducing source degradation to ~3% versus ~20% for full fine-tuning while matching target performance.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates cs.CL · 2025-12-04 · conditional · none · ref 55
SSU mitigates catastrophic forgetting in low-resource LLM target-language adaptation by scoring and column-wise freezing source-critical parameters, reducing source degradation to ~3% versus ~20% for full fine-tuning while matching target performance.

Piggyback: Adapting a single network to multiple tasks by learning to mask weights

fields

years

verdicts

representative citing papers

citing papers explorer