A single LoRA adapter placed at the gradient-energy-dominant shallow FFN module outperforms distributed LoRA across instruction, math, code, and conversation tasks.
Adaptive budget allocation for parameter-efficient fine-tuning
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
MoLF routes updates between full fine-tuning and LoRA at the optimizer level to match or exceed the better of the two static methods on SQL, medical QA, and counterfactual tasks while an efficient variant outperforms prior adaptive LoRA by up to 20%.
citing papers explorer
-
Rethinking Adapter Placement: A Dominant Adaptation Module Perspective
A single LoRA adapter placed at the gradient-energy-dominant shallow FFN module outperforms distributed LoRA across instruction, math, code, and conversation tasks.