TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning

· 2025 · cs.CL · arXiv 2510.07118

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Instruction tuning is essential for aligning large language models (LLMs) to downstream tasks and commonly relies on large, diverse corpora. However, small, high-quality subsets, known as coresets, can deliver comparable or superior results, though curating them remains challenging. Existing methods often rely on coarse, sample-level signals like gradients, an approach that is computationally expensive and overlooks fine-grained features. To address this, we introduce TRIM (Token Relevance via Interpretable Multi-layer Attention), a forward-only, token-centric framework. Instead of using gradients, TRIM operates by matching underlying representational patterns identified via attention-based "fingerprints" from a handful of target samples. Such an approach makes TRIM highly efficient and uniquely sensitive to the structural features that define a task. Coresets selected by our method consistently outperform state-of-the-art baselines by up to 9% on downstream tasks and even surpass the performance of full-data fine-tuning in some settings. By avoiding expensive backward passes, TRIM achieves this at a fraction of the computational cost. These findings establish TRIM as a scalable and efficient alternative for building high-quality instruction-tuning datasets.

representative citing papers

AlphaToken: Decoupling Adaptation and Stability for Path-Aware Response Token Valuation in LLM Post-Training

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

AlphaToken decouples adaptation and stability into path-aware token valuations for LLM post-training using a Fisher-drift proxy to mask low-value tokens and improve performance while reducing catastrophic forgetting.

citing papers explorer

Showing 1 of 1 citing paper after filters.

AlphaToken: Decoupling Adaptation and Stability for Path-Aware Response Token Valuation in LLM Post-Training cs.CL · 2026-06-01 · unverdicted · none · ref 21 · internal anchor
AlphaToken decouples adaptation and stability into path-aware token valuations for LLM post-training using a Fisher-drift proxy to mask low-value tokens and improve performance while reducing catastrophic forgetting.

TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning

fields

years

verdicts

representative citing papers

citing papers explorer