Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches

· 2025 · cs.CL · arXiv 2512.12677

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

We explore efficient strategies to fine-tune decoder-only Large Language Models (LLMs) for downstream text classification under resource constraints. Two approaches are investigated: (1) attaching a classification head to a pretrained causal LLM and fine-tuning it on the task, using the LLM's final-token embedding as a sequence representation, and (2) instruction-tuning the LLM in a prompt-to-response format for classification. To enable single-GPU fine-tuning of models up to 8B parameters, we combine 4-bit model quantization with Low-Rank Adaptation (LoRA) for parameter-efficient training. Experiments on two patent benchmarks, a 5-class single-label internal corpus and the public WIPO-Alpha multi-label dataset with 14 categories, show that the embedding-head approach matches or exceeds fine-tuned BERT baselines on single-label classification while training 10-30x fewer parameters. Instruction-tuning is competitive only in the multi-label regime, and only with substantially larger trainable budgets of at least 100M parameters. These results demonstrate that directly leveraging the internal representations of causal LLMs, together with efficient fine-tuning techniques, yields strong classification performance under limited computational resources. We discuss the advantages of each approach and outline practical guidelines and future directions for optimizing LLM fine-tuning in classification scenarios.

representative citing papers

When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification

cs.AI · 2026-05-22 · unverdicted · novelty 5.0

Synthetic data boosts multi-label patent classification mainly through volume in low-data regimes, with fidelity mattering more as real data increases and a 20-30% real data mix optimal under fixed budgets.

Investigating Detection and Obfuscation of Prompt Injection Attacks Against Software Reverse Engineering AI Agents

cs.CR · 2026-05-29 · unverdicted · novelty 4.0

This work examines prompt injection vulnerabilities in agentic software reverse engineering AI systems and tests detection, obfuscation, and defense techniques.

citing papers explorer

Showing 2 of 2 citing papers after filters.

When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification cs.AI · 2026-05-22 · unverdicted · none · ref 33 · internal anchor
Synthetic data boosts multi-label patent classification mainly through volume in low-data regimes, with fidelity mattering more as real data increases and a 20-30% real data mix optimal under fixed budgets.
Investigating Detection and Obfuscation of Prompt Injection Attacks Against Software Reverse Engineering AI Agents cs.CR · 2026-05-29 · unverdicted · none · ref 18 · internal anchor
This work examines prompt injection vulnerabilities in agentic software reverse engineering AI systems and tests detection, obfuscation, and defense techniques.

Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches

fields

years

verdicts

representative citing papers

citing papers explorer