Targeted Error Correction in Knowledge Distillation: Small Language Models Surpass GPT, November 2025
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: CONDITIONAL (1)

Representative citing papers:
- SHIELD: A Diverse Clinical Note Dataset and Distilled Small Language Models for Enterprise-Scale De-identification
SHIELD dataset and distilled DeBERTa v3 model achieve 0.88 micro precision and 0.86 recall on PHI de-identification while matching teacher performance on structured categories.
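The micro-averaged figures quoted above pool true positives, false positives, and false negatives across every PHI category before computing the precision and recall ratios. A minimal Python sketch of that computation, using hypothetical label names and toy data rather than anything from the SHIELD evaluation, might look like this:

```python
def micro_prf(gold, pred, outside="O"):
    """Micro-averaged precision/recall over all non-O token labels."""
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        if p != outside and p == g:
            tp += 1          # predicted a PHI label and it matches gold
        else:
            if p != outside:
                fp += 1      # predicted a PHI label that is wrong
            if g != outside and p != g:
                fn += 1      # a gold PHI label was missed or mislabeled
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall

# Toy example (hypothetical labels, not SHIELD data)
gold = ["O", "NAME", "NAME", "O", "DATE", "O"]
pred = ["O", "NAME", "O",    "O", "DATE", "DATE"]
p, r = micro_prf(gold, pred)
print(f"micro precision={p:.2f}, micro recall={r:.2f}")  # 0.67, 0.67
```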