The delta learning hypothesis: Preference tuning on weak data can yield strong gains. arXiv preprint arXiv:2507.06187, 2025

Scott Geng, Hamish Ivison, Chun-Liang Li, Maarten Sap, Jerry Li, Ranjay Krishna, Pang Wei Koh · 2025 · arXiv 2507.06187

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Weak-to-Strong Generalization is Nearly Inevitable (in Linear Models)

cs.LG · 2026-05-07 · unverdicted · novelty 8.0

Weak-to-strong generalization is nearly inevitable in linear logistic regression for most student-teacher pairs without any model capacity mismatch.

Bridging Expert Knowledge and Automated Feature Engineering via Self-Evolution

cs.AI · 2026-06-07 · unverdicted · novelty 6.0

FEST uses self-evolving trees to produce expert-aligned, auditable features from unstructured data and outperforms baselines on brand, authenticity, and stress tasks while releasing the BrandGuide dataset.

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

cs.LG · 2026-05-31 · unverdicted · novelty 5.0

Trust functions filter unreliable weak labels to enable near-lossless weak-to-strong generalization and iterative chaining.

citing papers explorer

Weak-to-Strong Generalization is Nearly Inevitable (in Linear Models)

cs.LG · 2026-05-07 · unverdicted · none · ref 36

Weak-to-strong generalization is nearly inevitable in linear logistic regression for most student-teacher pairs without any model capacity mismatch.

Bridging Expert Knowledge and Automated Feature Engineering via Self-Evolution

cs.AI · 2026-06-07 · unverdicted · none · ref 8

FEST uses self-evolving trees to produce expert-aligned, auditable features from unstructured data and outperforms baselines on brand, authenticity, and stress tasks while releasing the BrandGuide dataset.

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

cs.LG · 2026-05-31 · unverdicted · none · ref 44

Trust functions filter unreliable weak labels to enable near-lossless weak-to-strong generalization and iterative chaining.
