Let˜αl(j) := exp(sl(j))P q̸=il exp(sl(q)) be the softmax normalized only over competitors, evaluated atW (t−1)

(update of a smooth lower bound) Define the competitors’ log-sum-exp lower bound ϕl(W) :=s l(il, W)−log X j̸=il exp sl(j, W) ≤∆ l(W)

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training

cs.LG · 2025-11-10 · conditional · novelty 6.0

Curriculum post-training on reasoning trees yields polynomial sample complexity for accurate Chain-of-Thought generation in transformers, unlike exponential requirements without curriculum.

citing papers explorer

Showing 1 of 1 citing paper.

Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training cs.LG · 2025-11-10 · conditional · none · ref 8
Curriculum post-training on reasoning trees yields polynomial sample complexity for accurate Chain-of-Thought generation in transformers, unlike exponential requirements without curriculum.

Let˜αl(j) := exp(sl(j))P q̸=il exp(sl(q)) be the softmax normalized only over competitors, evaluated atW (t−1)

fields

years

verdicts

representative citing papers

citing papers explorer