pith. sign in

The final answer is: <answer> 25 </answer> SkillFactory models output for a GSM8k <think> <sample> To solve this problem, we need to follow these steps

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.CL 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

SkillFactory: Self-Distillation For Learning Cognitive Behaviors

cs.CL · 2025-12-03 · unverdicted · novelty 6.0

SkillFactory creates silver SFT data from a model's self-generated traces rearranged into cognitive skill formats to prime models for better skill use during subsequent RL, improving post-RL generalization and out-of-domain robustness.

citing papers explorer

Showing 1 of 1 citing paper.

  • SkillFactory: Self-Distillation For Learning Cognitive Behaviors cs.CL · 2025-12-03 · unverdicted · none · ref 36

    SkillFactory creates silver SFT data from a model's self-generated traces rearranged into cognitive skill formats to prime models for better skill use during subsequent RL, improving post-RL generalization and out-of-domain robustness.