arXiv preprint arXiv:2505.21067 , year=

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning , author= · arXiv 2505.21067

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

cs.AI · 2026-05-04 · unverdicted · novelty 6.0

CoRD uses collaborative multi-teacher step-wise decoding with perplexity-guided beam search to generate higher-quality Long-CoT data that lets smaller models reach near-teacher performance with less supervision.

citing papers explorer

Showing 1 of 1 citing paper.

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding cs.AI · 2026-05-04 · unverdicted · none · ref 24
CoRD uses collaborative multi-teacher step-wise decoding with perplexity-guided beam search to generate higher-quality Long-CoT data that lets smaller models reach near-teacher performance with less supervision.

arXiv preprint arXiv:2505.21067 , year=

fields

years

verdicts

representative citing papers

citing papers explorer