X-KD: General Experiential Knowledge Distillation for Large Language Models

Yuang Cai, Yuyu Yuan · 2026 · arXiv 2602.12674

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Procedural Memory Distillation: Online Reflection for Self-Improving Language Models

cs.AI · 2026-07-01 · unverdicted · novelty 5.0

PMD extracts and distills cross-episode procedural knowledge from RL rollouts into LLM policies at three abstraction levels, yielding 3.8-13.6% gains over SDPO on SCIKNOWEVAL and LIVECODEBENCH via co-evolution.

citing papers explorer

Showing 1 of 1 citing paper.

Procedural Memory Distillation: Online Reflection for Self-Improving Language Models cs.AI · 2026-07-01 · unverdicted · none · ref 2
PMD extracts and distills cross-episode procedural knowledge from RL rollouts into LLM policies at three abstraction levels, yielding 3.8-13.6% gains over SDPO on SCIKNOWEVAL and LIVECODEBENCH via co-evolution.

X-KD: General Experiential Knowledge Distillation for Large Language Models

fields

years

verdicts

representative citing papers

citing papers explorer