arXiv preprint arXiv:2406.06391 , year=

Towards Lifelong Learning of Large Language Models: A Survey , author= · arXiv 2406.06391

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Repeated post-training is not Self-improving: Diagnosing Scientific Amnesia in Continual DPO Pipelines

cs.AI · 2026-06-17 · unverdicted · novelty 5.0

Scientific amnesia is observable in production-like continual DPO pipelines, with most tested strategy proposers degrading in peak performance and results depending sharply on chain regime, evaluator, and seed coverage.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Repeated post-training is not Self-improving: Diagnosing Scientific Amnesia in Continual DPO Pipelines cs.AI · 2026-06-17 · unverdicted · none · ref 8
Scientific amnesia is observable in production-like continual DPO pipelines, with most tested strategy proposers degrading in peak performance and results depending sharply on chain regime, evaluator, and seed coverage.

arXiv preprint arXiv:2406.06391 , year=

fields

years

verdicts

representative citing papers

citing papers explorer