Canonical reference

URL https: //www.pnas.org/doi/abs/10.1073/pnas.1611835114

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A. Rusu + 2 more · 2017 · Proceedings of the National Academy of Sciences · DOI 10.1073/pnas.1611835114

Canonical reference. 90% of citing Pith papers cite this work as background.

73 Pith papers citing it

5,466 external citations · Crossref

Background 90% of classified citations

open at publisher browse 73 citing papers

citation-role summary

background 9 method 1

citation-polarity summary

background 9 use method 1

representative citing papers

Optimal L2 Regularization in High-dimensional Continual Linear Regression

cs.LG · 2026-01-20 · unverdicted · novelty 8.0

In high-dimensional continual linear regression, optimal fixed L2 regularization strength scales as T/ln T with the number of tasks and mitigates label noise for arbitrary linear teachers.

Continuous-time Optimal Stopping through Deep Reinforcement Learning

cs.LG · 2026-06-16 · unverdicted · novelty 7.0

CARLOS employs an aggregate deep neural network trained on progressively finer time grids with adaptive sampling to learn continuous-time exercise boundaries for optimal stopping, delivering higher values than discrete Bermudan methods.

EST-PRM: Stress-Testing Process Reward Models Before They Become Load-Bearing

cs.LG · 2026-05-30 · unverdicted · novelty 7.0

EST-PRM stress-tests five PRM models on 4,687 reasoning chains from MATH-500, GSM8K, and PRMBench using three label-preserving transformations and reports model-specific vulnerability patterns.

Continual Model Routing in Evolving Model Hubs

cs.AI · 2026-05-27 · unverdicted · novelty 7.0

Formalizes continual model routing (CMR), releases CMRBench with over 2000 models, and presents CARvE which outperforms retrieval, fine-tuning and adapter-merging baselines on model/family/domain accuracy.

Measuring the Depth of LLM Unlearning via Activation Patching

cs.CL · 2026-05-23 · unverdicted · novelty 7.0

Introduces Unlearning Depth Score (UDS) via activation patching to quantify LLM unlearning depth and claims it outperforms 20 other metrics in faithfulness and robustness on 150 models.

Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

Creates realistic benchmarks for continual anomaly detection and proposes DINOSaur, a training-free method with frozen DINOv3, coreset memory, and neighborhood scoring that outperforms others with zero forgetting and sub-100ms edge inference.

PMF-CL: Pareto-Minimal-Forgetting Continual Learner for Conflicting Tasks

cs.LG · 2026-05-18 · unverdicted · novelty 7.0 · 2 refs

PMF-CL derives Pareto-minimal-forgetting algorithms for linear/basis-function regression and quadratic-bounded losses like logistic regression, achieving static O(d²) memory for d-parameter models.

Reasoning Portability: Guiding Continual Learning for MLLMs in the RLVR Era

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

Formalizes Reasoning Portability (RP) and proposes RDB-CL to modulate per-sample KL regularization in RLVR for MLLM continual learning, achieving +12.0% Last accuracy over vanilla RLVR baseline by preserving reusable reasoning on high-RP samples.

Language Acquisition Device in Large Language Models

cs.CL · 2026-05-16 · unverdicted · novelty 7.0

Pre-pretraining on MP-STRUCT matches k-Shuffle Dyck baselines in efficiency while adding human-like resistance to implausible languages and challenges the need for C-RASP definability in effective PPT languages.

Streaming Adversarial Robustness in Fuzzy ARTMAP: Mechanism-Aligned Evaluation, Progressive Training, and Interpretable Diagnostics

cs.LG · 2026-05-07 · conditional · novelty 7.0

Fuzzy ARTMAP models are highly vulnerable to a new white-box attack aligned with their category competition, but progressive selective training yields stronger replay-free robustness than offline adversarial training under adaptive evaluation.

Atomic-Probe Governance for Skill Updates in Compositional Robot Policies

cs.RO · 2026-04-29 · unverdicted · novelty 7.0 · 2 refs

A cross-version swap protocol reveals dominant skills that swing composition success by up to 50 percentage points, and an atomic probe with selective revalidation governs updates at lower cost than always re-testing full compositions.

Preconditioned DeltaNet: Curvature-aware Sequence Modeling for Linear Recurrences

cs.LG · 2026-04-22 · unverdicted · novelty 7.0

Preconditioned delta-rule models with a diagonal curvature approximation improve upon standard DeltaNet, GDN, and KDA by better approximating the test-time regression objective.

CapTrack: Multifaceted Evaluation of Forgetting in LLM Post-Training

cs.LG · 2026-02-19 · unverdicted · novelty 7.0

CapTrack shows post-training causes drift beyond facts, with instruction fine-tuning producing stronger behavioral changes than preference optimization across model families.

A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA

cs.CL · 2023-11-28 · unverdicted · novelty 7.0

LoRA adapters should be scaled by 1/sqrt(rank) rather than 1/rank to stabilize learning and enable effective use of higher ranks during fine-tuning of large language models.

Trust the Right Teacher: Quality-Aware Self-Distillation for GUI Grounding

cs.AI · 2026-06-16 · unverdicted · novelty 6.0

Quality-aware self-distillation using soft correctness-aware gating and teacher-probability scaling improves VLM performance on GUI grounding benchmarks when both components are combined.

Learning to Solve, Forgetting to Retain: Correct-Set Turnover in RLVR

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

RLVR exhibits correct-set turnover where solved problems regress during training, and a periodic review mechanism exploiting a repair-window principle improves retention and performance over baselines.

Rethinking Molecular Text Representations for LLMs: An Empirical Study

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

Structured text representations like CML and MolJSON outperform SMILES variants on structural tasks while IUPAC dominates semantic tasks such as molecule retrieval across all tested LLMs.

Repurposing Adversarial Perturbations for Continual Learning: From Defense to Active Alignment

cs.LG · 2026-06-01 · unverdicted · novelty 6.0

AdvCL repurposes adversarial perturbations into geometric control signals for continual learning using Intra-Smooth, Proto-Clip, and Inter-Align modules, reporting gains in performance, robustness, lower forgetting, and stronger transfer.

Dynamic Proxy-Mixing: Transferring Replay Controllers from Small to Large Models for Continual Instruction Tuning

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

PROXYMIX learns a dynamic replay controller on a small proxy model and transfers it to a large target model, improving accuracy by 3.4 points and reducing forgetting by 3.5 points on LLaMA-3-8B continual tuning sequences.

Divergence Decoding: Inference-Time Unlearning via Auxiliary Models

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

Divergence Decoding steers LLM logits using small auxiliary models to unlearn specific data at inference time, outperforming baselines and generalizing to images.

A Systematic Study of Behavioral Cloning for Scientific Data Annotation

cs.HC · 2026-05-26 · unverdicted · novelty 6.0

Introduces 9 synthetic annotation tasks and benchmarks for behavioral cloning, finding hierarchical skill learning, scaling benefits, effective multi-task pretraining, and shared internal representations of task phases and mistakes.

TypedCSIP: Typed Counterfactual Pretraining for Chinese Legislative Conflict Classification

cs.CL · 2026-05-25 · unverdicted · novelty 6.0

TypedCSIP applies typed counterfactual selective intervention pretraining on expert revisions to lift macro-F1 by 0.9-1.3 pp on the LCR-CN Chinese legislative conflict classification benchmark under a pre-registered multi-seed test.

Rethinking Continual Learning for Speech and Audio: A Representation-Centric Taxonomy and Open Problems

eess.AS · 2026-05-24 · unverdicted · novelty 6.0

Introduces a representation-geometry-based taxonomy for continual learning in speech and audio, identifies mismatches with current CL assumptions in foundation models, and lists open challenges.

Unlocking Biological Workflows for Robust Protein-Text Question Answering: A Dual-Dimensional RAG Framework

cs.IR · 2026-05-17 · unverdicted · novelty 6.0

2D-ProteinRAG is a dual-dimensional RAG framework that incorporates BLAST workflows plus horizontal attribute alignment and vertical homology denoising to improve protein-text QA on both in-distribution and out-of-distribution cases.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

URL https: //www.pnas.org/doi/abs/10.1073/pnas.1611835114

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer