Pretrained vision- language-action models are surprisingly resistant to forgetting in continual learning

· 2026 · arXiv 2603.03818

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Can VLA Models Learn from Real-World Data Continually without Forgetting?

cs.RO · 2026-05-26 · unverdicted · novelty 7.0

VLA models exhibit catastrophic forgetting on a new real-world dataset of four sequential manipulation tasks, with experience replay implementation factors evaluated for mitigation.

PHASER: Phase-Aware and Semantic Experience Replay for Vision-Language-Action Models

cs.RO · 2026-06-02 · unverdicted · novelty 6.0

PHASER improves average success rate by up to 31% over uniform experience replay on LIBERO continual learning benchmarks for VLA models by phase-centric capacity allocation and semantic interference routing.

Multisensory Continual Learning: Adapting Pretrained Visuomotor Policies to Force

cs.RO · 2026-06-29 · unverdicted · novelty 5.0 · 2 refs

MuSe adapts vision-only pretrained visuomotor policies to force-torque sensing via multi-stage fusion, multisensory future prediction, and experience replay, achieving strong contact-rich performance while preserving original task results.

Preserving Foundational Capabilities in Flow-Matching VLAs through Conservative SFT

cs.RO · 2026-05-09 · unverdicted · novelty 5.0 · 2 refs

ConSFT is a gradient-scaling fine-tuning objective for flow-matching VLAs that bounds parameter disruption via model-confidence weighting, yielding over 20% better capability retention than vanilla SFT on LIBERO and RoboTwin.

citing papers explorer

Showing 4 of 4 citing papers.

Can VLA Models Learn from Real-World Data Continually without Forgetting? cs.RO · 2026-05-26 · unverdicted · none · ref 6
VLA models exhibit catastrophic forgetting on a new real-world dataset of four sequential manipulation tasks, with experience replay implementation factors evaluated for mitigation.
PHASER: Phase-Aware and Semantic Experience Replay for Vision-Language-Action Models cs.RO · 2026-06-02 · unverdicted · none · ref 6
PHASER improves average success rate by up to 31% over uniform experience replay on LIBERO continual learning benchmarks for VLA models by phase-centric capacity allocation and semantic interference routing.
Multisensory Continual Learning: Adapting Pretrained Visuomotor Policies to Force cs.RO · 2026-06-29 · unverdicted · none · ref 14 · 2 links
MuSe adapts vision-only pretrained visuomotor policies to force-torque sensing via multi-stage fusion, multisensory future prediction, and experience replay, achieving strong contact-rich performance while preserving original task results.
Preserving Foundational Capabilities in Flow-Matching VLAs through Conservative SFT cs.RO · 2026-05-09 · unverdicted · none · ref 19 · 2 links
ConSFT is a gradient-scaling fine-tuning objective for flow-matching VLAs that bounds parameter disruption via model-confidence weighting, yielding over 20% better capability retention than vanilla SFT on LIBERO and RoboTwin.

Pretrained vision- language-action models are surprisingly resistant to forgetting in continual learning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer