pith. sign in

Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

background 2 dataset 1

citation-polarity summary

years

2025 3 2023 1

polarities

background 3

representative citing papers

Interactive Post-Training for Vision-Language-Action Models

cs.LG · 2025-05-22 · unverdicted · novelty 6.0

RIPT-VLA applies RL with dynamic rollout sampling and leave-one-out advantage estimation to fine-tune VLA models, achieving up to 97.5% success rates and recovering from 4% to 97% success with one demonstration in 15 iterations.

citing papers explorer

Showing 4 of 4 citing papers.