Sample Complexity of Multi-task Reinforcement Learning

· 2013 · cs.LG · arXiv 1309.6821

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Transferring knowledge across a sequence of reinforcement-learning tasks is challenging, and has a number of important applications. Though there is encouraging empirical evidence that transfer can improve performance in subsequent reinforcement-learning tasks, there has been very little theoretical analysis. In this paper, we introduce a new multi-task algorithm for a sequence of reinforcement-learning tasks when each task is sampled independently from (an unknown) distribution over a finite set of Markov decision processes whose parameters are initially unknown. For this setting, we prove under certain assumptions that the per-task sample complexity of exploration is reduced significantly due to transfer compared to standard single-task algorithms. Our multi-task algorithm also has the desired characteristic that it is guaranteed not to exhibit negative transfer: in the worst case its per-task sample complexity is comparable to the corresponding single-task algorithm.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

DRATS derives a minimax objective from a feasibility formulation of MTRL to adaptively sample tasks with the largest return gaps, leading to better worst-task performance on MetaWorld benchmarks.

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

cs.LG · 2026-04-04 · unverdicted · novelty 7.0

A low-rank matrix estimation method in a reward-free RL framework learns shared representations across linear MDPs and yields near-optimal policies with characterized regret bounds under relaxed feature assumptions.

citing papers explorer

Showing 2 of 2 citing papers.

Distributionally Robust Multi-Task Reinforcement Learning via Adaptive Task Sampling cs.LG · 2026-05-14 · unverdicted · none · ref 298 · internal anchor
DRATS derives a minimax objective from a feasibility formulation of MTRL to adaptively sample tasks with the largest return gaps, leading to better worst-task performance on MetaWorld benchmarks.
Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards cs.LG · 2026-04-04 · unverdicted · none · ref 13
A low-rank matrix estimation method in a reward-free RL framework learns shared representations across linear MDPs and yields near-optimal policies with characterized regret bounds under relaxed feature assumptions.

Sample Complexity of Multi-task Reinforcement Learning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer