Odalric-Ambrym Maillard

Identifiers

  • name variant Odalric-Ambrym Maillard 0.60 · backfill

Papers (15)

  1. Pliable rejection sampling · stat.ML · 2026 · author #4
  2. Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits · cs.LG · 2019 · author #2
  3. Practical Open-Loop Optimistic Planning · cs.LG · 2019 · author #2
  4. Budgeted Reinforcement Learning in Continuous State Space · cs.LG · 2019 · author #5
  5. Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs · stat.ML · 2018 · author #2
  6. Efficient tracking of a growing number of experts · stat.ML · 2017 · author #2
  7. Streaming kernel regression with provably adaptive mean, variance, and regularization · stat.ML · 2017 · author #2
  8. Boundary Crossing Probabilities for General Exponential Families · stat.ML · 2017 · author #1
  9. Random Shuffling and Resets for the Non-stationary Stochastic Bandit Problem · cs.AI · 2016 · author #3
  10. Low-rank Bandits with Latent Mixtures · cs.LG · 2016 · author #2
  11. Selecting Near-Optimal Approximate State Representations in Reinforcement Learning · cs.LG · 2014 · author #2
  12. Concentration inequalities for sampling without replacement · math.ST · 2013 · author #2
  13. Optimal Regret Bounds for Selecting the State Representation in Reinforcement Learning · cs.LG · 2013 · author #1
  14. Selecting the State-Representation in Reinforcement Learning · cs.LG · 2013 · author #1
  15. Kullback-Leibler upper confidence bounds for optimal sequential allocation · math.PR · 2012 · author #3

Mentions

  • 1803.01626 #2 · arxiv_oai · confidence 0.70 · Odalric-Ambrym Maillard
  • 1405.2652 #2 · backfill · confidence 0.70 · Odalric-Ambrym Maillard
  • 1309.4029 #2 · backfill · confidence 0.70 · Odalric-Ambrym Maillard
  • 1302.2553 #1 · backfill · confidence 0.70 · Odalric-Ambrym Maillard
  • 1302.2552 #1 · backfill · confidence 0.70 · Odalric-Ambrym Maillard
  • 1210.1136 #3 · backfill · confidence 0.70 · Odalric-Ambrym Maillard

Frequent Coauthors