pith. sign in

arxiv: 1905.07727 · v1 · pith:FEA7MAY6new · submitted 2019-05-19 · 💻 cs.LG · cs.AI

Reinforcement Learning for Learning of Dynamical Systems in Uncertain Environment: a Tutorial

classification 💻 cs.LG cs.AI
keywords learningdiscussedalgorithmsdynamicalmodel-freeq-learningreinforcementsystems
0
0 comments X
read the original abstract

In this paper, a review of model-free reinforcement learning for learning of dynamical systems in uncertain environments has discussed. For this purpose, the Markov Decision Process (MDP) will be reviewed. Furthermore, some learning algorithms such as Temporal Difference (TD) learning, Q-Learning, and Approximate Q-learning as model-free algorithms which constitute the main part of this article have been investigated, and benefits and drawbacks of each algorithm will be discussed. The discussed concepts in each section are explaining with details and examples.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.