This article aims to provide a concise yet comprehensive introduction to one of the most important classes of control algorithms in Reinforcement Learning: Policy Gradients. I will discuss these…
S. Thoma, A. Rettinger, and F. Both. International Semantic Web Conference (1), volume 10587 of Lecture Notes in Computer Science, pages 694-710. Springer, (2017)