This article aims to provide a concise yet comprehensive introduction to one of the most important class of control algorithms in Reinforcement Learning - Policy Gradients. I will discuss these…
S. Thoma, A. Rettinger, и F. Both. International Semantic Web Conference (1), том 10587 из Lecture Notes in Computer Science, стр. 694-710. Springer, (2017)