From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction., , , , и . CoRR, (2020)Scaling up Mean Field Games with Online Mirror Descent., , , , , , , и . CoRR, (2021)Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs., , , , , , , и . CoRR, (2024)Offline Reinforcement Learning as Anti-Exploration., , , , , , и . CoRR, (2021)Generalization in Mean Field Games by Learning Master Policies., , , , , и . CoRR, (2021)Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision., , , , , , , , и . CoRR, (2023)Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act., , , и . CoRR, (2022)Observe and Look Further: Achieving Consistent Performance on Atari., , , , , , , , , и 3 other автор(ы). CoRR, (2018)Kalman Temporal Differences., и . CoRR, (2014)Difference of Convex Functions Programming Applied to Control with Expert Data., , и . CoRR, (2016)