Continuous Control with Action Quantization from Demonstrations.

R. Dadashi, L. Hussenot, D. Vincent, S. Girgin, A. Raichuk, M. Geist, and O. Pietquin. ICML, volume 162 of Proceedings of Machine Learning Research, pp. 4537-4557. PMLR, (2022)

Other publications by persons with the same name

The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction. A. Martin, C. Ollion, F. Strub, S. Corff, and O. Pietquin. CoRR, (2020)

Scaling up Mean Field Games with Online Mirror Descent. J. Pérolat, S. Perrin, R. Elie, M. Laurière, G. Piliouras, M. Geist, K. Tuyls, and O. Pietquin. CoRR, (2021)

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs. A. Ahmadian, C. Cremer, M. Gallé, M. Fadaee, J. Kreutzer, O. Pietquin, A. Üstün, and S. Hooker. CoRR, (2024)

Offline Reinforcement Learning as Anti-Exploration. S. Rezaeifar, R. Dadashi, N. Vieillard, L. Hussenot, O. Bachem, O. Pietquin, and M. Geist. CoRR, (2021)

Generalization in Mean Field Games by Learning Master Policies. S. Perrin, M. Laurière, J. Pérolat, R. Élie, M. Geist, and O. Pietquin. CoRR, (2021)

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. E. Kharitonov, D. Vincent, Z. Borsos, R. Marinier, S. Girgin, O. Pietquin, M. Sharifi, M. Tagliasacchi, and N. Zeghidour. CoRR, (2023)

Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act. A. Jacq, J. Ferret, O. Pietquin, and M. Geist. CoRR, (2022)

Observe and Look Further: Achieving Consistent Performance on Atari. T. Pohlen, B. Piot, T. Hester, M. Azar, D. Horgan, D. Budden, G. Barth-Maron, H. van Hasselt, J. Quan, M. Vecerík, and 3 other authors. CoRR, (2018)

Kalman Temporal Differences. M. Geist, and O. Pietquin. CoRR, (2014)

Difference of Convex Functions Programming Applied to Control with Expert Data. B. Piot, M. Geist, and O. Pietquin. CoRR, (2016)
