

Bootstrap your own latent: A new approach to self-supervised Learning

J. Grill, F. Strub, F. Altché, C. Tallec, P. Richemond, E. Buchatskaya, C. Doersch, B. Pires, Z. Guo, M. Azar, B. Piot, K. Kavukcuoglu, R. Munos, and M. Valko. (2020). arXiv:2006.07733.


Other publications by persons with the same name

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning. A. Gruslys, W. Dabney, M. Azar, B. Piot, M. Bellemare, and R. Munos. ICLR (2017). arXiv:1704.04651.

Convex Relaxation Regression: Black-Box Optimization of Smooth Functions by Learning Their Convex Envelopes. M. Azar, E. Dyer, and K. Körding. UAI, AUAI Press (2016).

Rainbow: Combining Improvements in Deep Reinforcement Learning. M. Hessel, J. Modayil, H. van Hasselt, T. Schaul, G. Ostrovski, W. Dabney, D. Horgan, B. Piot, M. Azar, and D. Silver. AAAI, pp. 3215-3222. AAAI Press (2018).

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice. T. Kitamura, T. Kozuno, Y. Tang, N. Vieillard, M. Valko, W. Yang, J. Mei, P. Ménard, M. Azar, R. Munos, and 5 other authors. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 17135-17175. PMLR (2023).

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion. Y. Flet-Berliac, N. Grinsztajn, F. Strub, E. Choi, C. Cremer, A. Ahmadian, Y. Chandak, M. Azar, O. Pietquin, and M. Geist. CoRR (2024).

Averaging log-likelihoods in direct alignment. N. Grinsztajn, Y. Flet-Berliac, M. Azar, F. Strub, B. Wu, E. Choi, C. Cremer, A. Ahmadian, Y. Chandak, O. Pietquin, and 1 other author. CoRR (2024).

Neural Predictive Belief Representations. Z. Guo, M. Azar, B. Piot, B. Pires, T. Pohlen, and R. Munos. CoRR (2018).

Meta-learning of Sequential Strategies. P. Ortega, J. Wang, M. Rowland, T. Genewein, Z. Kurth-Nelson, R. Pascanu, N. Heess, J. Veness, A. Pritzel, P. Sprechmann, and 14 other authors. CoRR (2019).

Fast computation of Nash Equilibria in Imperfect Information Games. R. Munos, J. Pérolat, J. Lespiau, M. Rowland, B. Vylder, M. Lanctot, F. Timbers, D. Hennes, S. Omidshafiei, A. Gruslys, and 3 other authors. ICML, volume 119 of Proceedings of Machine Learning Research, pp. 7119-7129. PMLR (2020).

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning. Z. Guo, B. Pires, B. Piot, J. Grill, F. Altché, R. Munos, and M. Azar. ICML, volume 119 of Proceedings of Machine Learning Research, pp. 3875-3886. PMLR (2020).
