
Towards an Understanding of Default Policies in Multitask Policy Optimization.

AISTATS, volume 151 of Proceedings of Machine Learning Research, pages 10661-10686. PMLR, (2022)


Other publications by persons with the same name

Confronting Reward Model Overoptimization with Constrained RLHF. CoRR, (2023)

A First-Occupancy Representation for Reinforcement Learning. ICLR, OpenReview.net, (2022)

Towards an Understanding of Default Policies in Multitask Policy Optimization. AISTATS, volume 151 of Proceedings of Machine Learning Research, pages 10661-10686. PMLR, (2022)

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs. ICML, volume 202 of Proceedings of Machine Learning Research, pages 25303-25336. PMLR, (2023)

Efficient Wasserstein Natural Gradients for Reinforcement Learning. ICLR, OpenReview.net, (2021)

Minimum Description Length Control. CoRR, (2022)

Minimum Description Length Control. ICLR, OpenReview.net, (2023)

The Transient Nature of Emergent In-Context Learning in Transformers. CoRR, (2023)

Tactical Optimism and Pessimism for Deep Reinforcement Learning. NeurIPS, pages 12849-12863. (2021)

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation. ICML, OpenReview.net, (2024)