The Transient Nature of Emergent In-Context Learning in Transformers.

A. Singh, S. Chan, T. Moskovitz, E. Grant, A. Saxe, and F. Hill. CoRR, (2023)

Other publications by persons with the same name

Towards an Understanding of Default Policies in Multitask Policy Optimization. T. Moskovitz, M. Arbel, J. Parker-Holder, and A. Pacchiano. AISTATS, volume 151 of Proceedings of Machine Learning Research, pp. 10661-10686. PMLR, (2022)

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs. T. Moskovitz, B. O'Donoghue, V. Veeriah, S. Flennerhag, S. Singh, and T. Zahavy. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 25303-25336. PMLR, (2023)

Confronting Reward Model Overoptimization with Constrained RLHF. T. Moskovitz, A. Singh, D. Strouse, T. Sandholm, R. Salakhutdinov, A. Dragan, and S. McAleer. CoRR, (2023)

A First-Occupancy Representation for Reinforcement Learning. T. Moskovitz, S. Wilson, and M. Sahani. ICLR, OpenReview.net, (2022)

Minimum Description Length Control. T. Moskovitz, T. Kao, M. Sahani, and M. Botvinick. CoRR, (2022)

Efficient Wasserstein Natural Gradients for Reinforcement Learning. T. Moskovitz, M. Arbel, F. Huszar, and A. Gretton. ICLR, OpenReview.net, (2021)

Minimum Description Length Control. T. Moskovitz, T. Kao, M. Sahani, and M. Botvinick. ICLR, OpenReview.net, (2023)

The Transient Nature of Emergent In-Context Learning in Transformers. A. Singh, S. Chan, T. Moskovitz, E. Grant, A. Saxe, and F. Hill. CoRR, (2023)

Tactical Optimism and Pessimism for Deep Reinforcement Learning. T. Moskovitz, J. Parker-Holder, A. Pacchiano, M. Arbel, and M. Jordan. NeurIPS, pp. 12849-12863. (2021)

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation. A. Singh, T. Moskovitz, F. Hill, S. Chan, and A. Saxe. CoRR, (2024)