From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Navigating Scaling Laws: Compute Optimality in Adaptive Model Training., , , и . ICML, OpenReview.net, (2024)Learning Associative Inference Using Fast Weight Memory., , и . CoRR, (2020)Linear Transformers Are Secretly Fast Weight Programmers., , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 9355-9366. PMLR, (2021)A Modern Self-Referential Weight Matrix That Learns to Modify Itself., , , и . ICML, том 162 из Proceedings of Machine Learning Research, стр. 9660-9677. PMLR, (2022)Language Imbalance Can Boost Cross-lingual Generalisation., , , , и . CoRR, (2024)Learning to Reason with Third Order Tensor Products., и . NeurIPS, стр. 10003-10014. (2018)Going Beyond Linear Transformers with Recurrent Fast Weight Programmers., , , и . NeurIPS, стр. 7703-7717. (2021)Learning Associative Inference Using Fast Weight Memory., , и . ICLR, OpenReview.net, (2021)Mindstorms in Natural Language-Based Societies of Mind., , , , , , , , , и 16 other автор(ы). CoRR, (2023)On the Effect of (Near) Duplicate Subwords in Language Modelling., , , и . ACL (Findings), стр. 9580-9597. Association for Computational Linguistics, (2024)