Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

Other publications of persons with the same name

When can transformers reason with abstract symbols? CoRR, (2023)
Holographic stress tensor for non-relativistic theories. Journal of High Energy Physics, 2009 (09): 009 (10.07.2009)
An Improved Continuous-Action Extended Classifier Systems for Function Approximation. Complex Adaptive Systems, volume 61 of Procedia Computer Science, pp. 361-366. Elsevier, (2015)
How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks. CoRR, (2024)
Adaptivity and Modularity for Efficient Generalization Over Task Complexity. CoRR, (2023)
What Algorithms can Transformers Learn? A Study in Length Generalization. CoRR, (2023)
Vanishing Gradients in Reinforcement Finetuning of Language Models. CoRR, (2023)
An Improved eXtended Classifier System for the Real-time-input Real-time-output (XCSRR) Stability Control of a Biped Robot. Complex Adaptive Systems, volume 61 of Procedia Computer Science, pp. 492-499. Elsevier, (2015)
The Slingshot Effect: A Late-Stage Optimization Anomaly in Adaptive Gradient Methods. Trans. Mach. Learn. Res., (2024)
How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad. CoRR, (2024)