Author of the publication

The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models.

, , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Solving non-linear Kolmogorov equations in large dimensions by using deep learning: a numerical comparison of discretization schemes., and . CoRR, (2020)Phase transitions in the mini-batch size for sparse and dense two-layer neural networks., and . Mach. Learn. Sci. Technol., 5 (1): 15015 (March 2024)Learning in Wilson-Cowan model for metapopulation., , , , , , and . CoRR, (2024)A Short Review on Novel Approaches for Maximum Clique Problem: from Classical algorithms to Graph Neural Networks and Quantum algorithms., , and . CoRR, (2024)The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models., , , and . CoRR, (2024)Large independent sets on random d-regular graphs with d small., and . CoRR, (2020)Learning from Survey Propagation: a Neural Network for MAX-E-3-SAT.. CoRR, (2020)A Bridge between Dynamical Systems and Machine Learning: Engineered Ordinary Differential Equations as Classification Algorithm (EODECA)., , , , and . CoRR, (2023)Solving Non-linear Kolmogorov Equations in Large Dimensions by Using Deep Learning: A Numerical Comparison of Discretization Schemes., and . J. Sci. Comput., 94 (1): 8 (2023)Hard Optimization Problems have Soft Edges., and . CoRR, (2022)