Author of the publication

Thorough Characterization and Analysis of Large Transformer Model Training At-Scale.

, , , , , , , and . Proc. ACM Meas. Anal. Comput. Syst., 8 (1): 8:1-8:25 (2024)

Other publications of authors with the same name

RG inspired Machine Learning for lattice field theory, , , and . EPJ Web of Conferences, (Oct 5, 2017)

GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics., , , , , , , , , and 26 other author(s). Int. J. High Perform. Comput. Appl., 37 (6): 683-705 (November 2023)

LeapfrogLayers: A Trainable Framework for Effective Topological Sampling., , and . CoRR, (2021)

Deep Learning Hamiltonian Monte Carlo., , and . CoRR, (2021)

Applications of Machine Learning to Lattice Quantum Field Theory., , , , , , , , , and 1 other author(s). CoRR, (2022)

A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators., , , , , , , , and . CoRR, (2023)

HMC with Normalizing Flows., , , , , and . CoRR, (2021)

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies., , , , , , , , , and 82 other author(s). CoRR, (2023)

Thorough Characterization and Analysis of Large Transformer Model Training At-Scale., , , , , , , and . Proc. ACM Meas. Anal. Comput. Syst., 8 (1): 8:1-8:25 (2024)