@dblp

Thorough Characterization and Analysis of Large Transformer Model Training At-Scale.

, , , , , , , and . Proc. ACM Meas. Anal. Comput. Syst., 8 (1): 8:1-8:25 (2024)

Links and resources

Tags