Inproceedings,

MegaScale: Scaling Large Language Model Training to More Than 10, 000 GPUs.

, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and .
NSDI, page 745-760. USENIX Association, (2024)

Meta data

Tags

Users

  • @dblp

Comments and Reviews