Author of the publication

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters.

CoRR, (2022)

Other publications of authors with the same name

MG-WFBP: Efficient Data Communication for Distributed Synchronous SGD Algorithms. CoRR, (2018)
Mixed Precision Method for GPU-based FFT. CSE, page 580-586. IEEE Computer Society, (2011)
Communication-Efficient Distributed Deep Learning with Merged Gradient Sparsification on GPUs. INFOCOM, page 406-415. IEEE, (2020)
Reliable and Efficient In-Memory Fault Tolerance of Large Language Model Pretraining. CoRR, (2023)
Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters. MLSys, mlsys.org, (2021)
Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters. CoRR, (2022)
Communication-Efficient Distributed Deep Learning: A Comprehensive Survey. CoRR, (2020)
Layer-Wise Adaptive Gradient Sparsification for Distributed Deep Learning with Convergence Guarantees. ECAI, volume 325 of Frontiers in Artificial Intelligence and Applications, page 1467-1474. IOS Press, (2020)
EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching. ECCV (32), volume 13692 of Lecture Notes in Computer Science, page 437-453. Springer, (2022)
DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining. ICDCS, page 142-153. IEEE, (2023)