Author of the publication

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.

, , , , , , and . CoRR, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

3D reconstruction of complex geometric solids from 2D line drawings., and . SIGGRAPH ASIA Posters, page 9. ACM, (2013)CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs., , , , , , and . CoRR, (2023)Dive into Deep Learning for Natural Language Processing., , , , , , and . EMNLP/IJCNLP (2), Association for Computational Linguistics, (2019)SAPipe: Staleness-Aware Pipeline for Data Parallel DNN Training., , , , , , , and . NeurIPS, (2022)Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies., , , and . EuroSys, page 867-882. ACM, (2023)Temporal-Contextual Recommendation in Real-Time., , , and . KDD, page 2291-2299. ACM, (2020)MegaScale: Scaling Large Language Model Training to More Than 10, 000 GPUs., , , , , , , , , and 22 other author(s). NSDI, page 745-760. USENIX Association, (2024)dPRO: A Generic Performance Diagnosis and Optimization Toolkit for Expediting Distributed DNN Training., , , , , , , and . MLSys, mlsys.org, (2022)LEMON: Lossless model expansion., , , , , , , , and . ICLR, OpenReview.net, (2024)LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization., , , , and . CoRR, (2024)