Author of the publication

Whale: Efficient Giant Model Training over Heterogeneous GPUs.

USENIX Annual Technical Conference, pages 673-688. USENIX Association, (2022)

Other publications of authors with the same name

Higher-order Weyl superconductors with anisotropic Weyl-point connectivity. Phys. Rev. B, 103 (18): 184510 (May 19, 2021)

DREW: Efficient Winograd CNN Inference with Deep Reuse. WWW, pages 1807-1816. ACM, (2022)

HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations. ASPLOS, pages 153-166. ACM, (2019)

ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks. CoRR, (2023)

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform. CoRR, (2023)

Flash-LLM: Enabling Low-Cost and Highly-Efficient Large Generative Model Inference With Unstructured Sparsity. Proc. VLDB Endow., 17 (2): 211-224 (2023)

Optimizing distributed training deployment in heterogeneous GPU clusters. CoNEXT, pages 93-107. ACM, (2020)

DISC: A Dynamic Shape Compiler for Machine Learning Workloads. EuroMLSys@EuroSys, pages 89-95. ACM, (2021)

Design of Combined Receiving Lens for Panoramic Laser Fuze Detection. ICIA, pages 281-285. IEEE, (2018)

Illumination Variation in Images in Independent Component Analysis and Principal Component Analysis Subspaces. ISDA (2), pages 724-729. IEEE Computer Society, (2006)