Author of the publication

HPDL: Towards a General Framework for High-performance Distributed Deep Learning.

, , , , , , and . ICDCS, page 1742-1753. IEEE, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Multidimensional Communication Scheduling Method for Hybrid Parallel DNN Training., , , , , and . IEEE Trans. Parallel Distributed Syst., 35 (8): 1415-1428 (August 2024)An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines., , , , , and . ICDM Workshops, page 1209-1216. IEEE, (2018)Auto-Divide GNN: Accelerating GNN Training with Subgraph Division., , , , , and . Euro-Par, volume 14100 of Lecture Notes in Computer Science, page 367-382. Springer, (2023)AutoPipe: A Fast Pipeline Parallelism Approach with Balanced Partitioning and Micro-batch Slicing., , , , , and . CLUSTER, page 301-312. IEEE, (2022)Compressed Collective Sparse-Sketch for Distributed Data-Parallel Training of Deep Learning Models., , , , , and . IEEE J. Sel. Areas Commun., 41 (4): 941-963 (April 2023)S2 Reducer: High-Performance Sparse Communication to Accelerate Distributed Deep Learning., , , , , and . ICASSP, page 5233-5237. IEEE, (2022)Prophet: Fine-grained Load Balancing for Parallel Training of Large-scale MoE Models., , , , , , , and . CLUSTER, page 82-94. IEEE, (2023)Tag Pollution Detection in Web Videos via Cross-Modal Relevance Estimation., , , , and . IWQoS, page 1-10. IEEE, (2020)HPDL: Towards a General Framework for High-performance Distributed Deep Learning., , , , , , and . ICDCS, page 1742-1753. IEEE, (2019)Automated Tensor Model Parallelism with Overlapped Communication for Efficient Foundation Model Training., , , , , , , and . CoRR, (2023)