Author of the publication

Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters.

, , , , , and . ISC, volume 13289 of Lecture Notes in Computer Science, page 109-130. Springer, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Accelerating Broadcast Communication with GPU Compression for Deep Learning Workloads., , , , and . HIPC, page 22-31. IEEE, (2022)Performance Characterization of DNN Training using TensorFlow and PyTorch on Modern Clusters., , , , and . CLUSTER, page 1-11. IEEE, (2019)trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback., , , , , , , and . EMNLP, page 8578-8595. Association for Computational Linguistics, (2023)RWKV: Reinventing RNNs for the Transformer Era., , , , , , , , , and 20 other author(s). CoRR, (2023)Evaluating Multi-Level Checkpointing for Distributed Deep Neural Network Training., and . SC (Workshops), page 60-67. IEEE, (2021)Accelerating GPU-based Machine Learning in Python using MPI Library: A Case Study with MVAPICH2-GDR., , , , and . MLHPC/AI4S@SC, page 17-28. IEEE, (2020)Accelerating Distributed Deep Learning Training with Compression Assisted Allgather and Reduce-Scatter Communication., , , , , , and . IPDPS, page 134-144. IEEE, (2023)Scaling Single-Image Super-Resolution Training on Modern HPC Clusters: Early Experiences., , , and . IPDPS Workshops, page 923-932. IEEE, (2021)Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference., , , , and . IPDPS, page 915-925. IEEE, (2024)Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence., , , , , , , , , and 18 other author(s). CoRR, (2024)