Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Accelerating large-scale distributed neural network training with SPMD parallelism.

S. Zhang, L. Diao, C. Wu, S. Wang, and W. Lin. SoCC, page 403-418. ACM, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Xunxing Diao

Linan Diao

Lan Diao

Qiyu Diao

Ren Diao

Other publications of authors with the same name

PAI-FCNN: FPGA Based Inference System for Complex CNN Models.L. Xia, L. Diao, Z. Jiang, H. Liang, K. Chen, L. Ding, S. Dou, Z. Su, M. Sun, J. Zhang and 1 other author(s). ASAP, page 107-114. IEEE, (2019)DAPPLE: A Pipelined Data Parallel Approach for Training Large Models.S. Fan, Y. Rong, C. Meng, Z. Cao, S. Wang, Z. Zheng, C. Wu, G. Long, J. Yang, L. Xia and 3 other author(s). CoRR, (2020)Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform.S. Zhang, L. Diao, S. Wang, Z. Cao, Y. Gu, C. Si, Z. Shi, Z. Zheng, C. Wu, and W. Lin. CoRR, (2023)Optimizing distributed training deployment in heterogeneous GPU clusters.X. Yi, S. Zhang, Z. Luo, G. Long, L. Diao, C. Wu, Z. Zheng, J. Yang, and W. Lin. CoNEXT, page 93-107. ACM, (2020)DISC: A Dynamic Shape Compiler for Machine Learning Workloads.K. Zhu, W. Zhao, Z. Zheng, T. Guo, P. Zhao, J. Bai, J. Yang, X. Liu, L. Diao, and W. Lin. EuroMLSys@EuroSys, page 89-95. ACM, (2021)Expediting Distributed DNN Training with Device Topology-Aware Graph Deployment.S. Zhang, X. Yi, L. Diao, C. Wu, S. Wang, and W. Lin. CoRR, (2023)Accelerating large-scale distributed neural network training with SPMD parallelism.S. Zhang, L. Diao, C. Wu, S. Wang, and W. Lin. SoCC, page 403-418. ACM, (2022)PAI-FCNN: FPGA Based CNN Inference System.L. Diao, Z. Jiang, H. Liang, C. Ye, K. Chen, L. Ding, S. Dou, M. Sun, L. Xia, J. Zhang and 1 other author(s). FPGA, page 184. ACM, (2019)FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads.Z. Zheng, P. Zhao, G. Long, F. Zhu, K. Zhu, W. Zhao, L. Diao, J. Yang, and W. Lin. CoRR, (2020)HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis.S. Zhang, L. Diao, C. Wu, Z. Cao, S. Wang, and W. Lin. EuroSys, page 524-541. ACM, (2024)

BibSonomy

Disambiguation of "Diao, Lansong"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Accelerating large-scale distributed neural network training with SPMD parallelism.

Please choose a person to relate this publication to

Xunxing Diao

Linan Diao

Lan Diao

Qiyu Diao

Ren Diao

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Diao, Lansong"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Accelerating large-scale distributed neural network training with SPMD parallelism.

Please choose a person to relate this publication to

Xunxing Diao

Linan Diao

Lan Diao

Qiyu Diao

Ren Diao

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Accelerating large-scale distributed neural network training with SPMD parallelism.