Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs.

H. Zhang, Y. Li, W. Xiao, Y. Huang, X. Di, J. Yin, S. See, Y. Luo, C. Lau, and Y. You. CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Wencong Lu

Wenjiang Xiao

Fang Xiao

Chun Xiao

Other publications of authors with the same name

Tux2: Distributed Graph Computation for Machine Learning.W. Xiao, J. Xue, Y. Miao, Z. Li, C. Chen, M. Wu, W. Li, and L. Zhou. NSDI, page 669-682. USENIX Association, (2017)An empirical study on program failures of deep learning jobs.R. Zhang, W. Xiao, H. Zhang, Y. Liu, H. Lin, and M. Yang. ICSE, page 1159-1170. ACM, (2020)EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs.M. Li, W. Xiao, H. Yang, B. Sun, H. Zhao, S. Ren, Z. Luan, X. Jia, Y. Liu, Y. Li and 2 other author(s). SC, page 55:1-55:14. ACM, (2023)Balanced Sparsity for Efficient DNN Inference on GPU.Z. Yao, S. Cao, W. Xiao, C. Zhang, and L. Nie. CoRR, (2018)KV-Direct: High-Performance In-Memory Key-Value Store with Programmable NIC.B. Li, Z. Ruan, W. Xiao, Y. Lu, Y. Xiong, A. Putnam, E. Chen, and L. Zhang. SOSP, page 137-152. ACM, (2017)Efficient and Effective Sparse LSTM on FPGA with Bank-Balanced Sparsity.S. Cao, C. Zhang, Z. Yao, W. Xiao, L. Nie, D. chen Zhan, Y. Liu, M. Wu, and L. Zhang. FPGA, page 63-72. ACM, (2019)Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache.B. Lin, T. Peng, C. Zhang, M. Sun, L. Li, H. Zhao, W. Xiao, Q. Xu, X. Qiu, S. Li and 3 other author(s). CoRR, (2024)CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs.Q. Sun, Y. Liu, H. Yang, R. Zhang, M. Dun, M. Li, X. Liu, W. Xiao, Y. Li, Z. Luan and 1 other author(s). SC, page 39:1-39:15. IEEE, (2022)SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization.S. Cao, L. Ma, W. Xiao, C. Zhang, Y. Liu, L. Zhang, L. Nie, and Z. Yang. CVPR, page 11216-11225. Computer Vision Foundation / IEEE, (2019)GraM: scaling graph computation to the trillions.M. Wu, F. Yang, J. Xue, W. Xiao, Y. Miao, L. Wei, H. Lin, Y. Dai, and L. Zhou. SoCC, page 408-421. ACM, (2015)

BibSonomy

Disambiguation of "Xiao, Wencong"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs.

Please choose a person to relate this publication to

Wencong Lu

Wencong Lu

Wenjiang Xiao

Fang Xiao

Chun Xiao

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Xiao, Wencong"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs.

Please choose a person to relate this publication to

Wencong Lu

Wencong Lu

Wenjiang Xiao

Fang Xiao

Chun Xiao

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs.