Author of the publication

Please choose a person to relate this publication to

To distinguish between people with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

An empirical study on program failures of deep learning jobs. ICSE, pages 1159-1170. ACM, (2020)
Tux2: Distributed Graph Computation for Machine Learning. NSDI, pages 669-682. USENIX Association, (2017)
EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs. SC, pages 55:1-55:14. ACM, (2023)
Balanced Sparsity for Efficient DNN Inference on GPU. CoRR, (2018)
KV-Direct: High-Performance In-Memory Key-Value Store with Programmable NIC. SOSP, pages 137-152. ACM, (2017)
Efficient and Effective Sparse LSTM on FPGA with Bank-Balanced Sparsity. FPGA, pages 63-72. ACM, (2019)
CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs. SC, pages 39:1-39:15. IEEE, (2022)
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache. CoRR, (2024)
EasyScale: Accuracy-consistent Elastic Training for Deep Learning. CoRR, (2022)
MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs. CoRR, (2023)