Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Tux2: Distributed Graph Computation for Machine Learning., , , , , , , and . NSDI, page 669-682. USENIX Association, (2017)EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs., , , , , , , , , and 2 other author(s). SC, page 55:1-55:14. ACM, (2023)Crux: GPU-Efficient Communication Scheduling for Deep Learning Training., , , , , , , , and . SIGCOMM, page 1-15. ACM, (2024)An empirical study on program failures of deep learning jobs., , , , , and . ICSE, page 1159-1170. ACM, (2020)Balanced Sparsity for Efficient DNN Inference on GPU., , , , and . CoRR, (2018)KV-Direct: High-Performance In-Memory Key-Value Store with Programmable NIC., , , , , , , and . SOSP, page 137-152. ACM, (2017)Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache., , , , , , , , , and 3 other author(s). CoRR, (2024)CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs., , , , , , , , , and 1 other author(s). SC, page 39:1-39:15. IEEE, (2022)Efficient and Effective Sparse LSTM on FPGA with Bank-Balanced Sparsity., , , , , , , , and . FPGA, page 63-72. ACM, (2019)Balanced Sparsity for Efficient DNN Inference on GPU., , , , and . AAAI, page 5676-5683. AAAI Press, (2019)