Author of the publication

Hoplite: efficient and fault-tolerant collective communication for task-based distributed systems.

, , , , , , , and . SIGCOMM, page 641-656. ACM, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Geometric properties estimation from discrete curves using discrete derivatives., , , and . Comput. Graph., 35 (4): 916-930 (2011)Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning., , , , , , , , , and 1 other author(s). CoRR, (2022)Hoplite: efficient and fault-tolerant collective communication for task-based distributed systems., , , , , , , and . SIGCOMM, page 641-656. ACM, (2021)Hint-Based Training for Non-Autoregressive Machine Translation., , , , , , and . EMNLP/IJCNLP (1), page 5707-5712. Association for Computational Linguistics, (2019)Fast Structured Decoding for Sequence Models., , , , , and . NeurIPS, page 3011-3020. (2019)Fairness in Serving Large Language Models., , , , , , , and . OSDI, page 965-988. USENIX Association, (2024)FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU., , , , , , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 31094-31116. PMLR, (2023)Efficient Training of BERT by Progressively Stacking., , , , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 2337-2346. PMLR, (2019)Towards Binary-Valued Gates for Robust LSTM Training., , , , , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 3001-3010. PMLR, (2018)AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving., , , , , , , , , and 1 other author(s). OSDI, page 663-679. USENIX Association, (2023)