Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

 

Other publications of authors with the same name

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU. ICML, volume 202 of Proceedings of Machine Learning Research, pages 31094-31116. PMLR, (2023)
Diagnosing Cardiac Abnormalities from 12-Lead Electrocardiograms Using Enhanced Deep Convolutional Neural Networks. MLMECH/CVII-STENT@MICCAI, volume 11794 of Lecture Notes in Computer Science, pages 36-44. Springer, (2019)
WaveletFCNN: A Deep Time Series Classification Model for Wind Turbine Blade Icing Detection. CoRR, (2019)
FlashFlex: Accommodating Large Language Model Training over Heterogeneous Environment. CoRR, (2024)
Efficient flow scheduling in distributed deep learning training with echelon formation. HotNets, pages 93-100. ACM, (2022)
Tensor Relational Algebra for Distributed Machine Learning System Design. Proc. VLDB Endow., 14 (8): 1338-1350 (2021)
Distributed Numerical and Machine Learning Computations via Two-Phase Execution of Aggregated Join Trees. Proc. VLDB Endow., 14 (7): 1228-1240 (2021)
Exploring the Robustness of Decentralized Training for Large Language Models. CoRR, (2023)
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU. CoRR, (2024)
Stochastic Gradient Descent without Full Data Shuffle. CoRR, (2022)