Author of the publication

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity.

, , , , , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Randomness In Neural Network Training: Characterizing The Impact of Tooling., , , and . CoRR, (2021)An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning., , , , , , , , , and 1 other author(s). CoRR, (2020)Randomness in Neural Network Training: Characterizing the Impact of Tooling., , , and . MLSys, mlsys.org, (2022)MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures., , , , , , and . OSDI, page 989-1005. USENIX Association, (2024)ClickTrain: efficient and accurate end-to-end deep learning training via fine-grained architecture-preserving pruning., , , , , , , , , and 1 other author(s). ICS, page 266-278. ACM, (2021)Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs., , , , , , , , , and 3 other author(s). USENIX ATC, page 699-713. USENIX Association, (2024)Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity., , , , , , , , and . CoRR, (2023)Bring orders into uncertainty: enabling efficient uncertain graph processing via novel path sampling on multi-accelerator systems., , , , , , , , , and 2 other author(s). ICS, page 11:1-11:14. ACM, (2022)An efficient uncertain graph processing framework for heterogeneous architectures., , , , , , , and . PPoPP, page 477-479. ACM, (2021)η-LSTM: Co-Designing Highly-Efficient Large LSTM Training via Exploiting Memory-Saving and Architectural Design Opportunities., , , , , , and . ISCA, page 567-580. IEEE, (2021)