Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters.

Q. Weng, W. Xiao, Y. Yu, W. Wang, C. Wang, J. He, Y. Li, L. Zhang, W. Lin, and Y. Ding. NSDI, page 945-960. USENIX Association, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Friedrich Weng

Jian Weng

Matthias Weng

Volker Weng

Dietmar Weng

Other publications of authors with the same name

MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters.Q. Weng, W. Xiao, Y. Yu, W. Wang, C. Wang, J. He, Y. Li, L. Zhang, W. Lin, and Y. Ding. NSDI, page 945-960. USENIX Association, (2022)Workload consolidation in alibaba clusters: the good, the bad, and the ugly.Y. Zhang, Y. Yu, W. Wang, Q. Chen, J. Wu, Z. Zhang, J. Zhong, T. Ding, Q. Weng, L. Yang and 4 other author(s). SoCC, page 210-225. ACM, (2022)Metis: learning to schedule long-running applications in shared container clusters at scale.L. Wang, Q. Weng, W. Wang, C. Chen, and B. Li. SC, page 68. IEEE/ACM, (2020)Towards Framework-Independent, Non-Intrusive Performance Characterization for Dataflow Computation.H. Tian, Q. Weng, and W. Wang. APSys, page 54-60. ACM, (2019)Semi-dynamic load balancing: efficient distributed learning in non-dedicated environments.C. Chen, Q. Weng, W. Wang, B. Li, and B. Li. SoCC, page 431-446. ACM, (2020)CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference.S. Li, H. Lu, T. Wu, M. Yu, Q. Weng, X. Chen, Y. Shan, B. Yuan, and W. Wang. CoRR, (2024)InternLM2 Technical Report.Z. Cai, M. Cao, H. Chen, K. Chen, K. Chen, X. Chen, X. Chen, Z. Chen, Z. Chen, P. Chu and 60 other author(s). CoRR, (2024)Beware of Fragmentation: Scheduling GPU-Sharing Workloads with Fragmentation Gradient Descent.Q. Weng, L. Yang, Y. Yu, W. Wang, X. Tang, G. Yang, and L. Zhang. USENIX ATC, page 995-1008. USENIX Association, (2023)Fast Distributed Deep Learning via Worker-adaptive Batch Sizing.C. Chen, Q. Weng, W. Wang, B. Li, and B. Li. SoCC, page 521. ACM, (2018)Efficient Training of Large Language Models on Distributed Infrastructures: A Survey.J. Duan, S. Zhang, Z. Wang, L. Jiang, W. Qu, Q. Hu, G. Wang, Q. Weng, H. Yan, X. Zhang and 6 other author(s). CoRR, (2024)

BibSonomy

Disambiguation of "Weng, Qizhen"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters.

Please choose a person to relate this publication to

Friedrich Weng

Jian Weng

Matthias Weng

Volker Weng

Dietmar Weng

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Weng, Qizhen"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters.

Please choose a person to relate this publication to

Friedrich Weng

Jian Weng

Matthias Weng

Volker Weng

Dietmar Weng

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters.