HEAT: A Highly Efficient and Affordable Training System for Collaborative Filtering Based Recommendation on CPUs.

ICS, page 324-335. ACM, (2023)

Other publications of authors with the same name

Prediction and Predictability for Search Query Acceleration. TWEB, 10 (3): 19:1-19:28 (2016)
Workload analysis and caching strategies for search advertising systems. SoCC, page 170-180. ACM, (2017)
G-SPARQL: a hybrid engine for querying large attributed graphs. CIKM, page 335-344. ACM, (2012)
Accelerating Large Scale Deep Learning Inference through DeepCPU at Microsoft. OpML, page 5-7. USENIX Association, (2019)
BATS: budget-constrained autoscaling for cloud performance optimization. SIGMETRICS, page 563-564. ACM, (2014)
Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination. SIGMOD Conference, page 2539-2554. ACM, (2020)
Position Paper: Embracing Heterogeneity - Improving Energy Efficiency for Interactive Services on Heterogeneous Data Center Hardware. AI for Data Center Management and Cloud Computing, volume WS-11-08 of AAAI Technical Report, AAAI, (2011)
DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing. AAAI, page 18490-18498. AAAI Press, (2024)
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers. NeurIPS, (2022)
Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping. NeurIPS, (2020)