Author of the publication

Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training.

, , , , , , , , , and . ICS, page 336-347. ACM, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training., , , , , , , , , and . ICS, page 336-347. ACM, (2023)Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale., , , , , , , , , and 12 other author(s). WWW (Companion Volume), page 73-82. ACM, (2024)A quantitative analysis on microarchitectures of modern CPU-FPGA platforms., , , , , and . DAC, page 109:1-109:6. ACM, (2016)MTIA: First Generation Silicon Targeting Meta's Recommendation Systems., , , , , , , , , and 46 other author(s). ISCA, page 80:1-80:13. ACM, (2023)PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel., , , , , , , , , and 8 other author(s). Proc. VLDB Endow., 16 (12): 3848-3860 (2023)Architectural Techniques to Enhance the Efficiency of Accelerator-Centric Architectures.. University of California, Los Angeles, USA, (2018)base-search.net (ftcdlib:qt8p2116wz).The Llama 3 Herd of Models, , , , , , , , , and 523 other author(s). (2024)Wukong: Towards a Scaling Law for Large-Scale Recommendation., , , , , , , , , and 5 other author(s). CoRR, (2024)Software-hardware co-design for fast and scalable training of deep learning recommendation models., , , , , , , , , and 43 other author(s). ISCA, page 993-1011. ACM, (2022)Supporting Address Translation for Accelerator-Centric Architectures., , , and . HPCA, page 37-48. IEEE Computer Society, (2017)