Author of the publication

Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training.

, , , , , , , , , and . ICS, page 336-347. ACM, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Accelerating Lossy Compression on HPC Datasets via Partitioning Computation for Parallel Processing., , , , , , , and . HPCC/SmartCity/DSS, page 1791-1797. IEEE, (2019)A novel memory-efficient deep learning training framework via error-bounded lossy compression., , , and . PPoPP, page 485-487. ACM, (2021)cuSZ: An Efficient GPU-Based Error-Bounded Lossy Compression Framework for Scientific Data., , , , , , , , , and 1 other author(s). PACT, page 3-15. ACM, (2020)DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression., , , , , and . HPDC, page 159-170. ACM, (2019)TSM2: optimizing tall-and-skinny matrix-matrix multiplication on GPUs., , , , , , , , , and . ICS, page 106-116. ACM, (2019)H-GCN: A Graph Convolutional Network Accelerator on Versal ACAP Architecture., , , , , , and . FPL, page 200-208. IEEE, (2022)Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs., , , , , and . IPDPS, page 717-727. IEEE, (2022)Optimizing Lossy Compression Rate-Distortion from Automatic Online Selection between SZ and ZFP., , , , and . IEEE Trans. Parallel Distributed Syst., 30 (8): 1857-1871 (2019)Fixed-PSNR Lossy Compression for Scientific Data., , , , and . CLUSTER, page 314-318. IEEE Computer Society, (2018)PVII: A pedestrian-vehicle interactive and iterative prediction framework for pedestrian's trajectory., , , , , , and . Appl. Intell., 54 (20): 9881-9891 (October 2024)