Author of the publication

PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.

, , , , , , , , , , and . SOSP, page 331-347. ACM, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices., , , , , , and . MobiSys, page 81-93. ACM, (2021)Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction., , , , , , , , , and 1 other author(s). SC, page 15. ACM, (2021)Online Video Streaming Super-Resolution with Adaptive Look-Up Table Fusion., , , , , , , , , and 2 other author(s). CoRR, (2023)URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds., , , , , , , and . ICPP, page 73:1-73:11. ACM, (2020)CLIBE: Precise Cluster-Level I/O Bandwidth Enforcement in Distributed File System., , , and . HPCC/SmartCity/DSS, page 124-131. IEEE, (2018)PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation., , , , , , , , , and 1 other author(s). SOSP, page 331-347. ACM, (2023)QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services., , , , , , and . SC, page 69:1-69:14. IEEE, (2022)Efficient GPU Kernels for N: M-Sparse Weights in Deep Learning., , , , , , , , , and 1 other author(s). MLSys, mlsys.org, (2023)SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation., , , , , , , , , and . CoRR, (2023)FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion., , , , , , , , , and 2 other author(s). CoRR, (2024)