Author of the publication

Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures.

, , , , , , , , , , and . ICPP, page 38:1-38:12. ACM, (2020)


Other publications of authors with the same name

Efficient detection of silent data corruption in HPC applications with synchronization-free message verification., , , and . J. Supercomput., 78 (1): 1381-1408 (2022)

Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP., , , , , , , and . IEEE Trans. Computers, 71 (8): 1968-1981 (2022)

SMGuard: A Flexible and Fine-Grained Resource Management Framework for GPUs., , , , , , and . IEEE Trans. Parallel Distributed Syst., 29 (12): 2849-2862 (2018)

QoS-aware dynamic resource allocation with improved utilization and energy efficiency on GPU., , , , , and . Parallel Comput., (2022)

HitAnomaly: Hierarchical Transformers for Anomaly Detection in System Log., , , , , , and . IEEE Trans. Netw. Serv. Manag., 17 (4): 2064-2076 (2020)

Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding., , , , , , , , , and 3 other author(s). CoRR, (2024)

Accelerating Sparse Cholesky Factorization on Sunway Manycore Architecture., , , , , , and . IEEE Trans. Parallel Distributed Syst., 31 (7): 1636-1650 (2020)

BigRoots: An Effective Approach for Root-cause Analysis of Stragglers in Big Data System., , , , and . CoRR, (2018)

Modeling Power Consumption of The Code Execution Using Performance Counters Statistics., , , and . PDCAT, page 381-385. IEEE, (2019)

Improving the Parallelism of CESM on GPU., , , , , , , and . ICA3PP (2), volume 11945 of Lecture Notes in Computer Science, page 11-18. Springer, (2019)