Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some of the publications already assigned to that person.

Other publications of authors with the same name

YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design. AAAI, page 955-963. AAAI Press, (2021)
Fluid: a framework for approximate concurrency via controlled dependency relaxation. PLDI, page 252-267. ACM, (2021)
Work in Progress: Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework. RTAS, page 493-496. IEEE, (2021)
You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding. ECCV (12), volume 13672 of Lecture Notes in Computer Science, page 34-51. Springer, (2022)
An efficient segmented quantization for graph neural networks. CCF Trans. High Perform. Comput., 4 (4): 461-473 (December 2022)
FlexBFS: a parallelism-aware implementation of breadth-first search on GPU. PPoPP, page 279-280. ACM, (2012)
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training. CoRR, (2022)
Controlled Kernel Launch for Dynamic Parallelism in GPUs. HPCA, page 649-660. IEEE Computer Society, (2017)
Enhancing GPU Performance via Neighboring Directory Table Based Inter-TLB Sharing. ICCD, page 146-153. IEEE, (2022)
Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs. DAC, page 1-6. IEEE, (2023)