Author of the publication

Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.

, , , , , , , , , , and . SC, page 15. ACM, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

PAC: Preference-Aware Co-location Scheduling on Heterogeneous NUMA Architectures To Improve Resource Utilization., , , , , , , , , and . ICS, page 75-86. ACM, (2023)SSiMD: Supporting Six Signed Multiplications in a DSP Block for Low-Precision CNN on FPGAs., , , , , and . ICFPT, page 161-169. IEEE, (2023)FADO: Floorplan-Aware Directive Optimization Based on Synthesis and Analytical Models for High-Level Synthesis Designs on Multi-Die FPGAs., , , , , , , , and . ACM Trans. Reconfigurable Technol. Syst., 17 (3): 47:1-47:33 (September 2024)Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction., , , , , , , , , and 1 other author(s). SC, page 15. ACM, (2021)SpMMPlu: A Compiler Plug-in with Sparse IR for Efficient Sparse Matrix Multiplication., , , , , , and . DAC, page 1-6. IEEE, (2023)LAMA: Link-Aware Hybrid Management for Memory Accesses in Emerging CPU-FPGA Platforms., , , , and . DAC, page 1. ACM, (2019)FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications., , , , , , and . FPL, page 269-276. IEEE, (2020)An Optimizing Framework on MLIR for Efficient FPGA-based Accelerator Generation., , , , , and . HPCA, page 75-90. IEEE, (2024)Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture., , , , , , , , and . CoRR, (2024)Characterizing and orchestrating VM reservation in geo-distributed clouds to improve the resource efficiency., , , , , , , , and . SoCC, page 94-109. ACM, (2022)