Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

FlexBFS: a parallelism-aware implementation of breadth-first search on GPU., , , , , , , and . PPoPP, page 279-280. ACM, (2012)ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis., , , , , , , and . MICRO, page 268-282. ACM, (2023)Throughput optimization for streaming applications on CPU-FPGA heterogeneous systems., , , , and . ASP-DAC, page 488-493. IEEE, (2017)Overcoming Data Transfer Bottlenecks in FPGA-based DNN Accelerators via Layer Conscious Memory Management., , and . DAC, page 125. ACM, (2019)POSTER: RadiK: Scalable Radix Top-K Selection on GPUs., , , , , and . PPoPP, page 472-474. ACM, (2024)RadiK: Scalable and Optimized GPU-Parallel Radix Top-K Selection., , , , , and . ICS, page 537-548. ACM, (2024)Automated Systolic Array Architecture Synthesis for High Throughput CNN Inference on FPGAs., , , , , , , and . DAC, page 29:1-29:6. ACM, (2017)Distributed replay protocol for distributed uniprocessors., , , , , , and . ICS, page 3-14. ACM, (2012)TGPA: tile-grained pipeline architecture for low latency CNN inference., , , , , and . ICCAD, page 58. ACM, (2018)2022 ICCAD CAD Contest Problem C: Microarchitecture Design Space Exploration., , , , , and . ICCAD, page 95:1-95:7. ACM, (2022)