Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

FlexBFS: a parallelism-aware implementation of breadth-first search on GPU. PPoPP, page 279-280. ACM, (2012)

ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis. MICRO, page 268-282. ACM, (2023)

Throughput optimization for streaming applications on CPU-FPGA heterogeneous systems. ASP-DAC, page 488-493. IEEE, (2017)

POSTER: RadiK: Scalable Radix Top-K Selection on GPUs. PPoPP, page 472-474. ACM, (2024)

RadiK: Scalable and Optimized GPU-Parallel Radix Top-K Selection. ICS, page 537-548. ACM, (2024)

Overcoming Data Transfer Bottlenecks in FPGA-based DNN Accelerators via Layer Conscious Memory Management. DAC, page 125. ACM, (2019)

Frequency Improvement of Systolic Array-Based CNNs on FPGAs. ISCAS, page 1-4. IEEE, (2019)

GNNear: Accelerating Full-Batch Training of Graph Neural Networks with near-Memory Processing. PACT, page 54-68. ACM, (2022)

Overcoming Data Transfer Bottlenecks in DNN Accelerators via Layer-Conscious Memory Managment. FPGA, page 120. ACM, (2019)

Generating Systolic Array Accelerators With Reusable Blocks. IEEE Micro, 40 (4): 85-92 (2020)