Author of the publication

CIGAR: Application Partitioning for a CPU/Coprocessor Architecture.

PACT, pages 317-326. IEEE Computer Society, (2007)


Other publications of authors with the same name

Throughput-oriented GPU memory allocation. PPoPP, pages 27-37. ACM, (2019)

Efficient performance evaluation of memory hierarchy for highly multithreaded graphics processors. PPoPP, pages 23-34. ACM, (2012)

Accelerating reduction and scan using tensor core units. ICS, pages 46-57. ACM, (2019)

GPU-SM: shared memory multi-GPU programming. GPGPU@PPoPP, pages 13-24. ACM, (2015)

High-Performance Reverse Time Migration on GPU. SCCC, pages 77-86. IEEE Computer Society, (2009)

Automatic execution of single-GPU computations across multiple GPUs. PACT, pages 467-468. ACM, (2014)

Comparison based sorting for systems with multiple GPUs. GPGPU@ASPLOS, pages 1-11. ACM, (2013)

Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications. IEEE Trans. Parallel Distributed Syst., 26 (5): 1405-1418 (2015)

Enabling preemptive multiprogramming on GPUs. ISCA, pages 193-204. IEEE Computer Society, (2014)

Accelerating Reduction and Scan Using Tensor Core Units. CoRR, (2018)