Author of the publication

Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems.

, and . ICS, page 244-255. ACM, (2009)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Methods for High-Throughput Computation of Elementary Functions., and . PPAM (1), volume 8384 of Lecture Notes in Computer Science, page 86-95. Springer, (2013)A GPU-parallel construction of volumetric tree., , , and . IA3@SC, page 10:1-10:4. ACM, (2015)Nimble GNN Embedding with Tensor-Train Decomposition., , , , , and . KDD, page 2327-2335. ACM, (2022)Griffin: grouping suspicious memory-access patterns to improve understanding of concurrency bugs., , and . ISSTA, page 134-144. ACM, (2013)Online model swapping for architectural simulation., , , and . CF, page 102-112. ACM, (2021)Communicating Software Architecture using a Unified Single-View Visualization., , , , and . ICECCS, page 217-228. IEEE Computer Society, (2007)An input-adaptive and in-place approach to dense tensor-times-matrix multiply., , , , and . SC, page 76:1-76:12. ACM, (2015)Polyadic Regression and its Application to Chemogenomics., , , , , , and . SDM, page 72-80. SIAM, (2017)A Theoretical Framework for Algorithm-Architecture Co-design., and . IPDPS, page 791-802. IEEE Computer Society, (2013)A Communication-Avoiding 3D LU Factorization Algorithm for Sparse Matrices., , and . IPDPS, page 908-919. IEEE Computer Society, (2018)