Author of the publication

HPCTOOLKIT: tools for performance analysis of optimized parallel programs.

Concurr. Comput. Pract. Exp., 22 (6): 685-701 (2010)


Other publications of authors with the same name

Simplifying Control Flow in Compiler-Generated Parallel Code. Int. J. Parallel Program., 26 (5): 613-638 (1998)
Space-filling Curve Generation: A Table-based Approach. AMCS, page 40-46. CSREA Press, (2005)
A tool for top-down performance analysis of GPU-accelerated applications. PPoPP, page 415-416. ACM, (2020)
Portable, MPI-interoperable coarray fortran. PPoPP, page 81-92. ACM, (2014)
Synchronization without Contention. ASPLOS, page 269-278. ACM Press, (1991). SIGARCH Computer Architecture News 19(2), SIGOPS Operating System Review 25(Special Issue April 1991), and SIGPLAN Notices 26(4).
On-the-fly detection of data races for programs with nested fork-join parallelism. SC, page 24-33. ACM, (1991)
Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles. SC, page 1-11. IEEE, (2010)
Accelerating High-Order Stencils on GPUs. PMBS@SC, page 86-108. IEEE, (2020)
Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors. ACM Trans. Comput. Syst., 9 (1): 21-65 (1991)
Tools for top-down performance analysis of GPU-accelerated applications. ICS, page 26:1-26:12. ACM, (2020)