Author of the publication

Using shared-data localization to reduce the cost of inspector-execution in unified-parallel-C programs.

, , , , and . Parallel Comput., (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Coping with very High Latencies in Petaflop Computer Systems., , , , , and . ISHPC, volume 1615 of Lecture Notes in Computer Science, page 71-82. Springer, (1999)A Multizone Pipelined Cache for IP Routing., , , and . NETWORKING, volume 3462 of Lecture Notes in Computer Science, page 574-585. Springer, (2005)11th Compiler-Driven Performance Workshop., , , , and . CASCON, page 239-240. IBM / ACM, (2012)Memory-access-aware Safety and Profitability Analysis for Transformation of Accelerator-bound OpenMP Loops., , , , and . ACM Trans. Archit. Code Optim., 16 (3): 30:1-30:26 (2019)Vulkan Vision: Ray Tracing Workload Characterization using Automatic Graphics Instrumentation., , , and . CGO, page 137-149. IEEE, (2021)To Pack or Not to Pack: A Generalized Packing Analysis and Transformation., , , , and . CGO, page 14-27. ACM, (2023)Improving Convolution via Cache Hierarchy Tiling and Reduced Packing., , , , , and . PACT, page 538-539. ACM, (2022)A Parallel External-Memory Frontier Breadth-First Traversal Algorithm for Clusters of Workstations., , and . ICPP, page 531-538. IEEE Computer Society, (2006)Function Outlining and Partial Inlining., and . SBAC-PAD, page 101-108. IEEE Computer Society, (2005)Automatic speculative parallelization of loops using polyhedral dependence analysis., and . COSMIC@CGO, page 1. ACM, (2013)