Author of the publication

A technique for overlapping computation and communication for block recursive algorithms.

, , , and . Concurr. Pract. Exp., 10 (2): 73-90 (1998)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Affine transformations for communication minimal parallelization and locality optimization of arbitrarily nested loop sequences, , , , , and . OSU-CISRC-5/07-TR43. The Ohio State University, (May 2007)A Tiling Perspective for Register Optimization., , , and . CoRR, (2014)On Characterizing the Data Access Complexity of Programs., , , , and . CoRR, (2014)One-to-one mapping of process graphs onto a hypercube., and . ICS, page 91-98. ACM, (1989)Integrating parallel file systems with object-based storage devices., , , , and . SC, page 27. ACM Press, (2007)Communication Efficient Matrix Multiplication on Hypercubes., and . SPAA, page 320-329. ACM, (1994)ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation., , , , , and . CoRR, (2018)A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction., , , and . Sci. Program., 4 (4): 275-289 (1995)Stencil-Aware GPU Optimization of Iterative Solvers., , , , , , , , , and . SIAM J. Sci. Comput., (2013)Analytical modeling of cache behavior for affine programs., , , and . Proc. ACM Program. Lang., 2 (POPL): 32:1-32:26 (2018)