Author of the publication

Pumma: Parallel universal matrix multiplication algorithms on distributed memory concurrent computers.

, , and . Concurr. Pract. Exp., 6 (7): 543-570 (1994)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

ADAPT: an event-based adaptive collective communication framework., , , , , and . HPDC, page 118-130. ACM, (2018)Evaluation of Programming Models to Address Load Imbalance on Distributed Multi-Core CPUs: A Case Study with Block Low-Rank Factorization., , , , and . PAW-ATM@SC, page 25-36. IEEE, (2019)BlackjackBench: portable hardware characterization., , , , and . SIGMETRICS Perform. Evaluation Rev., 40 (2): 74-79 (2012)Standards for Graph Algorithm Primitives., , , , , , , , , and 9 other author(s). CoRR, (2014)Performance of various computers using standard linear equations software in a Fortran environment.. SIGARCH Comput. Archit. News, 11 (5): 22-27 (1983)Performance of various computers using standard linear equations software in a Fortran environment.. SIGARCH Comput. Archit. News, 13 (1): 3-11 (1985)Performance of various computers using standard linear equations software.. SIGARCH Comput. Archit. News, 18 (1): 17 (1990)Algorithm 710: FORTRAN subroutines for computing the eigenvalues and eigenvectors of a general matrix by reduction to general tridiagonal form., , and . ACM Trans. Math. Softw., 18 (4): 392-400 (1992)Reducing the amount of out-of-core data access for GPU-accelerated randomized SVD., , , , , and . Concurr. Comput. Pract. Exp., (2020)Vectorizing compilers: a test suite and results., , and . SC, page 98-105. IEEE Computer Society, (1988)