Author of the publication

Optimizing the Performance of Streaming Numerical Kernels on the IBM Blue Gene/P PowerPC 450 Processor

, , , , and . CoRR, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators., , , and . VECPAR, volume 7851 of Lecture Notes in Computer Science, page 72-79. Springer, (2012)Parallelization of an Object-Oriented Unstructured Aeroacoustics Solver., , , and . PPSC, SIAM, (1999)Redesigning Triangular Dense Matrix Computations on GPUs., , and . Euro-Par, volume 9833 of Lecture Notes in Computer Science, page 477-489. Springer, (2016)Exploiting Data Sparsity for Large-Scale Matrix Computations., , , , , and . Euro-Par, volume 11014 of Lecture Notes in Computer Science, page 721-734. Springer, (2018)Unstructured computational aerodynamics on many integrated core architecture., , and . Parallel Comput., (2016)Multidimensional Intratile Parallelization for Memory-Starved Stencil Computations., , , and . ACM Trans. Parallel Comput., 4 (3): 12:1-12:32 (2018)Batched Triangular Dense Linear Algebra Kernels for Very Small Matrix Sizes on GPUs., , and . ACM Trans. Math. Softw., 45 (2): 15:1-15:28 (2019)A QDWH-based SVD Software Framework on Distributed-memory Manycore Systems., , , and . ACM Trans. Math. Softw., 45 (2): 18:1-18:21 (2019)Hierarchical-block conditioning approximations for high-dimensional multivariate normal probabilities., , , and . Stat. Comput., 29 (3): 585-598 (2019)A Quasi-algebraic Multigrid Approach to Fracture Problems Based on Extended Finite Elements., , , , and . SIAM J. Sci. Comput., (2012)