Author of the publication

An efficient two-dimensional blocking strategy for sparse matrix-vector multiplication on GPUs.

ICS, page 273-282. ACM, (2014)

Other publications of authors with the same name

Optimization by neural networks. ICNN, page 325-332. IEEE, (1988)

Integrating parallel file systems with object-based storage devices. SC, page 27. ACM Press, (2007)

One-to-one mapping of process graphs onto a hypercube. ICS, page 91-98. ACM, (1989)

ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation. CoRR, (2018)

Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining. Proc. VLDB Endow., 4 (4): 231-242 (2011)

Compiler-assisted detection of transient memory errors. PLDI, page 204-215. ACM, (2014)

IOOpt: automatic derivation of I/O complexity bounds for affine programs. PLDI, page 1187-1202. ACM, (2021)

Scalable parallelization for the solution of phonon Boltzmann Transport Equation. ICS, page 215-226. ACM, (2023)

Scalable heterogeneous execution of a coupled-cluster model with perturbative triples. SC, page 79. IEEE/ACM, (2020)

Memory-Constrained Communication Minimization for a Class of Array Computations. LCPC, volume 2481 of Lecture Notes in Computer Science, page 1-15. Springer, (2002)