Author of the publication

Register-based implementation of the sparse general matrix-matrix multiplication on GPUs.

PPoPP, page 407-408. ACM, (2018)

Other publications of authors with the same name

Dawning Nebulae: A PetaFLOPS Supercomputer with a Heterogeneous Structure. J. Comput. Sci. Technol., 26 (3): 352-362 (2011)
GRE: A Graph Runtime Engine for Large-Scale Distributed Graph-Parallel Applications. CoRR, (2013)
Analysis and performance results of computing betweenness centrality on IBM Cyclops64. J. Supercomput., 56 (1): 1-24 (2011)
Extending Amdahl's law in the multicore era. SIGMETRICS Perform. Evaluation Rev., 37 (2): 24-26 (2009)
Routing and Spectrum Allocation for Time Varying Traffic by Artificial Bee Colony Algorithm in Elastic Optical Networks. ISPA/IUCC/BDCloud/SocialCom/SustainCom, page 1026-1033. IEEE, (2018)
Improvement of Performance of MegaBlast Algorithm for DNA Sequence Alignment. J. Comput. Sci. Technol., 21 (6): 973-978 (2006)
FAST: A Fast Stencil Autotuning Framework Based On An Optimal-solution Space Model. ICS, page 187-196. ACM, (2015)
A parallel dynamic programming algorithm on a multi-core architecture. SPAA, page 135-144. ACM, (2007)
Numerical assessment of flood hazard risk to people and vehicles in flash floods. Environ. Model. Softw., 26 (8): 987-998 (2011)
Understanding parallelism in graph traversal on multi-core clusters. Comput. Sci. Res. Dev., 28 (2-3): 193-201 (2013)