Author of the publication

swSpAMM: optimizing large-scale sparse approximate matrix multiplication on Sunway Taihulight.

Frontiers Comput. Sci., 17 (4): 174104 (2023)


Other publications of authors with the same name

MapReduce Workload Modeling with Statistical Approach. J. Grid Comput., 10 (2): 279-310 (2012)
CoFB: latency-constrained co-scheduling of flows and batches for deep learning inference service on the CPU-GPU system. J. Supercomput., 79 (13): 14172-14199 (September 2023)
Adaptive Auto-Tuning Framework for Global Exploration of Stencil Optimization on GPUs. IEEE Trans. Parallel Distributed Syst., 35 (1): 20-33 (January 2024)
Partition-Based Hardware Transactional Memory for Many-Core Processors. NPC, volume 8147 of Lecture Notes in Computer Science, pages 308-321. Springer (2013)
A Fair Thread-Aware Memory Scheduling Algorithm for Chip Multiprocessor. ICA3PP (1), volume 6081 of Lecture Notes in Computer Science, pages 174-185. Springer (2010)
A Novel Scheme for High Performance Finite-Difference Time-Domain (FDTD) Computations Based on GPU. ICA3PP (1), volume 6081 of Lecture Notes in Computer Science, pages 441-453. Springer (2010)
Accelerating tile low-rank GEMM on sunway architecture: POSTER. CF, pages 295-297. ACM (2019)
Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU. IPDPS, pages 156-166. IEEE (2023)
PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference. CCGRID, pages 334-343. IEEE (2021)
POIGEM: A Programming-Oriented Instruction Level GPU Energy Model for CUDA Program. ICA3PP (1), volume 8285 of Lecture Notes in Computer Science, pages 129-142. Springer (2013)