Author of the publication

Improving the Parallelism of CESM on GPU.

, , , , , , , and . ICA3PP (2), volume 11945 of Lecture Notes in Computer Science, page 11-18. Springer, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Adaptive Sparse Deep Neural Network Inference on Resource-Constrained Cost-Efficient GPUs., , , , , and . HPEC, page 1-7. IEEE, (2023)Improving the Parallelism of CESM on GPU., , , , , , , and . ICA3PP (2), volume 11945 of Lecture Notes in Computer Science, page 11-18. Springer, (2019)An optimized tensor completion library for multiple GPUs., , , , , and . ICS, page 417-430. ACM, (2021)csTuner: Scalable Auto-tuning Framework for Complex Stencil Computation on GPUs., , , , , , , and . CLUSTER, page 192-203. IEEE, (2021)PriPro: Towards Effective Privacy Protection on Edge-Cloud System running DNN Inference., , , , , , , and . CCGRID, page 334-343. IEEE, (2021)SPMGAE: Self-purified masked graph autoencoders release robust expression power., , , , , and . Neurocomputing, (2025)SpTFS: sparse tensor format selection for MTTKRP via deep learning., , , , , , , and . SC, page 18. IEEE/ACM, (2020)swGBDT: Efficient Gradient Boosted Decision Tree on Sunway Many-Core Processor., , , , , , and . SCFA, volume 12082 of Lecture Notes in Computer Science, page 67-86. Springer, (2020)swCPD: Optimizing Canonical Polyadic Decomposition on Sunway Manycore Architecture., , , , , and . HPCC/SmartCity/DSS, page 1320-1327. IEEE, (2019)Towards efficient canonical polyadic decomposition on sunway many-core processor., , , , , , , , and . Inf. Sci., (2021)