Author of the publication

Adaptive Auto-Tuning Framework for Global Exploration of Stencil Optimization on GPUs.

, , , , , and . IEEE Trans. Parallel Distributed Syst., 35 (1): 20-33 (January 2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

L-DAG: Enabling Loopy Workflow in Scientific Application with Automatic DAG Transformation., , , and . DASC/PiCom/DataCom/CyberSciTech, page 946-953. IEEE, (2019)SparkOT: Diagnosing Operation Level Inefficiency in Spark., , , , and . HPCC/SmartCity/DSS, page 692-699. IEEE, (2018)Improving the Parallelism of CESM on GPU., , , , , , , and . ICA3PP (2), volume 11945 of Lecture Notes in Computer Science, page 11-18. Springer, (2019)BigRoots: An Effective Approach for Root-cause Analysis of Stragglers in Big Data System., , , , and . CoRR, (2018)An optimized tensor completion library for multiple GPUs., , , , , and . ICS, page 417-430. ACM, (2021)Toward accelerated stencil computation by adapting tensor core unit on GPU., , , , , , and . ICS, page 28:1-28:12. ACM, (2022)Efficient detection of silent data corruption in HPC applications with synchronization-free message verification., , , and . J. Supercomput., 78 (1): 1381-1408 (2022)Modeling Power Consumption of The Code Execution Using Performance Counters Statistics., , , and . PDCAT, page 381-385. IEEE, (2019)SMGuard: A Flexible and Fine-Grained Resource Management Framework for GPUs., , , , , , and . IEEE Trans. Parallel Distributed Syst., 29 (12): 2849-2862 (2018)Performance Evaluation and Analysis of Linear Algebra Kernels in the Prototype Tianhe-3 Cluster., , , , and . SCFA, volume 11416 of Lecture Notes in Computer Science, page 86-105. Springer, (2019)