Author of the publication

An Efficient GPU Implementation Technique for Higher-Order 3D Stencils.

, , , and . HPCC/SmartCity/DSS, page 552-561. IEEE, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Workshop 8: AsHES Accelerators and Hybrid Exascale Systems., , and . IPDPS Workshops, page 430. IEEE, (2020)Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures., , , and . HPCC/SmartCity/DSS, page 72-80. IEEE Computer Society, (2017)Enhancing the Usability and Utilization of Accelerated Architectures via Docker., , , , , , and . UCC, page 361-367. IEEE Computer Society, (2015)Thermal aware automated load balancing for HPC applications., , , , and . CLUSTER, page 1-8. IEEE Computer Society, (2013)Chai: Collaborative heterogeneous applications for integrated-architectures., , , , , , , and . ISPASS, page 43-54. IEEE Computer Society, (2017)Few-shot HPC application runtime prediction., , and . CLUSTER Workshops, page 46-47. IEEE, (2023)A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code., , and . CC, page 110-121. ACM, (2023)An Efficient GPU Implementation Technique for Higher-Order 3D Stencils., , , and . HPCC/SmartCity/DSS, page 552-561. IEEE, (2019)Petascale XCT: 3D image reconstruction with hierarchical communications on multi-GPU nodes., , , , , , , , and . SC, page 37. IEEE/ACM, (2020)TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference in Function-as-a-Service., , , , and . CLOUD, page 372-382. IEEE, (2019)