Author of the publication

Techniques for enabling GPU code generation of low-level optimizations and dynamic parallelism from high-level abstractions

. University of Illinois Urbana-Champaign, USA, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures., , , and . HPCC/SmartCity/DSS, page 72-80. IEEE Computer Society, (2017)Workshop 8: AsHES Accelerators and Hybrid Exascale Systems., , and . IPDPS Workshops, page 430. IEEE, (2020)Thermal aware automated load balancing for HPC applications., , , , and . CLUSTER, page 1-8. IEEE Computer Society, (2013)Enhancing the Usability and Utilization of Accelerated Architectures via Docker., , , , , , and . UCC, page 361-367. IEEE Computer Society, (2015)Chai: Collaborative heterogeneous applications for integrated-architectures., , , , , , , and . ISPASS, page 43-54. IEEE Computer Society, (2017)Few-shot HPC application runtime prediction., , and . CLUSTER Workshops, page 46-47. IEEE, (2023)TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep LearningInference in Function as a Service Environments., , , , and . CoRR, (2018)Techniques for enabling GPU code generation of low-level optimizations and dynamic parallelism from high-level abstractions. University of Illinois Urbana-Champaign, USA, (2020)JACC: An OpenACC Runtime Framework with Kernel-Level and Multi-GPU Parallelization., , and . HiPC, page 182-191. IEEE, (2021)Towards OmpSs-2 and OpenACC interoperation., , , , , and . PPoPP, page 433-434. ACM, (2022)