Author of the publication

Compiler-Assisted Workload Consolidation for Efficient Dynamic Parallelism on GPU.

IPDPS, page 534-543. IEEE Computer Society, (2016)


Other publications of authors with the same name

Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA. HiPC, page 123-132. IEEE, (2018)

TactCapsNet: Tactile Capsule Network for Object Hardness Recognition. RCAR, page 1128-1133. IEEE, (2021)

Multimodal Surface Material Classification Based on Ensemble Learning with Optimized Features. HealthCom, page 1-6. IEEE, (2020)

Compiler-Assisted Workload Consolidation for Efficient Dynamic Parallelism on GPU. IPDPS, page 534-543. IEEE Computer Society, (2016)

Evaluating Thread Coarsening and Low-cost Synchronization on Intel Xeon Phi. IPDPS, page 1018-1029. IEEE, (2020)

A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms. ICPADS, page 507-516. IEEE, (2018)

Accelerating Random Forest Classification on GPU and FPGA. ICPP, page 4:1-4:11. ACM, (2022)

Nested Parallelism on GPU: Exploring Parallelization Templates for Irregular Loops and Recursive Computations. ICPP, page 979-988. IEEE Computer Society, (2015)

Exploiting Dynamic Parallelism to Efficiently Support Irregular Nested Loops on GPUs. COSMIC@CGO, page 5:1. ACM, (2015)

An Analytical Study of Recursive Tree Traversal Patterns on Multi- and Many-Core Platforms. ICPADS, page 586-595. IEEE Computer Society, (2017)