Author of the publication

Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU.

PPoPP, page 57-68. ACM, (2013)


Other publications of authors with the same name

Combining Locality Analysis with Online Proactive Job Co-scheduling in Chip Multiprocessors. HiPEAC, volume 5952 of Lecture Notes in Computer Science, page 201-215. Springer, (2010)

Exploration of the Influence of Program Inputs on CMP Co-scheduling. Euro-Par, volume 5168 of Lecture Notes in Computer Science, page 263-273. Springer, (2008)

CoCoPIE: Making Mobile AI Sweet As PIE -Compression-Compilation Co-Design Goes a Long Way. CoRR, (2020)

TOP: A Framework for Enabling Algorithmic Optimizations for Distance-Related Problems. Proc. VLDB Endow., 8 (10): 1046-1057 (2015)

Locality approximation using time. POPL, page 55-61. ACM, (2007)

Enabling Near Real-Time NLU-Driven Natural Language Programming through Dynamic Grammar Graph-Based Translation. CGO, page 278-289. IEEE, (2022)

Temporal Exposure Reduction Protection for Persistent Memory. HPCA, page 908-924. IEEE, (2022)

Do computer programs have to be as dumb as they are?: input-centric dynamic program optimizations. VMIL@SPLASH, page 41-42. ACM, (2013)

Software-level scheduling to exploit non-uniformly shared data cache on GPGPU. MSPC@PLDI, page 10:1-10:2. ACM, (2013)

Understanding co-run performance on CPU-GPU integrated processors: observations, insights, directions. Frontiers Comput. Sci., 11 (1): 130-146 (2017)