Author of the publication

Enabling coordinated register allocation and thread-level parallelism optimization for GPUs.

, , , , , , and . MICRO, page 395-406. ACM, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Circuit implementation of floating point range reduction for trigonometric functions., , , , , and . ISCAS, page 3010-3013. IEEE, (2007)Software and Hardware Cooperate for 1-D FFT Algorithm Optimization on Multicore Processors., , and . CIT (1), page 86-91. IEEE Computer Society, (2009)Simple and Efficient Heterogeneous Graph Neural Network., , , , and . AAAI, page 10816-10824. AAAI Press, (2023)Streamline Ring ORAM Accesses through Spatial and Temporal Optimization., , , , , , and . HPCA, page 14-25. IEEE, (2021)Magma: A Monolithic 3D Vertical Heterogeneous ReRAM-based Main Memory Architecture., , , , and . DAC, page 115. ACM, (2019)A High-accurate Multi-objective Exploration Framework for Design Space of CPU., , , , , , , and . DAC, page 1-6. IEEE, (2023)On the properties of data migration based on topology pattern keeping on cache hierarchy., , , , and . IGSC, page 1-4. IEEE Computer Society, (2016)Instruction Vulnerability Test and Code Optimization Against DVFS Attack., , , , , , , and . ITC-Asia, page 49-54. IEEE, (2019)Preliminary Investigation of Accelerating Molecular Dynamics Simulation on Godson-T Many-Core Processor., , , , , , and . Euro-Par Workshops, volume 6586 of Lecture Notes in Computer Science, page 349-356. Springer, (2010)WEAVER: An Energy Efficient, General-Purpose Acceleration Architecture for String Operations in Big Data Applications., , , , , , and . ISPA/IUCC/BDCloud/SocialCom/SustainCom, page 47-54. IEEE, (2018)