Author of the publication

High performance comparison-based sorting algorithm on many-core GPUs.

, , , , and . IPDPS, page 1-10. IEEE, (2010)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Circuit implementation of floating point range reduction for trigonometric functions., , , , , and . ISCAS, page 3010-3013. IEEE, (2007)Software and Hardware Cooperate for 1-D FFT Algorithm Optimization on Multicore Processors., , and . CIT (1), page 86-91. IEEE Computer Society, (2009)Streamline Ring ORAM Accesses through Spatial and Temporal Optimization., , , , , , and . HPCA, page 14-25. IEEE, (2021)Simple and Efficient Heterogeneous Graph Neural Network., , , , and . AAAI, page 10816-10824. AAAI Press, (2023)A High-accurate Multi-objective Exploration Framework for Design Space of CPU., , , , , , , and . DAC, page 1-6. IEEE, (2023)Magma: A Monolithic 3D Vertical Heterogeneous ReRAM-based Main Memory Architecture., , , , and . DAC, page 115. ACM, (2019)Preliminary Investigation of Accelerating Molecular Dynamics Simulation on Godson-T Many-Core Processor., , , , , , and . Euro-Par Workshops, volume 6586 of Lecture Notes in Computer Science, page 349-356. Springer, (2010)On the properties of data migration based on topology pattern keeping on cache hierarchy., , , , and . IGSC, page 1-4. IEEE Computer Society, (2016)Accelerating Sparse Convolutional Neural Networks Based on Dataflow Architecture., , , , , , and . ICA3PP (2), volume 12453 of Lecture Notes in Computer Science, page 14-31. Springer, (2020)CTA: A Critical Task Aware Scheduling Mechanism for Dataflow Architecture., , , , , , and . ICA3PP (1), volume 12452 of Lecture Notes in Computer Science, page 61-77. Springer, (2020)