Author of the publication

Fast Convolution Operations on Many-Core Architectures.

, , , and . HPCC/CSS/ICESS, page 316-323. IEEE, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Mining concise patterns on graph-connected itemsets., , , and . Neurocomputing, (2019)AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs., , , , , , , , and . CoRR, (2022)AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format., , , , , , , , , and . IEEE Trans. Parallel Distributed Syst., 34 (3): 766-780 (March 2023)Special issue of HPCChina 2023., , and . CCF Trans. High Perform. Comput., 6 (1): 1-2 (February 2024)Asynch-SGBDT: Train Stochastic Gradient Boosting Decision Trees in an Asynchronous Parallel Manner., , and . IPDPS, page 256-267. IEEE, (2023)AutoFlow: Hotspot-Aware, Dynamic Load Balancing for Distributed Stream Processing., , , and . ICA3PP (3), volume 13157 of Lecture Notes in Computer Science, page 133-151. Springer, (2021)QuantWiz: A Parallel Software Package for LC-MS-based Label-Free Protein Quantification., , , , , , and . HPCC, page 683-687. IEEE, (2009)Research on Mahalanobis Distance Algorithm Optimization Based on OpenCL., , , and . HPCC/CSS/ICESS, page 84-91. IEEE, (2014)Optimized Password Recovery for Encrypted RAR on GPUs., , and . HPCC/CSS/ICESS, page 591-598. IEEE, (2015)基于Pthreads的并行DSRC压缩算法设计与实现 (Design and Implementation of Parallel DSRC Compression Algorithm Based on Pthreads)., , , , and . 计算机科学, 42 (1): 90-91 (2015)