Author of the publication

Parallel convolution algorithm using implicit matrix multiplication on multi-core CPUs.

IJCNN, page 1-7. IEEE, (2019)

Other publications of authors with the same name

FT-topo: Architecture-Driven Folded-Triangle Partitioning for Communication-efficient Graph Processing. ICS, page 240-250. ACM, (2023)
Customizing the HPL for China accelerator. Sci. China Inf. Sci., 61 (4): 042102:1-042102:11 (2018)
OHTMA: an optimized heuristic topology-aware mapping algorithm on the Tianhe-3 exascale supercomputer prototype. Frontiers Inf. Technol. Electron. Eng., 21 (6): 939-949 (2020)
Differential Fault Analysis on SHACAL-1. FDTC, page 120-126. IEEE Computer Society, (2009)
Succinct Representations in Collaborative Filtering: A Case Study using Wavelet Tree on 1,000 Cores. PDCAT, page 427-432. IEEE, (2019)
Parallel Implementation of SHA256 on Multizone Heterogeneous Systems. ISPA/BDCloud/SocialCom/SustainCom, page 416-422. IEEE, (2023)
Parallel 3D deterministic particle transport on Intel MIC architecture. HPCS, page 186-192. IEEE, (2014)
The High Precision Real-Time Facial Landmark Detection Technique Based on ShufflenetV2. NCTCS, volume 1494 of Communications in Computer and Information Science, page 59-71. Springer, (2021)
An efficient image to column algorithm for convolutional neural networks. IJCNN, page 1-8. IEEE, (2021)
Parallel convolution algorithm using implicit matrix multiplication on multi-core CPUs. IJCNN, page 1-7. IEEE, (2019)