Author of the publication

Parallel convolution algorithm using implicit matrix multiplication on multi-core CPUs.

, , , and . IJCNN, page 1-7. IEEE, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

TAMM: A New Topology-Aware Mapping Method for Parallel Applications on the Tianhe-2A Supercomputer., , , , , and . ICA3PP (1), volume 11334 of Lecture Notes in Computer Science, page 242-256. Springer, (2018)Stochastic Event-Triggered Cubature Kalman Filter for Power System Dynamic State Estimation., , , , , , , , and . IEEE Trans. Circuits Syst. II Express Briefs, 66-II (9): 1552-1556 (2019)Optimizing Irregular-Shaped Matrix-Matrix Multiplication on Multi-Core DSPs., , , , , and . CLUSTER, page 451-461. IEEE, (2022)Optimizing Yinyang K-Means Algorithm on ARMv8 Many-Core CPUs., , , , and . ICA3PP, volume 13777 of Lecture Notes in Computer Science, page 676-690. Springer, (2022)A Low-Latency Successive Cancellation Hybrid Decoder for Convolutional Polar Codes., , , , , , and . ICASSP, page 5105-5109. IEEE, (2020)A Two-stage Personalized Recommendation in CTS Using Graph-Based Clustering., , , and . ICEE, page 3629-3632. IEEE Computer Society, (2010)Optimizing FFT-Based Convolution on ARMv8 Multi-core CPUs., , , , , and . Euro-Par, volume 12247 of Lecture Notes in Computer Science, page 248-262. Springer, (2020)DeepEnhancer: Temporally Consistent Focal Transformer for Comprehensive Video Enhancement., , , , , and . ICMR, page 969-977. ACM, (2024)Parallel 3D deterministic particle transport on Intel MIC architecture., , , , , and . HPCS, page 186-192. IEEE, (2014)An efficient image to column algorithm for convolutional neural networks., , , , , , , , and . IJCNN, page 1-8. IEEE, (2021)