Author of the publication

Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design.

, , , , , and . HPCA, page 542-555. IEEE, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Speeding up Collective Communications Through Inter-GPU Re-Routing., , , and . IEEE Comput. Archit. Lett., 18 (2): 128-131 (2019)OO-VR: NUMA Friendly Object-Oriented VR Rendering Framework For Future NUMA-Based Multi-GPU Systems., , , and . CoRR, (2020)An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning., , , , , , , , , and 1 other author(s). CoRR, (2020)An adaptive cross-architecture combination method for graph traversal., , and . ICS, page 169. ACM, (2014)HEAT: A Highly Efficient and Affordable Training System for Collaborative Filtering Based Recommendation on CPUs., , , , , , , , and . ICS, page 324-335. ACM, (2023)OO-VR: NUMA friendly object-oriented VR rendering framework for future NUMA-based multi-GPU systems., , , and . ISCA, page 53-65. ACM, (2019)Post0-VR: Enabling Universal Realistic Rendering for Modern VR via Exploiting Architectural Similarity and Data Sharing., , , and . HPCA, page 390-402. IEEE, (2023)Lightweight detection of cache conflicts., , , and . CGO, page 200-213. ACM, (2018)LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism., , , , , , and . ASAP, page 9-16. IEEE, (2019)G-Sparse: Compiler-Driven Acceleration for Generalized Sparse Computation for Graph Neural Networks on Modern GPUs., , , , , , , , and . PACT, page 137-149. IEEE, (2023)