Author of the publication

Optimizing Attention by Exploiting Data Reuse on ARM Multi-core CPUs.

ICS, page 137-149. ACM, (2024)


Other publications of authors with the same name

CRSP: Network Congestion Control through Credit Reservation. ISPA/IUCC/BDCloud/SocialCom/SustainCom, page 692-699. IEEE, (2018)
FLYER: Fine-grained landmark based greedy geographic routing under uncertain locations. ICC, page 166-171. IEEE, (2014)
A survey of machine learning for Network-on-Chips. J. Parallel Distributed Comput., (April 2024)
CIB-HIER: Centralized Input Buffer Design in Hierarchical High-radix Routers. ACM Trans. Archit. Code Optim., 18 (4): 50:1-50:21 (2021)
MPICC: Multi-Path INT-Based Congestion Control in Datacenter Networks. NPC, volume 13152 of Lecture Notes in Computer Science, page 256-268. Springer, (2021)
NoC power optimization using combined routing algorithms. ICIS, page 299-304. IEEE Computer Society, (2017)
STEGNN: Spatial-Temporal Embedding Graph Neural Networks for Road Network Forecasting. ICPADS, page 826-834. IEEE, (2022)
Memory-aware Optimization for Sequences of Sparse Matrix-Vector Multiplications. IPDPS, page 379-389. IEEE, (2023)
FastHorovod: Expediting Parallel Message-Passing Schedule for Distributed DNN Training. ISCC, page 1-7. IEEE, (2021)
DNNEmu: A Lightweight Performance Emulator for Distributed DNN Training. ICA3PP, volume 13777 of Lecture Notes in Computer Science, page 722-736. Springer, (2022)