Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware.

IISWC, pages 140-153. IEEE, (2023)


Other publications of authors with the same name

Chasing Away RAts: Semantics and Evaluation for Relaxed Atomics on Heterogeneous Systems. ISCA, pages 161-174. ACM, (2017)

Efficient GPU synchronization without scopes: saying no to complex consistency models. MICRO, pages 647-659. ACM, (2015)

SeqPoint: Identifying Representative Iterations of Sequence-Based Neural Networks. ISPASS, pages 69-80. IEEE, (2020)

Spandex: A Flexible Interface for Efficient Heterogeneous Coherence. ISCA, pages 261-274. IEEE Computer Society, (2018)

HPVM: heterogeneous parallel virtual machine. PPoPP, pages 68-80. ACM, (2018)

Independent Forward Progress of Work-groups. ISCA, pages 1022-1035. IEEE, (2020)

PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters. CoRR, (2024)

Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. CoRR, (2018)

A Case for Fine-grain Coherence Specialization in Heterogeneous Systems. ACM Trans. Archit. Code Optim., 19 (3): 41:1-41:26 (2022)

Efficient coherence and consistency for specialized memory hierarchies. University of Illinois Urbana-Champaign, USA, (2017)