Author of the publication

A Case Study of Communication Optimizations on 3D Mesh Interconnects.

, , and . Euro-Par, volume 5704 of Lecture Notes in Computer Science, page 1015-1028. Springer, (2009)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization., , and . CoRR, (2023)Evaluation of an interference-free node allocation policy on fat-tree clusters., , , and . SC, page 26:1-26:13. IEEE / ACM, (2018)A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training., , , , , and . ICS, page 203-214. ACM, (2023)Improving communication performance in dense linear algebra via topology aware collectives., , and . SC, page 77:1-77:11. ACM, (2011)Topology Aware Task Mapping.. Encyclopedia of Parallel Computing, Springer, (2011)Optimizing computation-communication overlap in asynchronous task-based programs: poster., , , , , , , and . PPoPP, page 415-416. ACM, (2019)Massively Parallel Simulations of Spread of Infectious Diseases over Realistic Social Networks., , , , , , , and . CCGrid, page 689-694. IEEE Computer Society / ACM, (2017)Massively Parallel First-Principles Simulation of Electron Dynamics in Materials., , , , , and . IPDPS, page 832-841. IEEE Computer Society, (2016)Data-Driven Performance Modeling of Linear Solvers for Sparse Matrices., , , , and . PMBS@SC, page 32-42. IEEE Computer Society, (2016)Predicting Cross-Architecture Performance of Parallel Programs., , , , , , and . IPDPS, page 570-581. IEEE, (2024)