Author of the publication

Efficient Intranode Communication in GPU-Accelerated Systems.

, , , , , , and . IPDPS Workshops, page 1838-1847. IEEE Computer Society, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Guest Editor's Introduction: P2S2: SI 2016., , and . Parallel Comput., (2019)Special issue on programming models and applications for multicores and manycores., and . Int. J. High Perform. Comput. Appl., 31 (5): 359-360 (2017)Foreword to the Special Issue of the workshop on the seventh international workshop on programming models and applications for multicores and manycores (PMAM 2016)., and . Concurr. Comput. Pract. Exp., (2017)Process-in-process: techniques for practical address-space sharing., , , , , , and . HPDC, page 131-143. ACM, (2018)Improving concurrency and asynchrony in multithreaded MPI applications using software offloading., , , , , , , and . SC, page 30:1-30:12. ACM, (2015)Semantic-based distributed i/o with the paramedic framework., , and . HPDC, page 175-184. ACM, (2008)MT-MPI: multithreaded MPI for many-core environments., , , , and . ICS, page 125-134. ACM, (2014)Work stealing for GPU-accelerated parallel programs in a global address space framework., , , , and . Concurr. Comput. Pract. Exp., 28 (13): 3637-3654 (2016)Mpi on millions of Cores., , , , , , , , and . Parallel Process. Lett., 21 (1): 45-60 (2011)Enabling communication concurrency through flexible MPI endpoints., , , , , , and . Int. J. High Perform. Comput. Appl., 28 (4): 390-405 (2014)