Author of the publication

PLink: Discovering and Exploiting Locality for Accelerated Distributed Training on the public Cloud.

, , , , and . MLSys, mlsys.org, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Parameter Hub: High Performance Parameter Servers for Efficient Distributed Deep Neural Network Training., , , , and . CoRR, (2018)Scaling Distributed Machine Learning with In-Network Aggregation., , , , , , , , , and . CoRR, (2019)KVCG: a heterogeneous key-value store for skewed workloads., , , and . SYSTOR, page 5:1-5:12. ACM, (2021)ORCA: A Network and Architecture Co-design for Offloading us-scale Datacenter Applications., , , , , , , , , and . CoRR, (2022)Rambda: RDMA-driven Acceleration Framework for Memory-intensive µs-scale Datacenter Applications., , , , , , , , , and . HPCA, page 499-515. IEEE, (2023)Xenic: SmartNIC-Accelerated Distributed Transactions., , , , and . SOSP, page 740-755. ACM, (2021)The Demikernel Datapath OS Architecture for Microsecond-scale Datacenter Systems., , , , , , , , , and 4 other author(s). SOSP, page 195-211. ACM, (2021)Unlocking the Power of Inline Floating-Point Operations on Programmable Switches., , , , , , , and . NSDI, page 683-700. USENIX Association, (2022)Evaluating the Power of Flexible Packet Processing for Network Resource Allocation., , , , , and . NSDI, page 67-82. USENIX Association, (2017)SwiShmem: Distributed Shared State Abstractions for Programmable Switches., , , and . HotNets, page 160-167. ACM, (2020)