Author of the publication

Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning.

, , , , , , , , and . OSDI, page 681-699. USENIX Association, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

CoopStreaming: A Novel Peer-to-Peer System for Fast Live Media Streaming., , , and . WAIM, volume 3739 of Lecture Notes in Computer Science, page 882-887. Springer, (2005)Accelerating GNN training with locality-aware partial execution., , , , , , , and . APSys, page 34-41. ACM, (2021)Towards Efficient Large-Scale Graph Neural Network Computing., , , , , , and . CoRR, (2018)BitNet: Scaling 1-bit Transformers for Large Language Models., , , , , , , , , and . CoRR, (2023)Garaph: Efficient GPU-accelerated Graph Processing on a Single Machine with Balanced Replication., , , , and . USENIX Annual Technical Conference, page 195-207. USENIX Association, (2017)PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation., , , , , , , , , and 1 other author(s). SOSP, page 331-347. ACM, (2023)ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores., , , , , , , , , and . PPoPP, page 333-347. ACM, (2024)NeuGraph: Parallel Deep Neural Network Computation on Large Graphs., , , , , , and . USENIX Annual Technical Conference, page 443-458. USENIX Association, (2019)CuWide: Towards Efficient Flow-based Training for Sparse Wide Models on GPUs (Extended Abstract)., , , , , , and . ICDE, page 2330-2331. IEEE, (2021)Dense-to-Sparse Gate for Mixture-of-Experts., , , , , , , , and . CoRR, (2021)