Author of the publication

An efficient uncertain graph processing framework for heterogeneous architectures.

, , , , , , , and . PPoPP, page 477-479. ACM, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

ISM2: Optimizing Irregular-Shaped Matrix-Matrix Multiplication on GPUs., , , , and . CoRR, (2020)Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving., , , , , and . MICRO, page 885-897. ACM, (2021)TEA: A General-Purpose Temporal Graph Random Walk Engine., , , , , , , , , and . EuroSys, page 182-198. ACM, (2023)Q-VR: system-level design for future mobile collaborative virtual reality., , , , , and . ASPLOS, page 587-599. ACM, (2021)AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures., , , , , , , , , and 2 other author(s). ASPLOS, page 359-373. ACM, (2022)SMT-Aware Instantaneous Footprint Optimization., , and . HPDC, page 267-279. ACM, (2016)Brief Industry Paper: The Necessity of Adaptive Data Fusion in Infrastructure-Augmented Autonomous Driving System., , , , , , , , , and . RTAS, page 293-296. IEEE, (2022)ClickTrain: efficient and accurate end-to-end deep learning training via fine-grained architecture-preserving pruning., , , , , , , , , and 1 other author(s). ICS, page 266-278. ACM, (2021)Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs., , , , , , , , , and 3 other author(s). USENIX ATC, page 699-713. USENIX Association, (2024)Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity., , , , , , , , and . CoRR, (2023)