Author of the publication

TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep LearningInference in Function as a Service Environments.

, , , , and . CoRR, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs., , , and . CoRR, (2019)Triolet: a programming system that unifies algorithmic skeleton interfaces for high-performance cluster computing., , , and . PPoPP, page 247-258. ACM, (2014)Enhancing the Usability and Utilization of Accelerated Architectures via Docker., , , , , , and . UCC, page 361-367. IEEE Computer Society, (2015)RAI: A Scalable Project Submission System for Parallel Programming Courses., , , and . IPDPS Workshops, page 315-322. IEEE Computer Society, (2017)Compiling high-level scripting languages to performant code. University of Illinois Urbana-Champaign, USA, (2020)Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles., , , , , , , and . PACT, page 345-355. IEEE, (2021)Evaluating Characteristics of CUDA Communication Primitives on High-Bandwidth Interconnects., , , , , , and . ICPE, page 209-218. ACM, (2019)WebGPU: A Scalable Online Development Platform for GPU Programming Courses., , and . IPDPS Workshops, page 942-949. IEEE Computer Society, (2016)Across-Stack Profiling and Characterization of Machine Learning Models on GPUs., , , , , and . CoRR, (2019)Challenges and Pitfalls of Reproducing Machine Learning Artifacts., , , and . CoRR, (2019)