Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

Other publications of authors with the same name

μLayer: Low Latency On-Device Inference Using Cooperative Single-Layer Acceleration and Processor-Friendly Quantization. EuroSys, page 45:1-45:15. ACM, (2019)
Google Workloads for Consumer Devices: Mitigating Data Movement Bottlenecks. ASPLOS, page 316-331. ACM, (2018)
Occamy: Memory-efficient GPU Compiler for DNN Inference. DAC, page 1-6. IEEE, (2023)
GPUpd: a fast and scalable multi-GPU architecture using cooperative projection and distribution. MICRO, page 574-586. ACM, (2017)
SALoBa: Maximizing Data Locality and Workload Balance for Fast Sequence Alignment on GPUs. IPDPS, page 728-738. IEEE, (2022)
It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher. CVPR, page 8301-8311. IEEE, (2022)
AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping. PPoPP, page 431-444. ACM, (2024)
GPUdmm: A high-performance and memory-oblivious GPU architecture using dynamic memory management. HPCA, page 546-557. IEEE Computer Society, (2014)
Pipe-BD: Pipelined Parallel Blockwise Distillation. DATE, page 1-6. IEEE, (2023)
Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples. NeurIPS, page 14835-14847. (2021)