Author of the publication

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.

, , , , , , , , and . ISCA, page 3:1-3:15. ACM, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Ivory: Early-Stage Design Space Exploration Tool for Integrated Voltage Regulators., , , , , , , and . DAC, page 1:1-1:6. ACM, (2017)Braum: Analyzing and Protecting Autonomous Machine Software Stack., , , , , and . ISSRE, page 85-96. IEEE, (2022)DistSim: A performance model of large-scale hybrid distributed DNN training., , , , , , , , , and 1 other author(s). CF, page 112-122. ACM, (2023)AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs., , , , , , , , , and . CF, page 52-62. ACM, (2023)Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators., , , , , , , and . IISWC, page 214-225. IEEE, (2021)Dual-side Sparse Tensor Core., , , , , and . ISCA, page 1083-1095. IEEE, (2021)SALO: an efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences., , , , , and . DAC, page 571-576. ACM, (2022)Modern Hardware Margins: CPUs, GPUs, FPGAs Recent System-Level Studies., , , , , , , and . IOLTS, page 129-134. IEEE, (2019)GPU voltage noise: Characterization and hierarchical smoothing of spatial and temporal voltage noise interference in GPU architectures., , and . HPCA, page 161-173. IEEE Computer Society, (2015)CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs., , , , , , , and . ICDCS, page 853-863. IEEE, (2020)