Author of the publication

Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers.

, , , , and . EACL, page 916-929. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Workload-aware Automatic Parallelization for Multi-GPU DNN Training., , , , , and . CoRR, (2018)Regularizing Activation Distribution for Ultra Low-bit Quantization-Aware Training of MobileNets., , and . SiPS, page 1-6. IEEE, (2022)Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks., , , and . AAAI, page 6794-6802. AAAI Press, (2021)Architecture-Aware Optimization of Layer Fusion for Latency-Optimal CNN Inference., and . AICAS, page 1-4. IEEE, (2023)CH-MAC: A Cluster-based, Hybrid TDMA MAC Protocol over Wireless Ad-hoc Networks., , , , , , and . MILCOM, page 743-748. IEEE, (2019)Understanding and Reducing Weight-Load Overhead of Systolic Deep Learning Accelerators., , , , , , , , and . ISOCC, page 413-414. IEEE, (2021)EMERALD: Characterization of emerging applications and algorithms for low-power devices., , , , , , , , , and 5 other author(s). ISPASS, page 122-123. IEEE Computer Society, (2013)FPGA acceleration of Markov Random Field TRW-S inference for stereo matching., and . MEMOCODE, page 139-142. IEEE, (2013)Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization., , , , , and . CoRR, (2023)Configurable and scalable belief propagation accelerator for computer vision., and . FPL, page 1-4. IEEE, (2016)