Author of the publication

Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks.

, , , , , , and . ICLR (Poster), OpenReview.net, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Supporting handover in an IEEE 802.11p-based wireless access system., and . Vehicular Ad Hoc Networks, page 75-80. ACM, (2010)Hardware and Software Co-optimization for the Initialization Failure of the ReRAM-based Cross-bar Array., , , , and . ACM J. Emerg. Technol. Comput. Syst., 16 (4): 36:1-36:19 (2020)Optimizing Exponent Bias for Sub-8bit Floating-Point Inference of Fine-tuned Transformers., and . AICAS, page 98-101. IEEE, (2022)Lightweight Error Correction for In-Storage Acceleration of Large Language Model Inference., , , and . ICEIC, page 1-4. IEEE, (2024)Teacher Intervention: Improving Convergence of Quantization Aware Training for Ultra-Low Precision Transformers., , , , and . EACL, page 916-929. Association for Computational Linguistics, (2023)An FPGA implementation of speech recognition with weighted finite state transducers., , and . ICASSP, page 1602-1605. IEEE, (2010)Analysis of error resiliency of belief propagation in computer vision., , , and . ICASSP, page 1060-1064. IEEE, (2016)Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling., , , and . ISOCC, page 357-358. IEEE, (2021)A Compiler for Deep Neural Network Accelerators to Generate Optimized Code for a Wide Range of Data Parameters from a Hand-crafted Computation Kernel., , , , , , , , and . COOL CHIPS, page 1-3. IEEE, (2019)Distributed Space-Time Block Coding for Barrage Relay Networks., , , , and . MILCOM, page 292-297. IEEE, (2023)