Author of the publication

ZeroQuant-HERO: Hardware-Enhanced Robust Optimized Post-Training Quantization Framework for W8A8 Transformers.

, , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.


Other publications of authors with the same name

Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT., , , , , , , and . AAAI, page 8815-8821. AAAI Press, (2020)

ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning., , , , , and . AAAI, page 10665-10673. AAAI Press, (2021)

ZeroQuant-HERO: Hardware-Enhanced Robust Optimized Post-Training Quantization Framework for W8A8 Transformers., , , , , and . CoRR, (2023)

Selective Guidance: Are All the Denoising Steps of Guided Diffusion Important?, , and . CoRR, (2023)

Residual Networks as Nonlinear Systems: Stability Analysis using Linearization., , , and . CoRR, (2019)

PyHessian: Neural Networks Through the Lens of the Hessian., , , and . CoRR, (2019)

HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks., , , , , and . NeurIPS, (2020)

HAWQ-V3: Dyadic Neural Network Quantization., , , , , , , , , and 1 other author(s). ICML, volume 139 of Proceedings of Machine Learning Research, page 11875-11886. PMLR, (2021)

PowerNorm: Rethinking Batch Normalization in Transformers., , , , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 8741-8751. PMLR, (2020)

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training., , , , , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 1803-1813. PMLR, (2021)