Author of the publication

Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization.

, , , , , , , and . ICCAD, page 1-9. IEEE, (2021)


Other publications of authors with the same name

Accommodating Transformer onto FPGA: Coupling the Balanced Model Compression and FPGA-Implementation Optimization., , , , , and . ACM Great Lakes Symposium on VLSI, page 163-168. ACM, (2021)

Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization., , , , , , , and . ICCAD, page 1-9. IEEE, (2021)

RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference., , , , , , , , , and 4 other author(s). CoRR, (2023)

MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training., , , , , , , , and . ASPLOS (2), page 683-698. ACM, (2024)

AQ2PNN: Enabling Two-party Privacy-Preserving Deep Neural Network Inference with Adaptive Quantization., , , , , , , , and . MICRO, page 628-640. ACM, (2023)

Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search., , , , , , , , , and 2 other author(s). CoRR, (2021)

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads., , , , , , and . ICML, OpenReview.net, (2024)

An Automatic and Efficient BERT Pruning for Edge AI Systems., , , , , , , and . ISQED, page 1-6. IEEE, (2022)

CoDG-ReRAM: An Algorithm-Hardware Co-design to Accelerate Semi-Structured GNNs on ReRAM., , , , , , , , , and 1 other author(s). ICCD, page 280-289. IEEE, (2022)

Binary Complex Neural Network Acceleration on FPGA: (Invited Paper)., , , , , , , , , and 2 other author(s). ASAP, page 85-92. IEEE, (2021)