Inproceedings,

A 17-95.6 TOPS/W Deep Learning Inference Accelerator with Per-Vector Scaled 4-bit Quantization for Transformers in 5nm.

B. Keller, R. Venkatesan, S. Dai, S. Tell, B. Zimmer, W. Dally, C. Gray, and B. Khailany.
VLSI Technology and Circuits, page 16-17. IEEE, (2022)

Meta data

BibTeX key: conf/vlsit/KellerVDTZDGK22
entry type: inproceedings
booktitle: VLSI Technology and Circuits
year: 2022
pages: 16-17
publisher: IEEE
crossref: conf/vlsit/2022
ee: https://doi.org/10.1109/VLSITechnologyandCir46769.2022.9830277
isbn: 978-1-6654-9772-5
url: http://dblp.uni-trier.de/db/conf/vlsit/vlsit2022.html#KellerVDTZDGK22

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on