Inproceedings,

DTQAtten: Leveraging Dynamic Token-based Quantization for Efficient Attention Architecture.

DATE, pages 700-705. IEEE, (2022)
