Author of the publication

BiQGEMM: matrix multiplication with lookup table for binary-coding-based quantized DNNs.

SC, page 95. IEEE/ACM, (2020)


Other publications of authors with the same name

Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression. ICLR, OpenReview.net, (2022)

Retraining-Based Iterative Weight Quantization for Deep Neural Networks. (2018). arXiv:1805.11233. Comment: 12 pages, 13 figures, NIPS 2018 (32nd Annual Conference on Neural Information Processing Systems) submission.

DeepTwist: Learning Model Compression via Occasional Weight Distortion. CoRR, (2018)

Modulating Regularization Frequency for Efficient Compression-Aware Model Training. CoRR, (2021)

FleXOR: Trainable Fractional Quantization. NeurIPS, (2020)

Structured Compression by Unstructured Pruning for Sparse Quantized Neural Networks. CoRR, (2019)

Network Pruning for Low-Rank Binary Indexing. CoRR, (2019)

Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization. CoRR, (2021)

AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models. EMNLP (Findings), page 3288-3305. Association for Computational Linguistics, (2022)