Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning Low-Rank Approximation for CNNs., , , and . CoRR, (2019)Integrated hybrid systems modeling and simulation methodology based on HDEVS formalism., , , and . SummerSim, page 52. Society for Computer Simulation International / ACM DL, (2013)Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity., , , , , , and . CoRR, (2021)Structured Compression by Weight Encryption for Unstructured Pruning and Quantization., , , , , and . CVPR, page 1906-1915. Computer Vision Foundation / IEEE, (2020)LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models., , , , , , , , , and . ICLR, OpenReview.net, (2024)Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models., , , , , and . ICLR, OpenReview.net, (2024)Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation., , , , , , , and . EMNLP (Findings), volume EMNLP 2020 of Findings of ACL, page 4812-4826. Association for Computational Linguistics, (2020)Modulating Regularization Frequency for Efficient Compression-Aware Model Training., , , , , and . CoRR, (2021)Network Pruning for Low-Rank Binary Indexing., , , , and . CoRR, (2019)Structured Compression by Unstructured Pruning for Sparse Quantized Neural Networks., , , , , and . CoRR, (2019)