Author of the publication

Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization.

, , , , , , and . CoRR, (2023)

Other publications of authors with the same name

Optimal wavelength provisioning with fuzzy logic control for power saving in TWDM-PONs., , and . ICTC, page 926-929. IEEE, (2016)
Viterbi-Based Efficient Test Data Compression., and . IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 31 (4): 610-619 (2012)
Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models., , , , , and . CoRR, (2023)
Retraining-Based Iterative Weight Quantization for Deep Neural Networks., and . CoRR, (2018)
DeepTwist: Learning Model Compression via Occasional Weight Distortion., , and . CoRR, (2018)
A Scalable Multi-TeraOPS Deep Learning Processor Core for AI Training and Inference., , , , , , , , , and 21 other author(s). VLSI Circuits, page 35-36. IEEE, (2018)
Area Efficient ROM-Embedded SRAM Cache., and . IEEE Trans. Very Large Scale Integr. Syst., 21 (9): 1583-1595 (2013)
Embedding Read-Only Memory in Spin-Transfer Torque MRAM-Based On-Chip Caches., , , , and . IEEE Trans. Very Large Scale Integr. Syst., 24 (3): 992-1002 (2016)
Fast management of ONUs based on broadcast control channel for a 10-gigabit-capable passive optical network (XG-PON) system., , , , and . J. Commun. Networks, 15 (5): 538-542 (2013)
Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization., , , , , , and . CoRR, (2021)