Author of the publication

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

(2017). arXiv:1712.01887. Comment: We find that 99.9% of the gradient exchange in distributed SGD is redundant; we reduce the communication bandwidth by two orders of magnitude without losing accuracy.

Other publications of authors with the same name

Large Scale Analysis of Chinese Mobile Query Behavior. IKE, pages 258-264. CSREA Press, (2009)
Query Recommendation and Its Usefulness Evaluation on Mobile Search Engine. SMC, pages 1292-1297. IEEE, (2009)
Query-biased Near Duplicate Web Document Detecting: Effective, Efficient and Customizable. DMIN, pages 654-659. CSREA Press, (2008)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. (2015). arXiv:1510.00149. Comment: Published as a conference paper at ICLR 2016 (oral).
Once for All: Train One Network and Specialize it for Efficient Deployment. (2019). arXiv:1908.09791.
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models. (2023)
The Detection of the Pipe Crack Utilizing the Operational Modal Strain Identified from Fiber Bragg Grating. Sensors, 19 (11): 2556 (2019)
Nanoarchitectonics for Heterogeneous Integrated Nanosystems. Proc. IEEE, 96 (2): 212-229 (2008)
Design of a network-based mobile gait rehabilitation system. ROBIO, pages 1773-1778. IEEE, (2012)
Improved Dynamic Graph Learning through Fault-Tolerant Sparsification. ICML, volume 97 of Proceedings of Machine Learning Research, pages 7624-7633. PMLR, (2019)