Author of the publication

A Data-Loader Tunable Knob to Shorten GPU Idleness for Distributed Deep Learning.

, , , and . CLOUD, page 449-458. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning., , , , , , , , , and 1 other author(s). CoRR, (2020)A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation., , , , and . CoRR, (2019)Memristor crossbar-based ultra-efficient next-generation baseband processors., , , , , , , and . MWSCAS, page 1121-1124. IEEE, (2017)ESRU: Extremely Low-Bit and Hardware-Efficient Stochastic Rounding Unit Design for Low-Bit DNN Training., , , , , , , , , and 2 other author(s). DATE, page 1-6. IEEE, (2023)Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization., , , , , , , , , and 2 other author(s). FPL, page 109-116. IEEE, (2022)YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design., , , , , , , and . AAAI, page 955-963. AAAI Press, (2021)Towards Real-Time Segmentation on the Edge., , , , , , , , , and 2 other author(s). AAAI, page 1468-1476. AAAI Press, (2023)Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training., , , , , , , , , and 5 other author(s). AAAI, page 8360-8368. AAAI Press, (2023)Work in Progress: Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework., , , , , , , , , and 2 other author(s). RTAS, page 493-496. IEEE, (2021)You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding., , , , , , , , , and 4 other author(s). ECCV (12), volume 13672 of Lecture Notes in Computer Science, page 34-51. Springer, (2022)