Author of the publication

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training.

, , , , , , , , and . ICCD, page 738-745. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Ivory: Early-Stage Design Space Exploration Tool for Integrated Voltage Regulators., , , , , , , and . DAC, page 1:1-1:6. ACM, (2017)Braum: Analyzing and Protecting Autonomous Machine Software Stack., , , , , and . ISSRE, page 85-96. IEEE, (2022)CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs., , , , , , , and . ICDCS, page 853-863. IEEE, (2020)Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators., , , , , , , and . IISWC, page 214-225. IEEE, (2021)SALO: an efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences., , , , , and . DAC, page 571-576. ACM, (2022)Dual-side Sparse Tensor Core., , , , , and . ISCA, page 1083-1095. IEEE, (2021)Modern Hardware Margins: CPUs, GPUs, FPGAs Recent System-Level Studies., , , , , , , and . IOLTS, page 129-134. IEEE, (2019)GPU voltage noise: Characterization and hierarchical smoothing of spatial and temporal voltage noise interference in GPU architectures., , and . HPCA, page 161-173. IEEE Computer Society, (2015)Transkimmer: Transformer Learns to Layer-wise Skim., , , , and . ACL (1), page 7275-7286. Association for Computational Linguistics, (2022)GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching., , , , , , , , , and 1 other author(s). ASPLOS (2), page 450-466. ACM, (2024)