Author of the publication

DLUX: A LUT-Based Near-Bank Accelerator for Data Center Deep Learning Training Workloads.

, , , , , , and . IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 40 (8): 1586-1599 (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

EPQuant: A Graph Neural Network compression approach based on product quantization., , , , , , and . Neurocomputing, (2022)STAR: Synthesis of Stateful Logic in RRAM Targeting High Area Utilization., , , , , , , and . IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 40 (5): 864-877 (2021)NPS: A Framework for Accurate Program Sampling Using Graph Neural Network., , , , , , , , , and . CoRR, (2023)TT-GNN: Efficient On-Chip Graph Neural Network Training via Embedding Reformation and Hardware Optimization., , , , and . MICRO, page 452-464. ACM, (2023)DRAM-Level Prefetching for Fully-Buffered DIMM: Design, Performance and Power Saving., , , , and . ISPASS, page 94-104. IEEE Computer Society, (2007)Power and Performance Trade-Offs in Contemporary DRAM System Designs for Multicore Processors., and . IEEE Trans. Computers, 59 (8): 1033-1046 (2010)LazyPIM: Efficient Support for Cache Coherence in Processing-in-Memory Architectures., , , , , , , , , and . CoRR, (2017)DRAF: A Low-Power DRAM-Based Reconfigurable Acceleration Fabric., , , , , , and . IEEE Micro, 37 (3): 70-78 (2017)Accelerating CPU-based Sparse General Matrix Multiplication with Binary Row Merging., , , , , and . CoRR, (2022)Predicting the Output Structure of Sparse Matrix Multiplication with Sampled Compression Ratio., , , , , , , , , and . ICPADS, page 483-490. IEEE, (2022)