Author of the publication

Near-Data Processing in Memory Expander for DNN Acceleration on GPUs.

, , , , , , , , and . IEEE Comput. Archit. Lett., 20 (2): 171-174 (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Automatic Copying of Pointer-Based Data Structures., , and . LCPC, volume 10136 of Lecture Notes in Computer Science, page 265-281. Springer, (2016)DeNovo: Rethinking the Memory Hierarchy for Disciplined Parallelism., , , , , , , , and . PACT, page 155-166. IEEE Computer Society, (2011)A type and effect system for deterministic parallel Java., , , , , , , , , and . OOPSLA, page 97-116. ACM, (2009)POSTER: CogR: Exploiting Program Structures for Machine-Learning Based Runtime Solutions., , , and . PACT, page 485-486. IEEE, (2019)DeNovoND: Efficient Hardware for Disciplined Nondeterminism., , and . IEEE Micro, 34 (3): 138-148 (2014)Using Structured Input and Modularity for Improved Learning., , and . CoRR, (2019)One-shot tuner for deep learning compilers., , and . CC, page 89-103. ACM, (2022)DeNovoSync: Efficient Support for Arbitrary Synchronization without Writer-Initiated Invalidations., and . ASPLOS, page 545-559. ACM, (2015)Eliminating on-chip traffic waste: are we there yet?, , , and . ISPASS, page 163-164. IEEE Computer Society, (2015)Parallel SAH k-D tree construction., , , , , , and . High Performance Graphics, page 77-86. Eurographics Association, (2010)