Author of the publication

At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads.

, , , , , , , , , , , and . ACM Trans. Archit. Code Optim., 20 (4): 57:1-57:26 (December 2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Data-centric GPU-based adaptive mesh refinement., and . IA3@SC, page 3:1-3:7. ACM, (2015)Scalable Kernel Fusion for Memory-Bound GPU Applications., and . SC, page 191-202. IEEE Computer Society, (2014)Highly optimized full GPU-acceleration of non-hydrostatic weather model SCALE-LES., and . CLUSTER, page 1-8. IEEE Computer Society, (2013)A Framework for Cloud Embedded Web Services Utilized by Cloud Applications., , , and . SERVICES, page 265-271. IEEE Computer Society, (2011)At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads., , , , , , , , , and 2 other author(s). ACM Trans. Archit. Code Optim., 20 (4): 57:1-57:26 (December 2023)A Bayesian Optimization Algorithm for De Novo ligand design based docking running over GPU., , , and . IEEE Congress on Evolutionary Computation, page 1-8. IEEE, (2010)Scaling distributed deep learning workloads beyond the memory capacity with KARMA., , , , , , , and . SC, page 19. IEEE/ACM, (2020)Scalable FBP decomposition for cone-beam CT reconstruction., , , , , , , , and . SC, page 9. ACM, (2021)Exploiting Scratchpad Memory for Deep Temporal Blocking: A case study for 2D Jacobian 5-point iterative stencil kernel (j2d5pt)., , , , , , and . GPGPU@PPoPP, page 34-35. ACM, (2023)Model for dynamic grain sizing through compound parallelization for an optimization problem solving grid application., , , and . GRID, page 316-321. IEEE Computer Society, (2008)