Author of the publication

Efficient Stencil Computation with Temporal Blocking by Halide DSL.

, , , , and . ISPA/BDCloud/SocialCom/SustainCom, page 870-877. IEEE, (2022)

Other publications of authors with the same name

Revisiting Temporal Blocking Stencil Optimizations., , , , , , and . ICS, page 251-263. ACM, (2023)

Applying Temporal Blocking with a Directive-based Approach., , and . LLVM-HPC@SC, page 8:1-8:11. ACM, (2017)

An Accurate Simulator of Cache-Line Conflicts to Exploit the Underlying Cache Performance., and . Euro-Par, volume 10417 of Lecture Notes in Computer Science, page 119-133. Springer, (2017)

Exana: an execution-driven application analysis tool for assisting productive performance tuning., , and . SEPS@SPLASH, page 1-10. ACM, (2015)

The scalable petascale data-driven approach for the Cholesky factorization with multiple GPUs., , and . ESPM@SC, page 38-45. ACM, (2015)

mdx: A Cloud Platform for Supporting Data Science and Cross-Disciplinary Research Collaborations., , , , , , , , , and 22 other author(s). DASC/PiCom/CBDCom/CyberSciTech, page 1-7. IEEE, (2022)

Pyramid Swin Transformer: Different-Size Windows Swin Transformer for Image Classification and Object Detection., , , and . VISIGRAPP (5: VISAPP), page 583-590. SCITEPRESS, (2023)

Statistical power modeling of GPU kernels using performance counters., , , , and . Green Computing Conference, page 115-122. IEEE Computer Society, (2010)

High-performance general solver for extremely large-scale semidefinite programming problems., , , , , and . SC, page 93. IEEE/ACM, (2012)

Access-pattern and bandwidth aware file replication algorithm in a grid environment., , , and . GRID, page 250-257. IEEE Computer Society, (2008)