Author of the publication

Tiling Optimizations for Stencil Computations Using Rewrite Rules in Lift.

ACM Trans. Archit. Code Optim., 16 (4): 52:1-52:25 (2020)


Other publications of authors with the same name

Mapping parallelism in a functional IR through constraint satisfaction: a case study on convolution for mobile GPUs. CC, page 218-230. ACM, (2022)

SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator. MICRO, page 1005-1021. ACM, (2021)

Tiling Optimizations for Stencil Computations Using Rewrite Rules in Lift. ACM Trans. Archit. Code Optim., 16 (4): 52:1-52:25 (2020)

Generating high performance code for irregular data structures using dependent types. FHPNC@ICFP, page 37-49. ACM, (2021)

Accelerated Finite State Machine Test Execution Using GPUs. APSEC, page 109-118. IEEE, (2018)

High-level hardware feature extraction for GPU performance prediction of stencils. GPGPU@PPoPP, page 21-30. ACM, (2020)

High-level synthesis of functional patterns with Lift. ARRAY@PLDI, page 35-45. ACM, (2019)

Binary Ostensibly-Implicit Trees for Fast Collision Detection. Comput. Graph. Forum, 39 (2): 509-521 (2020)

Optimizing data reshaping operations in functional IRs for high-level synthesis. LCTES, page 61-72. ACM, (2022)

Fast Optimisation of Convolutional Neural Network Inference using System Performance Models. EuroMLSys@EuroSys, page 104-110. ACM, (2021)