Effective automatic computation placement and data allocation for parallelization of regular programs.

ICS, page 13-22. ACM, (2014)

Other publications of authors with the same name

Automatic mapping of nested loops to FPGAs. PPoPP, page 101-111. ACM, (2007)

Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. PPoPP, page 219-228. ACM, (2009)

PLUTO+: near-complete modeling of affine transformations for parallelism and locality. PPoPP, page 54-64. ACM, (2015)

High Performance RDMA Based All-to-All Broadcast for InfiniBand Clusters. HiPC, volume 3769 of Lecture Notes in Computer Science, page 148-157. Springer, (2005)

Compiling affine loop nests for distributed-memory parallel architectures. SC, page 33:1-33:12. ACM, (2013)

PolyGLoT: A Polyhedral Loop Transformation Framework for a Graphical Dataflow Language. CC, volume 7791 of Lecture Notes in Computer Science, page 123-143. Springer, (2013)

High Performance Code Generation in MLIR: An Early Case Study with GEMM. CoRR, (2020)

A practical automatic polyhedral parallelizer and locality optimizer. PLDI, page 101-113. ACM, (2008)

An effective fusion and tile size model for optimizing image processing pipelines. PPoPP, page 261-275. ACM, (2018)

A Model for Fusion and Code Motion in an Automatic Parallelizing Compiler. Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, page 343-352. New York, NY, USA, ACM, (2010)