Author of the publication

Optimizing the linear fascicle evaluation algorithm for many-core systems.

ICS, page 425-437. ACM, (2019)


Other publications of authors with the same name

High Performance RDMA Based All-to-All Broadcast for InfiniBand Clusters. HiPC, volume 3769 of Lecture Notes in Computer Science, page 148-157. Springer, (2005)

Automatic mapping of nested loops to FPGAs. PPoPP, page 101-111. ACM, (2007)

Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. PPoPP, page 219-228. ACM, (2009)

PLUTO+: near-complete modeling of affine transformations for parallelism and locality. PPoPP, page 54-64. ACM, (2015)

Compiling affine loop nests for distributed-memory parallel architectures. SC, page 33:1-33:12. ACM, (2013)

An effective fusion and tile size model for optimizing image processing pipelines. PPoPP, page 261-275. ACM, (2018)

A practical automatic polyhedral parallelizer and locality optimizer. PLDI, page 101-113. ACM, (2008)

PolyGLoT: A Polyhedral Loop Transformation Framework for a Graphical Dataflow Language. CC, volume 7791 of Lecture Notes in Computer Science, page 123-143. Springer, (2013)

High Performance Code Generation in MLIR: An Early Case Study with GEMM. CoRR, (2020)

A Model for Fusion and Code Motion in an Automatic Parallelizing Compiler. Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, page 343-352. New York, NY, USA, ACM, (2010)