Author of the publication

Improving performance of optimized kernels through fast instantiations of templates.

, , and . Concurr. Comput. Pract. Exp., 21 (1): 59-70 (2009)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Automatic Code Motion to Extend MPI Nonblocking Overlap Window., , , , and . ISC Workshops, volume 12321 of Lecture Notes in Computer Science, page 43-54. Springer, (2020)Feasibility of Whole-Heart Electrophysiological Models With Near-Cellular Resolution., , , and . CinC, page 1-4. IEEE, (2020)MPI Thread-Level Checking for MPI+OpenMP Applications., , and . Euro-Par, volume 9233 of Lecture Notes in Computer Science, page 31-42. Springer, (2015)SPAGHETtI: Scheduling/Placement Approach for Task-Graphs on HETerogeneous archItecture., and . Euro-Par, volume 8632 of Lecture Notes in Computer Science, page 174-185. Springer, (2014)Compositional Approach Applied to Loop Specialization., , and . Euro-Par, volume 4641 of Lecture Notes in Computer Science, page 268-279. Springer, (2007)Optimizing code through iterative specialization., , and . SAC, page 206-210. ACM, (2008)Hydra: Automatic algorithm exploration from linear algebra equations., , and . CGO, page 25:1-25:10. IEEE Computer Society, (2013)Optimal scheduling algorithms for software-defined radio pipelined and replicated task chains on multicore architectures., , , , , , , and . J. Parallel Distributed Comput., (2025)Performance Portability of Generated Cardiac Simulation Kernels Through Automatic Dimensioning and Load Balancing on Heterogeneous Nodes., , , , , , , , and . IPDPS (Workshops), page 1006-1015. IEEE, (2024)Compositional approach applied to loop specialization., , and . Concurr. Comput. Pract. Exp., 21 (1): 71-84 (2009)