Author of the publication

A Flexible Research-Oriented Framework for Distributed Training of Deep Neural Networks.

, , , , and . IPDPS Workshops, page 730-739. IEEE, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

High performance and energy efficient inference for deep learning on ARM processors., , , , and . CoRR, (2021)Modeling power and energy consumption of dense matrix factorizations on multicore processors., , , and . Concurr. Comput. Pract. Exp., 26 (17): 2743-2757 (2014)Parallelizing and Optimizing LHCb-Kalman for Intel Xeon Phi KNL Processors., , , , , and . PDP, page 741-750. IEEE Computer Society, (2018)CID: A Compile-Time Implementation Decider for Heterogeneous Platforms Based on C++ Attributes., , , and . UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld, page 1149-1156. IEEE Computer Society, (2016)Optimizing Convolutions for Deep Learning Inference on ARM Cortex-M Processors., , , and . IEEE Internet Things J., 11 (15): 26203-26219 (August 2024)Automatic detection of power bottlenecks in parallel scientific applications., , , , and . Comput. Sci. Res. Dev., 29 (3-4): 221-229 (2014)Energy-aware matrix computacion on multirhreaded architectures.. Jaume I University, Spain, (2014)Towards Portable Realizations of Winograd-based Convolution with Vector Intrinsics and OpenMP., , and . PDP, page 39-46. IEEE, (2022)Supporting Advanced Patterns in GrPPI, a Generic Parallel Pattern Interface., , , and . Euro-Par Workshops, volume 10659 of Lecture Notes in Computer Science, page 55-67. Springer, (2017)A generic parallel pattern interface for stream and data processing., , , and . Concurr. Comput. Pract. Exp., (2017)