Author of the publication

Fast Deep Neural Network Training on Distributed Systems and Cloud TPUs.

, , , , and . IEEE Trans. Parallel Distributed Syst., 30 (11): 2449-2462 (2019)

Other publications of authors with the same name

Faster Numerical Algorithms via Exception Handling., and . IEEE Trans. Computers, 43 (8): 983-992 (1994)

Model Reduction for RF MEMS Simulation., , and . PARA, volume 3732 of Lecture Notes in Computer Science, page 286-295. Springer, (2004)

The generalized Schur decomposition of an arbitrary pencil A-λB - robust software with error bounds and applications. Part II: software and applications., and . ACM Trans. Math. Softw., 19 (2): 175-201 (1993)

CALU: A Communication Optimal LU Factorization Algorithm., , and . SIAM J. Matrix Anal. Appl., 32 (4): 1317-1350 (2011)

Design of a Parallel Nonsymmetric Eigenroutine Toolbox, Part I., and . PPSC, page 391-398. SIAM, (1993)

Accurate and Efficient Expression Evaluation and Linear Algebra, , , and . CoRR, (2007)

Code Generators for Automatic Tuning of Numerical Kernels: Experiences with FFTW., and . SAIG, volume 1924 of Lecture Notes in Computer Science, page 190-211. Springer, (2000)

Bifurcation Analysis of Large Equilibrium Systems in Matlab., , , , and . International Conference on Computational Science (1), volume 3514 of Lecture Notes in Computer Science, page 50-57. Springer, (2005)

An improved analysis and unified perspective on deterministic and randomized low rank matrix approximations., , and . CoRR, (2019)

Algorithm 880: A testing infrastructure for symmetric tridiagonal eigensolvers., , , and . ACM Trans. Math. Softw., 35 (1): 8:1-8:13 (2008)