Dynamically Adapting Floating-Point Precision to Accelerate Deep Neural Network Training.

ICMLA, pages 980-987. IEEE, (2021)


Other publications of authors with the same name

An updated set of basic linear algebra subprograms (BLAS). ACM Transactions on Mathematical Software, 28 (2): 135-151 (2002)

A BF16 FMA is All You Need for DNN Training. ARITH, page 9. IEEE, (2022)

Deconstructing HPL-MxP Benchmark: A Numerical Perspective. Euro-Par (1), volume 14801 of Lecture Notes in Computer Science, pages 47-60. Springer, (2024)

On a Distributed Design and Implementation for a Matrix Equation. PPSC, SIAM, (1997)

Improving the Unsymmetric Parallel QR Algorithm on Vector Machines. PPSC, pages 353-357. SIAM, (1993)

ScaLAPACK: A Linear Algebra Library for Message-Passing Computers. PPSC, SIAM, (1997)

Efficiency of High Order Spectral Element Methods on Petascale Architectures. ISC, volume 9697 of Lecture Notes in Computer Science, pages 449-466. Springer, (2016)

Proposed Consistent Exception Handling for the BLAS and LAPACK. Correctness@SC, pages 1-9. IEEE, (2022)

FASE: A Fast, Accurate and Seamless Emulator for Custom Numerical Formats. ISPASS, pages 144-146. IEEE, (2022)

A Distributed Memory Implementation of the Nonsymmetric QR Algorithm. PPSC, SIAM, (1997)