Author of the publication

CAHTR: Communication-Avoiding Householder TRidiagonalization.

PARCO, volume 27 of Advances in Parallel Computing, page 381-390. IOS Press, (2015)


Other publications of authors with the same name

Communication-overlap techniques for improved strong scaling of gyrokinetic Eulerian code beyond 100k cores on the K-computer. Int. J. High Perform. Comput. Appl., 28 (1): 73-86 (2014)

Performance Analysis of the Householder-Type Parallel Tall-Skinny QR Factorizations Toward Automatic Algorithm Selection. VECPAR, volume 8969 of Lecture Notes in Computer Science, page 269-283. Springer, (2014)

Grid Computing Supporting System on ITBL Project. ISHPC, volume 2858 of Lecture Notes in Computer Science, page 245-257. Springer, (2003)

Gordon Bell finalists I - High-performance computing for exact numerical approaches to quantum many-body problems on the earth simulator. SC, page 47. ACM Press, (2006)

10TFLOPS Eigenvalue Solver for Strongly-Correlated Fermions on the Earth Simulator. Parallel and Distributed Computing and Networks, page 638-643. IASTED/ACTA Press, (2005)

Iterative methods with mixed-precision preconditioning for ill-conditioned linear systems in multiphase CFD simulations. ScalA@SC, page 1-8. IEEE, (2021)

CAHTR: Communication-Avoiding Householder TRidiagonalization. PARCO, volume 27 of Advances in Parallel Computing, page 381-390. IOS Press, (2015)

Performance Analysis of 2D-compatible 2.5D-PDGEMM on Knights Landing Cluster. ICCS (3), volume 10862 of Lecture Notes in Computer Science, page 853-858. Springer, (2018)

Accurate Matrix Multiplication on Binary128 Format Accelerated by Ozaki Scheme. ICPP, page 78:1-78:11. ACM, (2021)

Task Scheduling Strategies for Batched Basic Linear Algebra Subprograms on Many-core CPUs. MCSoC, page 234-241. IEEE, (2021)