Author of the publication

Flexible Communication Avoiding Matrix Multiplication on FPGA with High-Level Synthesis.

, , and . FPGA, page 244-254. ACM, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations., , , , , , , , , and 4 other author(s). SC, page 43:1-43:17. IEEE, (2022)Topologies of Reasoning: Demystifying Chains, Trees, and Graphs of Thoughts., , , , , , , , , and 4 other author(s). CoRR, (2024)SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems., , , , , , , , , and 9 other author(s). MICRO, page 282-297. ACM, (2021)SeBS: a serverless benchmark suite for function-as-a-service computing., , , , and . Middleware, page 64-78. ACM, (2021)Automatic complexity analysis of explicitly parallel programs., and . SPAA, page 226-235. ACM, (2014)Lifting C semantics for dataflow optimization., , , , , , and . ICS, page 17:1-17:13. ACM, (2022)GNN Scaling 0.1 Software Artifact., , , , , , , , , and 8 other author(s). (June 2023)On the parallel I/O optimality of linear algebra kernels: near-optimal LU factorization., , , , , and . PPoPP, page 463-464. ACM, (2021)Deinsum: Practically I/O Optimal Multi-Linear Algebra., , , , and . SC, page 25:1-25:15. IEEE, (2022)High Performance Unstructured SpMM Computation Using Tensor Cores., , , , , and . CoRR, (2024)