Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Optimizing massively parallel sparse matrix computing on ARM many-core processor.

J. Zheng, J. Jiang, J. Du, D. Huang, and Y. Lu. Parallel Comput., (September 2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Qiu Jiang

Ping Jiang

Jiang Yi

Xin Jiang

Kaiyun Jiang

Other publications of authors with the same name

Optimizing massively parallel sparse matrix computing on ARM many-core processor.J. Zheng, J. Jiang, J. Du, D. Huang, and Y. Lu. Parallel Comput., (September 2023)SAIH: A Scalable Evaluation Methodology for Understanding AI Performance Trend on HPC Systems.J. Du, D. Li, Y. Wen, J. Jiang, D. Huang, X. Liao, and Y. Lu. CoRR, (2022)MixRec: Orchestrating Concurrent Recommendation Model Training on CPU-GPU platform.J. Jiang, R. Tian, J. Du, D. Huang, and Y. Lu. ICCD, page 366-374. IEEE, (2023)Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference.J. Du, J. Wei, J. Jiang, S. Cheng, D. Huang, Z. Chen, and Y. Lu. PPoPP, page 42-54. ACM, (2024)A mechanism for scheduling multi robot intelligent warehouse system face with dynamic demand.Z. Li, A. Barenji, J. Jiang, R. Zhong, and G. Xu. J. Intell. Manuf., 31 (2): 469-480 (2020)Full-Stack Optimizing Transformer Inference on ARM Many-Core CPU.J. Jiang, J. Du, D. Huang, Z. Chen, Y. Lu, and X. Liao. IEEE Trans. Parallel Distributed Syst., 34 (7): 2221-2235 (July 2023)Optimizing small channel 3D convolution on GPU with tensor core.J. Jiang, D. Huang, J. Du, Y. Lu, and X. Liao. Parallel Comput., (2022)Characterizing and Optimizing Transformer Inference on ARM Many-core Processor.J. Jiang, J. Du, D. Huang, D. Li, J. Zheng, and Y. Lu. ICPP, page 20:1-20:11. ACM, (2022)Hierarchical Model Parallelism for Optimizing Inference on Many-core Processor via Decoupled 3D-CNN Structure.J. Jiang, Z. Huang, D. Huang, J. Du, L. Chen, Z. Chen, and Y. Lu. ACM Trans. Archit. Code Optim., 20 (3): 42:1-42:21 (September 2023)Improving Computation and Memory Efficiency for Real-world Transformer Inference on GPUs.J. Du, J. Jiang, J. Zheng, H. Zhang, D. Huang, and Y. Lu. ACM Trans. Archit. Code Optim., 20 (4): 46:1-46:22 (December 2023)

BibSonomy

Disambiguation of "Jiang, Jiazhi"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Optimizing massively parallel sparse matrix computing on ARM many-core processor.

Please choose a person to relate this publication to

Qiu Jiang

Ping Jiang

Jiang Yi

Xin Jiang

Kaiyun Jiang

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Jiang, Jiazhi"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Optimizing massively parallel sparse matrix computing on ARM many-core processor.

Please choose a person to relate this publication to

Qiu Jiang

Ping Jiang

Jiang Yi

Xin Jiang

Kaiyun Jiang

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Optimizing massively parallel sparse matrix computing on ARM many-core processor.