Author of the publication

Exploring cache bypassing and partitioning for multi-tasking on GPUs.

, , and . ICCAD, page 9-16. IEEE, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

TGPA: tile-grained pipeline architecture for low latency CNN inference., , , , , and . ICCAD, page 58. ACM, (2018)MultiDS-MDA: Integrating multiple data sources into heterogeneous network for predicting novel metabolite-drug associations., , , , , , and . Comput. Biol. Medicine, (August 2023)CuLDA_CGS: solving large-scale LDA problems on GPUs., , , and . PPoPP, page 435-436. ACM, (2019)AMOS: enabling automatic mapping for tensor computations on spatial accelerators with hardware abstraction., , , , , , , , , and . ISCA, page 874-887. ACM, (2022)CuLDA: Solving Large-scale LDA Problems on GPUs., , , and . HPDC, page 195-205. ACM, (2019)Exploring cache bypassing and partitioning for multi-tasking on GPUs., , and . ICCAD, page 9-16. IEEE, (2017)Centauri: Enabling Efficient Scheduling for Communication-Computation Overlap in Large Model Training via Communication Partitioning., , , , , , and . ASPLOS (3), page 178-191. ACM, (2024)A Review of Data Augmentation Methods of Remote Sensing Image Target Recognition., , , , , and . Remote. Sens., 15 (3): 827 (February 2023)A coordinated tiling and batching framework for efficient GEMM on GPUs., , , , and . PPoPP, page 229-241. ACM, (2019)Design for learners., , , , , , , , , and 5 other author(s). CCHI, page 304-314. ACM, (2022)