Author of the publication

BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling.

, , , , and . MLSys, mlsys.org, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Visage: enabling timely analytics for drone imagery., , , , , , , , , and . MobiCom, page 789-803. ACM, (2021)GradiVeQ: Vector Quantization for Bandwidth-Efficient Gradient Aggregation in Distributed CNN Training., , , , , , , , and . NeurIPS, page 5129-5139. (2018)PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication., , , , , and . ICLR, OpenReview.net, (2022)BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling., , , , and . MLSys, mlsys.org, (2022)Liquid state machine based pattern recognition on FPGA with firing-activity dependent power gating and approximate computing., , and . ISCAS, page 361-364. IEEE, (2016)DeepStore: In-Storage Acceleration for Intelligent Queries., , , , , , , , , and . MICRO, page 224-238. ACM, (2019)Accelerating distributed reinforcement learning with in-switch computing., , , , , and . ISCA, page 279-291. ACM, (2019)Pipe-SGD: A Decentralized Pipelined SGD Framework for Distributed Deep Net Training., , , , , and . NeurIPS, page 8056-8067. (2018)Doing more with less: training large DNN models on commodity servers for the masses., , , and . HotOS, page 119-127. ACM, (2021)A Network-Centric Hardware/Algorithm Co-Design to Accelerate Distributed Training of Deep Neural Networks., , , , , , , , , and . MICRO, page 175-188. IEEE Computer Society, (2018)