Abstract

I present a new way to parallelize the training of convolutional neural networks across multiple GPUs. The method scales significantly better than all alternatives when applied to modern convolutional neural networks.

Description

One weird trick for parallelizing convolutional neural networks

Links and resources

Tags