Accelerating distributed deep neural network training with pipelined MPI allreduce | Publicación