Automatic Parallelization of Kernels in Shared-Memory Multi-GPU Nodes | Publication