Hardware mapping of a parallel algorithm for matrix-vector multiplication overlapping communications and computations