Load balancing in a heterogeneous world: CPU-Xeon Phi co-execution of data-parallel kernels | Publicación