Balancing task- and data-level parallelism to improve performance and energy consumption of matrix computations on the Intel Xeon Phi | Publicación