High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS | Publicación