Multimodal audio-visual information fusion using canonical-correlated Graph Neural Network for energy-efficient speech enhancement | Publicación