Multimodal Audio-Visual Information Fusion Using Canonical-Correlated Graph Neural Network for Energy-Efficient Speech Enhancement | Publicación