Tailored design of Audio–Visual Speech Recognition models using Branchformers | Publicación