Multimodal Audio-Language Model for Speech Emotion Recognition | Publicación