Analysis of representation and generalization capabilities of pre-trained audio models in urban environments | Publicación