Self-Supervised Visual Representations for Cross-Modal Retrieval | Publicación