Intramodal consistency in triplet-based cross-modal learning for image retrieval | Publicación