VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification | Publicación