A fine-tuning approach based on spatio-temporal features for few-shot video object detection | Publication