Multi-scale hybrid vision transformer and Sinkhorn tokenizer for sewer defect classification | Publicación