Enhancing image–text matching through multi-level semantic consistency alignment | Publicación