Regularizing Transformers With Deep Probabilistic Layers | Publicación