Large-Scale, Diverse, Paraphrastic Bitexts via Sampling and Clustering | Publication