The continued usefulness of vocabulary tests for evaluating large language models | Publicación