Learning General Policies with Policy Gradient Methods | Publication