POAR: Efficient Policy Optimization via Online Abstract State Representation Learning