Online fitted policy iteration based on extreme learning machines | Publicación