Possibilistic reward methods for the multi-armed bandit problem