A formal model for multiagent Q-learning on graphs | Publication