Softmax and ε-greedy policies applied to process control | Publicación