Policy weighting via discounted Thompson sampling for non-stationary market-making