Learner excellence biased by data set selection: A case for data characterisation and artificial data sets | Publicación