On the design of a similarity function for sparse binary data with application on protein function annotation | Publicación