Using the words/leafs ratio in the DOM tree for content extraction