Efficient and scalable barrier synchronization for many-core CMPs | Publication