FLAMA: Architecting Floating-Point Atomic Memory Operations for Heterogeneous HPC Systems | Publicación