Measuring and Improving the Energy Efficiency of Large Language Models Inference | Publicación