A novel evaluation benchmark for medical LLMs illuminating safety and effectiveness in clinical domains