Agrawal, M., Chen, I. Y., Gulamali, F., & Joshi, S. (2025). The evaluation illusion of large language models in medicine. npj Digital Medicine, 8(1), 600.