“Optimizing LLM Inference: Metrics that Matter for Real Time Applications” (2025) Journal of Artificial Intelligence & Cloud Computing, 4(1), pp. 1–4. doi:10.47363/JAICC/2025(4)446.