Optimizing LLM Inference: Metrics that Matter for Real Time Applications. (2025). Journal of Artificial Intelligence & Cloud Computing, 4(1), 1-4. https://doi.org/10.47363/JAICC/2025(4)446