AWS demonstrates an observability solution for LLM inference on Amazon SageMaker that correlates infrastructure metrics (latency, GPU utilization, error rates) with quality metrics (accuracy, consistency) in Amazon CloudWatch and Amazon Managed Grafana, so that both can be optimized together.
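
As a rough illustration of what putting infrastructure and quality metrics side by side could look like, the sketch below builds a payload in the shape CloudWatch's `PutMetricData` expects and computes a simple Pearson correlation between two metric series. It is a minimal sketch, not the solution's actual code: the namespace, metric names, `EndpointName` dimension, and helper functions are all hypothetical, and Pearson correlation is just one possible way to relate the two metric families.

```python
from datetime import datetime, timezone
from math import sqrt

# Hypothetical namespace; the source does not name one.
NAMESPACE = "LLMInference/Observability"


def build_metric_data(endpoint_name, latency_ms, gpu_util_pct, accuracy, timestamp=None):
    """Build a list of metric datums in the shape used by CloudWatch PutMetricData.

    Metric names and the EndpointName dimension are illustrative assumptions.
    """
    ts = timestamp or datetime.now(timezone.utc)
    dimensions = [{"Name": "EndpointName", "Value": endpoint_name}]

    def datum(name, value, unit):
        return {
            "MetricName": name,
            "Dimensions": dimensions,
            "Timestamp": ts,
            "Value": float(value),
            "Unit": unit,
        }

    return [
        datum("Latency", latency_ms, "Milliseconds"),   # infrastructure metric
        datum("GPUUtilization", gpu_util_pct, "Percent"),  # infrastructure metric
        datum("Accuracy", accuracy, "None"),             # quality metric (unitless)
    ]


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric series."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)
```

The list returned by `build_metric_data` could be passed to `boto3.client("cloudwatch").put_metric_data(Namespace=NAMESPACE, MetricData=payload)`, after which Amazon Managed Grafana can chart the metrics through its CloudWatch data source. `pearson` would then be applied to, say, per-minute latency and accuracy series pulled back from CloudWatch to check whether quality degrades as latency rises.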