A Generative AI Engineer is monitoring a deployed RAG system that shows a decline in user satisfaction due to irrelevant outputs. The retrieval system has high latency. What metrics should the engineer focus on to diagnose and resolve the issue?
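The question names two symptoms: irrelevant outputs and high retrieval latency. A minimal sketch of how each could be quantified follows, assuming the engineer has logged retrieved document IDs per query, a hand-labeled set of relevant IDs, and per-request retrieval latencies in milliseconds. The function names (`precision_at_k`, `recall_at_k`, `latency_percentile`) and the sample data are hypothetical illustrations, not part of any particular monitoring library.

```python
from typing import Sequence, Set


def precision_at_k(retrieved: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k retrieved documents that are labeled relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for doc_id in retrieved[:k] if doc_id in relevant)
    return hits / k


def recall_at_k(retrieved: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of all labeled-relevant documents that appear in the top k."""
    if k <= 0:
        raise ValueError("k must be positive")
    if not relevant:
        return 0.0
    return len(set(retrieved[:k]) & relevant) / len(relevant)


def latency_percentile(latencies_ms: Sequence[float], pct: int) -> float:
    """Nearest-rank percentile of the logged latencies (pct is an integer 1..100)."""
    if not latencies_ms:
        raise ValueError("no latency samples")
    if not 1 <= pct <= 100:
        raise ValueError("pct must be between 1 and 100")
    ordered = sorted(latencies_ms)
    # Integer ceiling division avoids floating-point rounding in the rank.
    rank = -(-pct * len(ordered) // 100)
    return ordered[rank - 1]


# Hypothetical logged data for one query and ten retrieval requests.
retrieved_ids = ["d1", "d7", "d3", "d9"]
relevant_ids = {"d1", "d3", "d5"}
latencies = [120, 80, 300, 95, 110, 2000, 105, 90, 100, 115]

print("precision@4:", precision_at_k(retrieved_ids, relevant_ids, 4))
print("recall@4:", recall_at_k(retrieved_ids, relevant_ids, 4))
print("p50 latency (ms):", latency_percentile(latencies, 50))
print("p95 latency (ms):", latency_percentile(latencies, 95))
```

Tracking a tail percentile alongside the median is a deliberate choice in this sketch: a single slow request, like the 2000 ms sample above, barely moves the median but dominates p95, which is usually closer to what users experience as slowness.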