A Generative AI Engineer deployed a databricks video

 ·  PT1H46M27S  ·  EN

generative-ai video for a Generative AI Engineer deployed a multilingual RAG application in production. After two weeks, the product team notices inconsistent

Full Certification Question

A Generative AI Engineer deployed a multilingual RAG application in production. After two weeks, the product team notices inconsistent performance across languages and slower response times during peak hours. Additionally, the LLM API usage cost has nearly doubled compared to initial estimates. Which combination of monitoring strategies should the engineer implement to address both performance and cost concerns?