Watch this video on YouTube
What is the primary challenge when evaluating the performance of generative AI models?