Full AWS Practitioner Certification Question

A tech startup is developing a new foundation model (FM) for image classification, aiming to deploy it for use in various applications, from identifying product defects to recognizing objects in photos. The team needs to assess the accuracy of the model to ensure it meets performance requirements before deployment. What is the best way to assess the accuracy of the foundation model for image classification?