generative-ai-leader video for a global retail company wants to launch a personalized marketing campaign using generative AI. Their business analysts have
A global retail company wants to launch a personalized marketing campaign using generative AI. Their business analysts have identified a promising use case, and the data science team has confirmed they have access to relevant customer data. However, the data is spread across multiple legacy systems, contains inconsistencies, and has many missing fields. To make this data usable for training an AI model, which stage of the machine learning lifecycle will require the most significant initial investment of time and resources?