You work for an organization that runs a streaming music service. A custom model in production provides "next song" recommendations based on each user's listening history. The model is deployed on a Vertex AI endpoint and was recently retrained on fresh data; the retrained model showed positive results in offline tests. You now want to test the new model in production with as little added complexity as possible. What should you do?