An AI team has packaged a custom data-processing model into a container image. They need to deploy this container as a serverless, scalable API endpoint that a generative AI agent can call as a tool. The service must scale down to zero when not in use to minimize costs. Which Google Cloud service should they use to deploy this containerized tool?