You are an ML Engineer at a financial services company tasked with deploying a machine learning model for real-time fraud detection. The model requires low-latency inference in production and a cost-effective test environment for experimentation and validation. Which two strategies should you use to provision compute resources for production and testing environments using Amazon SageMaker? (Select two)