You have developed a machine learning model to predict customer churn and plan to deploy it using AWS SageMaker for real-time predictions during customer interactions. After initial testing, you observe that the endpoint latency exceeds your application's requirements. No advanced configurations have been applied yet. Which of the following configurations would most effectively reduce the latency of your SageMaker endpoint?