A sports betting application relies on machine learning to predict match outcomes. These predictions are obtained by invoking an Amazon SageMaker endpoint. A machine learning engineer must ensure the SageMaker model stays available, even with a surge of traffic expected during an upcoming finals game. Which solution would offer the highest scalability to meet this requirement?