You are a Machine Learning Engineer deploying a scalable web application on AWS that provides real-time recommendations using model inference. The application must efficiently handle sudden spikes in user traffic without increasing latency. Which AWS service is the most appropriate to meet this requirement?