This is a dedicated watch page for a single video.
As an ML engineer, you are designing a system to handle real-time predictions for a web application based on user interactions, which requires low-latency responses. Which AWS service should you primarily use to deploy your model to achieve this objective?