You are a machine learning engineer tasked with developing a cost-effective solution for a healthcare application that predicts patient re-admission risk. The application requires high availability and low-latency predictions due to the critical nature of the use case. Which AWS service should you use to deploy your machine learning model?