As an ML engineer building a real-time fraud detection system for a financial institution, which AWS service should you choose for a fully managed real-time inference environment with automatic scaling and built-in model monitoring?