This is a dedicated watch page for a single video.
You are deploying a machine learning model to provide real-time predictions in an e-commerce application. The deployment requires low latency and high throughput. Which configuration should you choose for optimal online deployment?