You are a machine learning engineer at a biotechnology company, working on a deep learning model to analyze genomic sequences. The dataset consists of billions of nucleotide sequences, so training requires significant computational resources. In addition, the deployed model must provide real-time, low-latency inference for clinical applications. Which compute environment should you choose for training and for inference to optimize performance and cost-efficiency?