Scenario: You are training a TensorFlow machine learning model on Compute Engine virtual machines (n2-standard-32), and training takes two days to complete. The model has custom TensorFlow operations that must run partially on a CPU. You want to reduce the training time in a cost-effective manner.

Question: What should you do to reduce the training time of the TensorFlow model in a cost-effective way?