You downloaded a TensorFlow language model gcp video

 ·  1 h 46 min 27 s  ·  EN

Full Certification Question

You downloaded a TensorFlow language model pre-trained on a proprietary dataset by another company, and you tuned the model with Vertex AI Training by replacing the last layer with a custom dense layer. The model achieves the expected offline accuracy; however, it exceeds the required online prediction latency by 20 ms. You want to optimize the model to reduce latency while minimizing the offline performance drop before deploying the model to production. What should you do?
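The scenario hinges on a measurable gap between observed and required prediction latency, so it can help to see how such a gap might be quantified before and after any optimization. The following is a minimal sketch, not part of the question: the helper `latency_overshoot_ms`, the stand-in `fake_predict`, the 100 ms budget, and the choice of the 95th percentile as the latency metric are all assumptions made for illustration. In practice `predict` would wrap a call to the deployed model.

```python
import statistics
import time
from typing import Callable, Sequence


def latency_overshoot_ms(predict: Callable[[object], object],
                         requests: Sequence[object],
                         budget_ms: float,
                         warmup: int = 5) -> dict:
    """Time `predict` on each request and compare p95 latency to a budget.

    Hypothetical helper: the p95 metric and the warm-up count are assumptions.
    """
    # Warm-up calls are not timed, so one-time setup cost does not skew results.
    for request in requests[:warmup]:
        predict(request)

    samples_ms = []
    for request in requests:
        start = time.perf_counter()
        predict(request)
        samples_ms.append((time.perf_counter() - start) * 1000.0)

    # quantiles(n=20) returns 19 cut points; the last is the 95th percentile.
    p95 = statistics.quantiles(samples_ms, n=20)[-1]
    return {
        "median_ms": statistics.median(samples_ms),
        "p95_ms": p95,
        "over_budget_ms": max(0.0, p95 - budget_ms),
    }


def fake_predict(_request: object) -> None:
    """Stand-in for a model call; sleeps about 2 ms instead of running inference."""
    time.sleep(0.002)


# Assumed budget of 100 ms, chosen only so the stand-in stays under it.
report = latency_overshoot_ms(fake_predict, list(range(20)), budget_ms=100.0)
print(report)
```

Running the same measurement on the original and the optimized model, alongside the offline accuracy metric, gives the two numbers the question asks you to trade off.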