A startup is developing an application that uses several generative models on Vertex AI for different functions: one for summarizing long documents, one for classifying customer feedback, and one for brainstorming marketing ideas. The engineering lead wants to ensure the application is both high-performing and cost-effective. When designing the application's architecture, which of the following is NOT a valid strategy for optimizing cost and performance on Vertex AI?