A startup is building a generative AI application to create personalized children's stories. A key technical constraint they face is the limited availability of GPUs on their chosen deployment platform for real-time story generation, which could impact user experience. This constraint will primarily influence which aspect of their gen AI solution?