A company is building a generative AI agent that needs to understand spoken customer inquiries, process the request, and then respond verbally. The agent will also need to analyze the sentiment of the customer's spoken words to tailor its response style. Which combination of Google Cloud AI APIs would be most essential for this agent's core functionalities?