You are building a chat application. Incoming messages are published to a Pub/Sub topic and need to be processed with minimal latency. The solution must automatically scale to handle bursts of messages, minimize infrastructure management, and only incur compute costs when messages are being processed. What should you do?