This is a dedicated watch page for a single video.
You are overseeing the data lake infrastructure for DataNova Corp , which is built on BigQuery. The data ingestion pipelines pull messages from Pub/Sub and write the incoming data into BigQuery tables. After rolling out a new version of the ingestion pipelines, you notice that the daily data volume stored in BigQuery has surged by 50%, even though the data volume in Pub/Sub hasn't changed. Only certain BigQuery tables show a doubling in the size of their daily partitions. How should you investigate and resolve the root cause of this increase?