Scenario: You need to design a data pipeline that copies time-series transaction data into BigQuery so that your data science team can analyze it. Thousands of transactions receive status updates every hour. The initial dataset size is 1.5 PB, and it grows by 3 TB per day. The data is heavily structured, and the team will use it to build machine learning models. Your goal is to maximize performance and usability for the data science team. Question: Which two strategies should you use to achieve this goal? (Choose two.)