This is a dedicated watch page for a single video.
Scenario: You are designing a cloud-native historical data processing system to meet the following requirements: Data is in CSV, Avro, and PDF formats. Multiple analysis tools (Dataproc, BigQuery, Compute Engine) will access the data. A batch pipeline processes daily data. Performance is not a concern, but availability must be maximized. Question: How should you design the data storage for this system?