Video upload date:  · Duration: PT1H46M27S  · Language: EN

Scenario: Your organization stores customer data gcp video

data-engineer-pro video for scenario: Your organization stores customer data in an on-premises Apache Hadoop cluster in Apache Parquet format. Apache Spark jobs

This is a dedicated watch page for a single video.

Full Certification Question

Scenario: Your organization stores customer data in an on-premises Apache Hadoop cluster in Apache Parquet format. Apache Spark jobs process the data daily on the cluster. You plan to migrate the Parquet data and Spark jobs to Google Cloud. Future transformation pipelines will use BigQuery, so the data must be accessible in BigQuery. You want to use managed services while minimizing changes to ETL processing and reducing overhead costs. Question: How should you migrate and manage the data and Spark pipelines to meet the requirements?