Scenario: You created an analytics environment on gcp video

 ·  PT1H46M27S  ·  EN

data-engineer-pro video for scenario: You created an analytics environment on Google Cloud to allow your data scientist team to explore data without impacting

Full Certification Question

Scenario: You created an analytics environment on Google Cloud to allow your data scientist team to explore data without impacting the on-premises Apache Hadoop solution. The data in the on-premises Hadoop Distributed File System (HDFS) cluster is stored in Optimized Row Columnar (ORC) formatted files with multiple columns of Hive partitioning. The data scientist team needs to explore the data in a similar way to how they used SQL with the Hive query engine on the on-premises HDFS cluster. You need to select the most cost-effective storage and processing solution. Question: What should you do to enable the data scientist team to explore the data cost-effectively?