Question 1

Full Certification Question

An IT company needs to set up a data lake on Amazon S3 for a healthcare client. The data lake is split into raw and curated zones. For compliance reasons, the source data needs to be kept for a minimum of 5 years. The source data arrives in the raw zone and is then processed via an AWS Glue-based ETL job into the curated zone. The data engineering team runs ad-hoc queries only on the data in the curated zone using Athena. The team is concerned about the cost of data storage in both the raw and curated zones as the data is increasing at a rate of 2 TB daily in each zone. Which of the following options would you implement together as the MOST cost-optimal solution? (Select two)
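To put the storage concern in perspective, the growth figures in the question can be turned into a rough volume estimate. The sketch below uses only the numbers stated above (2 TB per day per zone, 5-year minimum retention for source data). The 365-day year, the decimal TB-to-PB conversion, and the helper name `accumulated_tb` are assumptions made for illustration. No pricing is included, since the question gives none.

```python
# Rough sizing sketch based on the figures in the question.
# Assumptions (not from the question): 365-day years, 1 PB = 1000 TB.
DAILY_GROWTH_TB = 2      # from the question: 2 TB per day in each zone
RETENTION_YEARS = 5      # from the question: minimum retention for source data
DAYS_PER_YEAR = 365      # assumption: leap days ignored


def accumulated_tb(days: int, daily_tb: float = DAILY_GROWTH_TB) -> float:
    """Total data accumulated in one zone after `days` of steady growth."""
    return days * daily_tb


# Raw zone: source data must be retained for at least 5 years.
raw_zone_tb = accumulated_tb(RETENTION_YEARS * DAYS_PER_YEAR)
print(f"Raw zone after {RETENTION_YEARS} years: {raw_zone_tb} TB "
      f"({raw_zone_tb / 1000} PB)")
```

Under these assumptions the raw zone alone holds about 3.65 PB by the end of the retention window, which is why the storage class chosen for rarely accessed data dominates the cost picture.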