This is a dedicated watch page for a single video.
The data engineering team has a Silver table called ‘sales_cleaned’ where new sales data is appended in near real-time. They want to create a new Gold-layer entity against the ‘sales_cleaned’ table to calculate the year-to-date (YTD) of the sales amount. The new entity will have the following schema: country_code STRING, category STRING, ytd_total_sales FLOAT, updated TIMESTAMP It’s enough for these metrics to be recalculated once daily. But since they will be queried very frequently by several business teams, the data engineering team wants to cut down the potential costs and latency associated with materializing the results. Which of the following solutions meets these requirements?