Video upload date:  · Duration: PT1H46M27S  · Language: EN

A junior data engineer has been asked to databricks video

data-engineer-professional video for a junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df.

This is a dedicated watch page for a single video.

Full Certification Question

A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Events are recorded once per minute per device. The streaming DataFrame df has the following schema: "device_id INT, event_time TIMESTAMP, temp FLOAT, humidity FLOAT" Code block: df . withWatermark ( "event_time" , "10 minutes" ) . groupBy ( _________ , "device_id" ) . agg ( avg ( "temp" ). alias ( "avg_temp" ), avg ( "humidity" ). alias ( "avg_humidity" ) ) . writeStream . format ( "delta" ) . saveAsTable ( "sensor_avg" ) ) Choose the response that correctly fills in the blank within the code block to complete this task.