Which of the following is the most efficient way to compute the average salary per department in PySpark while excluding rows where the salary column is null?