A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Incremental state information should be maintained for 10 minutes for late-arriving data.
Streaming DataFrame df has the following schema:
"device_id INT, event_time TIMESTAMP, temp FLOAT, humidity FLOAT"
Code block:
Choose the response that correctly fills in the blank within the code block to complete this task.
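
The following is not the code block from the question. It is a minimal plain-Python sketch of the semantics the task describes, under simplifying assumptions: events are grouped into non-overlapping five-minute windows keyed on event_time, and a window stops accepting late rows once it ends more than 10 minutes before the latest event time seen. In Spark Structured Streaming these two ideas correspond to `window` and `withWatermark`; there the watermark advances per micro-batch, whereas this sketch updates it per event. The names `WINDOW`, `LATENESS`, `window_start`, and `aggregate` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from datetime import datetime, timedelta

WINDOW = timedelta(minutes=5)     # tumbling (non-overlapping) window length
LATENESS = timedelta(minutes=10)  # how long state is kept for late rows (assumed threshold)
EPOCH = datetime(1970, 1, 1)


def window_start(event_time):
    """Floor a timestamp to the start of its five-minute window."""
    return event_time - (event_time - EPOCH) % WINDOW


def aggregate(events):
    """Average temp and humidity per window; events arrive in the given order.

    Each event is (device_id, event_time, temp, humidity).
    Returns ({window_start: (avg_temp, avg_humidity)}, dropped_count).
    """
    sums = defaultdict(lambda: [0.0, 0.0, 0])  # window start -> [temp sum, humidity sum, count]
    max_event_time = None
    dropped = 0
    for _device_id, event_time, temp, humidity in events:
        if max_event_time is None or event_time > max_event_time:
            max_event_time = event_time
        # Simplification: the watermark is recomputed after every event.
        watermark = max_event_time - LATENESS
        start = window_start(event_time)
        if start + WINDOW <= watermark:
            # The window closed before the watermark, so the row is too late.
            dropped += 1
            continue
        acc = sums[start]
        acc[0] += temp
        acc[1] += humidity
        acc[2] += 1
    averages = {start: (t / n, h / n) for start, (t, h, n) in sums.items()}
    return averages, dropped


# Made-up sample rows matching the schema's column order.
events = [
    (1, datetime(2024, 1, 1, 12, 1), 20.0, 40.0),
    (1, datetime(2024, 1, 1, 12, 3), 22.0, 50.0),
    (2, datetime(2024, 1, 1, 12, 21), 30.0, 60.0),
    (1, datetime(2024, 1, 1, 12, 4), 100.0, 100.0),  # arrives after its window has expired
    (1, datetime(2024, 1, 1, 12, 13), 10.0, 20.0),   # late, but still within the threshold
]
averages, dropped = aggregate(events)
```

In this sample the 12:04 row is discarded because its window (12:00 to 12:05) ended more than 10 minutes before the latest event time seen (12:21), while the 12:13 row is still accepted into the 12:10 window.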