A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFrame df. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Incremental state information should be maintained for 10 minutes for late-arriving data.
Streaming DataFrame df has the following schema:
"device_id INT, event_time TIMESTAMP, temp FLOAT, humidity FLOAT"
Code block:
Choose the response that correctly fills in the blank within the code block to complete this task.
aragorn_brego
Highly Voted 1 year, 6 months agosturcu
Highly Voted 1 year, 7 months agoKadELbied
Most Recent 1 month, 1 week ago71dfab9
10 months, 1 week agoDileepvikram
1 year, 7 months ago