exam questions

Exam DP-300 All Questions

View all questions & answers for the DP-300 exam

Exam DP-300 topic 1 question 21 discussion

Actual exam question from Microsoft's DP-300
Question #: 21
Topic #: 1
[All DP-300 Questions]

You plan to build a structured streaming solution in Azure Databricks. The solution will count new events in five-minute intervals and report only events that arrive during the interval.
The output will be sent to a Delta Lake table.
Which output mode should you use?

  • A. complete
  • B. append
  • C. update
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
sincerebb
3 weeks, 5 days ago
Selected Answer: B
The answer is correct.
upvoted 1 times
...
bingomutant
7 months, 2 weeks ago
• Append Mode: Outputs only the new rows added during each trigger (i.e., each 5-minute interval in this case). This is ideal when you're continuously adding new records to the destination (e.g., a Delta Lake table) without updating existing data.
upvoted 2 times
...
Icyb3r
2 years, 8 months ago
Selected Answer: A
In question not mentioned that we need to keep old data. so the correct answer should be A
upvoted 1 times
MS_KoolaidMan
1 year, 11 months ago
Why would you use a data lake if you do not want to keep old data?
upvoted 1 times
...
...
nehima
2 years, 8 months ago
Selected Answer: B
The answer is append
upvoted 2 times
...
New_Azure_User
2 years, 9 months ago
Solution is 'Complete'. The Question stated - "will count new events in five-minute intervals and report only events that arrive during the interval. - Says 'NEW' events and 'ONLY ARRIVE DURING THE 5min INTERVAL' - so it can't be append because that would just add new data to the old data.
upvoted 3 times
zafara55
1 year, 9 months ago
This is correct. Only events that arrive during the 5 minutes interval are kept. A is correct.
upvoted 1 times
...
...
eric0718
3 years ago
Selected Answer: A
The preceding example continuously updates a table that contains the aggregate number of events by customer.
upvoted 2 times
...
CaptainJameson
3 years, 1 month ago
Selected Answer: A
Complete - The question doesn't state we need to keep existing data, only that we need from the given interval.
upvoted 1 times
...
AlCubeHead
3 years, 2 months ago
Selected Answer: B
Append
upvoted 2 times
...
VinayakBudapanahalli
3 years, 3 months ago
Selected Answer: B
Append is the answer
upvoted 2 times
...
sqljuanito
3 years, 4 months ago
I think append
upvoted 1 times
...
ramelas
3 years, 5 months ago
complete - You can also use Structured Streaming to replace the entire table with every batch. One example use case is to compute a summary using aggregation https://docs.databricks.com/delta/delta-streaming.html
upvoted 2 times
...
quermi
3 years, 5 months ago
https://docs.microsoft.com/en-us/azure/databricks/getting-started/spark/streaming Append Mode: Only new rows appended in the result table since the last trigger are written to external storage. This is applicable only for the queries where existing rows in the Result Table are not expected to change Then must be complete. append is if the table is static, with few changes.
upvoted 1 times
...
Manmeets27
3 years, 5 months ago
Selected Answer: B
it should be append
upvoted 2 times
ramelas
3 years, 5 months ago
why? it should be complete because you just want to report events that happen in those 5 five minute intervals, you dont want data from the window of 10 minutes ago neither 15 minutes ago. so you replace the entire table to achieve that and it is more straight forward
upvoted 1 times
...
...
Aggie0702
3 years, 7 months ago
The answer is correct.
upvoted 4 times
...
Dawn7
3 years, 8 months ago
Which is the correct answer? I think complete as the question do not ask to add new recrods.
upvoted 1 times
...
maple580122
3 years, 8 months ago
compete mode is correct. It requires count but not add a new record.
upvoted 2 times
...
Gesia
3 years, 8 months ago
Correc answer is B - Append
upvoted 1 times
ramelas
3 years, 5 months ago
why? it should be complete because you just want to report events that happen in those 5 minute intervals, you dont want data from the window of 10 minutes ago neither 15 minutes ago. so you replace the entire table to achieve that and it is more straight forward
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago