Exam DP-203 topic 2 question 57 discussion

Actual exam question from Microsoft's DP-203
Question #: 57
Topic #: 2

You are designing a solution that will copy Parquet files stored in an Azure Blob storage account to an Azure Data Lake Storage Gen2 account.
The data will be loaded daily to the data lake and will use a folder structure of {Year}/{Month}/{Day}/.
You need to design a daily Azure Data Factory data load to minimize the data transfer between the two accounts.
Which two configurations should you include in the design? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

  • A. Specify a file naming pattern for the destination.
  • B. Delete the files in the destination before loading the data.
  • C. Filter by the last modified date of the source files.
  • D. Delete the source files after they are copied.
Suggested Answer: AC

Comments

Philipp
Highly Voted 3 years, 2 months ago
Selected Answer: AC
AC is correct. Nothing in the question calls for deleting the source files, and the data may need to stay in the source too.
upvoted 15 times
...
necktru
Highly Voted 3 years ago
Selected Answer: AC
I think option C has an impact on data transfer, B is incorrect, D is irrelevant to the question, and A complements the task.
upvoted 7 times
...
moneytime
Most Recent 1 year, 2 months ago
C & D. The question asks for a solution that reduces data transfer between the two systems. That is the same as asking how redundancy, or multiple copies of the data, can be avoided during copying. Explanation: 1) Deleting the source files after they are copied keeps track of where to start the next copy. 2) Filtering by the last modified date of the source files also helps keep track of where to resume the file movement from.
upvoted 1 times
...
kkk5566
1 year, 7 months ago
Selected Answer: AC
AC is correct
upvoted 2 times
...
Spinozabubble
1 year, 12 months ago
A. Specify a file naming pattern for the destination: By specifying a file naming pattern for the destination files in the Azure Data Lake Storage Gen2 account, you can ensure that the files are organized and stored in a structured manner. This can help with data management and subsequent processing.
C. Filter by the last modified date of the source files: By filtering the source files based on the last modified date, you can select only the files that have been modified on the current day. This reduces the amount of data transferred and improves the efficiency of the data load process.
upvoted 5 times
...
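As a rough illustration of options A and C as described in the comment above, the sketch below selects only the files modified within one day and maps each to a {Year}/{Month}/{Day}/ destination path. It is plain Python, not Azure Data Factory configuration: `SourceFile`, `select_daily_files`, `destination_path`, and `plan_daily_load` are hypothetical names, and the half-open UTC day window is an assumption. In a Copy activity the comparable source settings would presumably be the `modifiedDatetimeStart` and `modifiedDatetimeEnd` filters.

```python
from dataclasses import dataclass
from datetime import date, datetime, timedelta, timezone


@dataclass(frozen=True)
class SourceFile:
    """Hypothetical stand-in for a blob listing entry."""
    name: str
    last_modified: datetime


def select_daily_files(files, window_start, window_end):
    # Option C: keep only files modified inside [window_start, window_end).
    return [f for f in files if window_start <= f.last_modified < window_end]


def destination_path(file, load_date):
    # Option A: build the destination name from the {Year}/{Month}/{Day}/ pattern.
    return f"{load_date:%Y}/{load_date:%m}/{load_date:%d}/{file.name}"


def plan_daily_load(files, load_date):
    # Assumption: the daily window is one UTC calendar day.
    window_start = datetime(load_date.year, load_date.month, load_date.day,
                            tzinfo=timezone.utc)
    window_end = window_start + timedelta(days=1)
    selected = select_daily_files(files, window_start, window_end)
    return {f.name: destination_path(f, load_date) for f in selected}
```

With a listing that contains one file modified on the load date and one modified the day before, only the first appears in the plan, so only that file would be transferred.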
Deeksha1234
2 years, 9 months ago
Selected Answer: AC
should be AC
upvoted 5 times
...
Boumisasound
3 years, 1 month ago
I will go for AC. Why not D? Because cost optimization is not mentioned.
upvoted 3 times
...
boopathi
3 years, 2 months ago
Are A and D correct?
upvoted 1 times
...
Istiaque
3 years, 2 months ago
The requirement is to minimize the data transfer. If we delete the files in the source, there is no need to filter for the daily load, so the combination C,D is incorrect. Besides, there is no requirement to minimize cost. In my view, AC is correct because, even though filtering by the modified date will take a long time when there are a lot of files, it won't impact the transfer.
upvoted 4 times
...
dev2dev
3 years, 3 months ago
Selected Answer: AD
Normally we move the files after being processed, so it has to be D.
upvoted 6 times
yo1233
3 years, 3 months ago
Is A, D correct?
upvoted 2 times
...
...
rainbowyu
3 years, 3 months ago
Should it be A & D, as the requirement is to minimize the process time? Will option C take longer compared to D?
upvoted 2 times
djblue
3 years, 2 months ago
Minimizing the process time is not part of the question. It says "minimize the data transfer", whatever that means, either time or amount.
upvoted 4 times
...
...
Canary_2021
3 years, 3 months ago
Selected Answer: CD
Either C or D can achieve a daily incremental load. Not sure why both need to be set up.
upvoted 3 times
...
edba
3 years, 4 months ago
should it be C, D?
upvoted 2 times
Dusica
2 years, 3 months ago
YOU CAN'T GO WITHOUT A
upvoted 4 times
drosen
7 months ago
why? "Minimizing the data transfer" has nothing to do with A.
upvoted 1 times
...
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted