exam questions

Exam DP-201 All Questions

View all questions & answers for the DP-201 exam

Exam DP-201 topic 7 question 4 discussion

Actual exam question from Microsoft's DP-201
Question #: 4
Topic #: 7
[All DP-201 Questions]

HOTSPOT -
Which Azure service and feature should you recommend using to manage the transient data for Data Lake Storage? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

Show Suggested Answer Hide Answer
Suggested Answer:
Scenario: Stage inventory data in Azure Data Lake Storage Gen2 before loading the data into the analytical data store. Litware wants to remove transient data from Data Lake Storage once the data is no longer in use. Files that have a modified date that is older than 14 days must be removed.

Service: Azure Data Factory -
Clean up files by built-in delete activity in Azure Data Factory (ADF).
ADF built-in delete activity, which can be part of your ETL workflow to deletes undesired files without writing code. You can use ADF to delete folder or files from
Azure Blob Storage, Azure Data Lake Storage Gen1, Azure Data Lake Storage Gen2, File System, FTP Server, sFTP Server, and Amazon S3.
You can delete expired files only rather than deleting all the files in one folder. For example, you may want to only delete the files which were last modified more than 13 days ago.

Feature: Delete Activity -
Reference:
https://azure.microsoft.com/sv-se/blog/clean-up-files-by-built-in-delete-activity-in-azure-data-factory/
Design data processing solutions

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
AhmedReda
Highly Voted 4 years, 11 months ago
The question asked to remove files older than 14 days which i think ADF & Delete could not do it, so the answer might be = (1) Azure Storage (2) Lifecycle management rule
upvoted 24 times
Sai02
4 years, 7 months ago
In ADF, the Metadata activity has the LastModified property through which we can delete the files I believe.
upvoted 5 times
...
...
bansal_vikrant
Highly Voted 5 years, 1 month ago
The files are stored in ADLS Gen2 which supports Life cycle management rules
upvoted 23 times
vrmei
3 years, 11 months ago
Yes, This is correct.
upvoted 1 times
...
Psycho
4 years ago
https://azure.microsoft.com/en-au/updates/lifecycle-management-for-azure-data-lake-storage-is-now-generally-available/
upvoted 1 times
...
...
hoangton
Most Recent 3 years, 11 months ago
Given answer is correct (1)ADF (2)Delete activity
upvoted 1 times
...
kn_shn
3 years, 11 months ago
From older comments, ADF + Delete and Azure Storage + Lifecycle management rule seem to have similar functionality to remove files. However there is a difference: Liftcycle is defined based on the creation of the file, and in this question and context, it says:" Files that have a modified date that is older than 14 days must be removed". i.e. the file removal is based on the modified date. As BungyTex confirmed below, ADF + Delete can achieve this objective and the answer is correct.
upvoted 3 times
...
savin
4 years ago
Azure storage lifecycle management should be easier option
upvoted 2 times
...
Dymize
4 years ago
The way i see this, if the inventory data is coming from a microsoft SQL server, it is being ingested by ADF and not in Azure Storage, and if using ADF then the delete activity should be used. As per other comments this is proven to work
upvoted 1 times
...
davita8
4 years, 1 month ago
Azure storage lifecycle management
upvoted 6 times
...
felmasri
4 years, 2 months ago
Azure Data Lake Storage lifecycle management is now generally available https://azure.microsoft.com/en-us/updates/lifecycle-management-for-azure-data-lake-storage-is-now-generally-available/
upvoted 4 times
...
Needium
4 years, 3 months ago
The prefered option should be Az Storage and life cycle management rule
upvoted 3 times
...
lky17
4 years, 3 months ago
The correct answer should be Az Store and Lifecycle ... because ADLSG2 lets delete any file, the unique exception is "If you use the Delete Blob API to delete a directory, that directory will be deleted only if it's empty. This means that you can't use the Blob API delete directories recursively." and support all operation in lifecycle management except "Lifecycle management policies with premium tier for Azure Data Lake Storage. You can't move data that's stored in the premium tier between hot, cool, and archive tiers. However, you can copy data from the premium tier to the hot access tier in a different account." Ref https://docs.microsoft.com/en-us/azure/storage/blobs/data-lake-storage-known-issues
upvoted 2 times
...
NasRim
4 years, 3 months ago
lifecycle management is available in ADLS from July 31, 2020 https://azure.microsoft.com/en-us/updates/lifecycle-management-for-azure-data-lake-storage-is-now-generally-available/
upvoted 3 times
...
ThijsN
4 years, 4 months ago
Both ADF with delete or storage with lifecyle will work. I literally build the last one this week. I think that is the best solution as this is the cheapest and easiest. Doesn't cost anything to run, to build or to maintain.
upvoted 3 times
...
mohowzeh
4 years, 4 months ago
Seems to me that there are two valid combinations: (Azure Data Factory, delete activity) and (Azure storage, Lifecycle management)
upvoted 4 times
memo43
4 years ago
and second one the easiest!!
upvoted 1 times
...
...
KasiaK
4 years, 5 months ago
Lifecycle management policies (delete blob): Generally available in Premium, Generally available in Standard https://docs.microsoft.com/pl-pl/azure/storage/blobs/data-lake-storage-supported-blob-storage-features
upvoted 2 times
...
syu31svc
4 years, 6 months ago
https://azure.microsoft.com/en-us/updates/lifecycle-management-for-azure-data-lake-storage-is-now-generally-available/ Azure storage and lifecycle management rule are the answers
upvoted 9 times
...
BungyTex
4 years, 6 months ago
I just tested this in my ADL Gen 2, can set a rule to delete files last modifed more than 14 days ago.
upvoted 6 times
...
NikP
4 years, 10 months ago
Now, Lifecycle management is supported for accounts that have a hierarchical namespace for General-purpose V2. With this, you can reduce the delete activity (less cost even it is negligible for a pipeline). However, I would prefer to use delete activity in ADF to make sure that they got deleted after I load them to database. Better than auto delete through lifecycle. For me, given answer is correct based on requirement.
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...