exam questions

Exam DP-201 All Questions

View all questions & answers for the DP-201 exam

Exam DP-201 topic 1 question 21 discussion

Actual exam question from Microsoft's DP-201
Question #: 21
Topic #: 1
[All DP-201 Questions]

You need to recommend a storage solution to store flat files and columnar optimized files. The solution must meet the following requirements:
✑ Store standardized data that data scientists will explore in a curated folder.
✑ Ensure that applications cannot access the curated folder.
✑ Store staged data for import to applications in a raw folder.
✑ Provide data scientists with access to specific folders in the raw folder and all the content the curated folder.
Which storage solution should you recommend?

  • A. Azure Synapse Analytics
  • B. Azure Blob storage
  • C. Azure Data Lake Storage Gen2
  • D. Azure SQL Database
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️
Azure Blob Storage containers is a general purpose object store for a wide variety of storage scenarios. Blobs are stored in containers, which are similar to folders.
Incorrect Answers:
C: Azure Data Lake Storage is an optimized storage for big data analytics workloads.
Reference:
https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-storage

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Sam9999
Highly Voted 5 years, 2 months ago
Shouldn't answer be C, there is no concept of folders and folder permissions in Azure storage.
upvoted 112 times
Marcus1612
3 years, 8 months ago
RBAC Security on Azure Blob can be scoped at the container level or above. With two containers (one for raw data and one for curated data) without folders, it would be possible to manage the security. BUT the current use case states that: " datascientists need access to specifics folders in the "raw" folder. You cannot manage security at this level with Azure Blob. You have to use Azure Data Lake with RBAC/ACLs. The right answer is C
upvoted 1 times
...
kempstonjoystick
5 years, 1 month ago
I agree, Azure Data Lake Stroage includes ACLs which can be applied to folder structures, which Blob Storage does not. Therefore the security requirements mean the answer should be ADLS
upvoted 13 times
MLCL
5 years, 1 month ago
There is the notion of public anonymous access in blob storage as well as shared access signatures, and of course RBAC can be implemented through Azure AD for Blobs and Queues, so the security requirements can be met. Check this doc : https://docs.microsoft.com/en-us/azure/storage/common/storage-auth-aad-rbac-portal
upvoted 10 times
Yuri1101
5 years, 1 month ago
Agree, especially it is only required to handle standardized data. There is no need to use ADLS.
upvoted 3 times
Leonido
5 years, 1 month ago
However, strictly speaking, in BLOB storage, data not stored in folders, just the name of the blob will include the folder name. So if the requirement is to store in folder, it have to be ADLS
upvoted 16 times
Leonido
5 years, 1 month ago
Also, in blob, without RBAC you can only grant permission to the level of container.
upvoted 11 times
...
...
...
lingjun
4 years, 6 months ago
When an Azure role is assigned to an Azure AD security principal, Azure grants access to those resources for that security principal. Access can be scoped to the level of the subscription, the resource group, the storage account, or an individual container or queue. An Azure AD security principal may be a user, a group, an application service principal, or a managed identity for Azure resources. https://docs.microsoft.com/de-de/azure/storage/common/storage-auth-aad
upvoted 2 times
...
...
...
tes
3 years, 11 months ago
there is, it is called container.
upvoted 2 times
...
...
HeB
Highly Voted 5 years, 1 month ago
Answer should definitely be C, Azure Data Lake Storage Gen2.
upvoted 45 times
...
tes
Most Recent 3 years, 11 months ago
The given answer is wrong and it should be C. The answer given states container is same as folder but it is not. A folder can have sub folders and access can be given only to sub folder. Where as in containers there are no sub containers hence the answer is wrong. Folder however can be given access in ADLS Gen2 using ACL so when we have a straight forward answer, why go with assumtion that 'container is same as folder'
upvoted 3 times
...
azurenav
3 years, 11 months ago
Azure Data Lake Store Gen2 is a superset of Azure Blob storage capabilities. In the list below, some of the key differences between ADLS Gen2 and Blob storage are summarized. ADLS Gen2 supports ACL and POSIX permissions allowing for more granular access control compared to Blob storage. ADLS Gen2 introduces a hierarchical namespace. This is a true file system, unlike Blob Storage which has a flat namespace. This capability has a significant impact on performance, especially in big data analytics scenarios. ADLS Gen2 is an HDFS-compatible store. This means that Apache Hadoop services can use data stored in ADLS Gen2. Azure Blob storage is not Hadoop-compatible.
upvoted 1 times
...
cadio30
3 years, 12 months ago
ADLS is the appropriate solution here as it has ACL function.
upvoted 1 times
...
Arjun16
4 years ago
In Question they mentioned about flat files and columnar optimized files(Binary Files) and Containers are similar to folders, so Azure storage is Correct
upvoted 1 times
...
cadio30
4 years ago
The requirements leads to using ADLS gen 2 as it can manage the folder level using ACL
upvoted 1 times
...
davita8
4 years ago
C. Azure Data Lake Storage Gen2
upvoted 2 times
...
rmk4ever
4 years, 1 month ago
Columnar optimized file for Raw, enriched and curated structure with Folder level access Ans is ADLS ref: https://www.dremio.com/data-lake/adls/ https://medium.com/microsoftazure/building-your-data-lake-on-adls-gen2-3f196fc6b430
upvoted 2 times
...
Deepu1987
4 years, 3 months ago
The given answer is correct as when you check the below link https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-storage it's clearly mentioned that ADLS can be used with certain restrictions it can be accessed via az synapse using poly base feature. There are certain performance tuning guidelines but in qn it's asked it need to be easily accessed by data scientistists as per the conditions we can go with blob storage
upvoted 1 times
...
AyeshJr
4 years, 3 months ago
I will choose Azure Datalake on the only fact that the question did ask for columnar optimized files and this is available in Datalake and not Azure storage account
upvoted 2 times
...
mohowzeh
4 years, 5 months ago
A folder can be created in a blob (e.g. via button "Create folder" in the portal) but such a folder is virtual. Using Azure Storage Explorer (presently v1.17.0), one can verify that an SAS can be created on a blob container, but not on a folder within a blob. Still, multiple containers could be created where each container maps to one group of users in the security requirements. This is not forbidden in the question. If each container has one or more folders, all requirements would still be met, making answer B a "minimum viable answer". However, I agree that answer C is the best and most flexible. Using Azure Storage Explorer, one can easily verify that the option "Manage Access Control Lists" is available on an individual folder.
upvoted 2 times
...
M0e
4 years, 7 months ago
The given answer is clearly incorrect. All the points that are mentioned in the questions are hints to use ADLS Gen 2.
upvoted 3 times
...
monumentalcrankiness
4 years, 7 months ago
The answer also mentions that the files are supposed to be explored by Data Scientists in curated folder. ADLS Gen 2 hooked up with Databricks or Azure Synapse Analytics is a ready-made solution for this kind of exploration.
upvoted 3 times
...
monumentalcrankiness
4 years, 7 months ago
I think correct answer should be ADLS Gen 2.
upvoted 2 times
...
yilpiz
4 years, 9 months ago
raw, curated folder, folder level access all characteristics of ADLS
upvoted 6 times
...
Bob123456
4 years, 9 months ago
I believe there is actually only a single layer of containers. You can virtually create a "file-system" like layered storage, but in reality everything will be in 1 layer, the container in which it is. So Answer should be DATA LAKE
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...