exam questions

Exam DP-200 All Questions

View all questions & answers for the DP-200 exam

Exam DP-200 topic 1 question 57 discussion

Actual exam question from Microsoft's DP-200
Question #: 57
Topic #: 1
[All DP-200 Questions]

HOTSPOT -
You have two Azure Storage accounts named Storage1 and Storage2. Each account contains an Azure Data Lake Storage file system. The system has files that contain data stored in the Apache Parquet format.
You need to copy folders and files from Storage1 to Storage2 by using a Data Factory copy activity. The solution must meet the following requirements:
✑ No transformations must be performed.
✑ The original folder structure must be retained.
How should you configure the copy activity? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

Show Suggested Answer Hide Answer
Suggested Answer:
Box 1: Parquet -
For Parquet datasets, the type property of the copy activity source must be set to ParquetSource..

Box 2: PreserveHierarchy -
PreserveHierarchy (default): Preserves the file hierarchy in the target folder. The relative path of the source file to the source folder is identical to the relative path of the target file to the target folder.
Incorrect Answers:
FlattenHierarchy: All files from the source folder are in the first level of the target folder. The target files have autogenerated names.
MergeFiles: Merges all files from the source folder to one file. If the file name is specified, the merged file name is the specified name. Otherwise, it's an autogenerated file name.
Reference:
https://docs.microsoft.com/en-us/azure/data-factory/format-parquet https://docs.microsoft.com/en-us/azure/data-factory/connector-azure-data-lake-storage

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
SorinXp
Highly Voted 4 years ago
The first box should be "Binary". It says - no transformation.
upvoted 6 times
[Removed]
3 years, 11 months ago
Binary is only for Binary format: https://docs.microsoft.com/en-us/azure/data-factory/format-binary
upvoted 1 times
lgtiza
3 years, 11 months ago
Every parquet file is also a binary file. I think the key is "no transformations", so why the extra work of interpreting a parquet file?! Binary and preserve hierarchy should do it imo.
upvoted 3 times
...
...
...
medsimus
Most Recent 3 years, 8 months ago
First box should be "Binary" . I tested it with the 2 options . using paquet i got an error with the following message : "Dataset Parquet1 location is a folder, the wildcard file name is required for Copy data1"
upvoted 3 times
...
Hinzzz
3 years, 11 months ago
The given answer is correct Parquet and preserve hierarchy
upvoted 4 times
...
CarNama_IG
3 years, 11 months ago
You can use Binary dataset in Copy activity, GetMetadata activity, or Delete activity. When using Binary dataset, ADF does not parse file content but treat it as-is. When using Binary dataset in copy activity, you can only copy from Binary dataset to Binary dataset..so the ans should be parquet
upvoted 1 times
hello_there_
3 years, 11 months ago
Why does it need to be parquet? Just configure the sink dataset as binary as well. This way ADF doesn't need to parse the files. You just need parquet if you want to do some transformation or when the sink dataset is an existing parquet dataset
upvoted 1 times
...
...
maciejt
4 years ago
It should be Binary - it copies the files as they are, no need to parse the parquet format if you don't need to transform them.
upvoted 2 times
...
cadio30
4 years, 1 month ago
Agree with the answer as both source and sink can accommodate "parquet" extension files using the behavior as seen below. Try working it on ADFv2 File Format: Parquet (source and sink) Copy behavior: Preserve Hierarchy
upvoted 4 times
...
dangal95
4 years, 1 month ago
Answer is correct. https://docs.microsoft.com/en-us/azure/data-factory/format-parquet
upvoted 3 times
...
Dark12arrow
4 years, 1 month ago
do u have any reference ? and if u cant use parquet to load parquet files whats the point of ever choosing parquet?
upvoted 2 times
...
eliabsbueno
4 years, 1 month ago
The first box should be "Binary". You can't use a parquet data source to load different parquet files.
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...