Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.

Unlimited Access

Get Unlimited Contributor Access to the all ExamTopics Exams!
Take advantage of PDF Files for 1000+ Exams along with community discussions and pass IT Certification Exams Easily.

Exam DP-203 topic 1 question 44 discussion

Actual exam question from Microsoft's DP-203
Question #: 44
Topic #: 1
[All DP-203 Questions]

DRAG DROP -
You have data stored in thousands of CSV files in Azure Data Lake Storage Gen2. Each file has a header row followed by a properly formatted carriage return (/ r) and line feed (/n).
You are implementing a pattern that batch loads the files daily into a dedicated SQL pool in Azure Synapse Analytics by using PolyBase.
You need to skip the header row when you import the files into the data warehouse. Before building the loading pattern, you need to prepare the required database objects in Azure Synapse Analytics.
Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.
NOTE: Each correct selection is worth one point
Select and Place:

Show Suggested Answer Hide Answer
Suggested Answer:
Step 1: Create an external data source that uses the abfs location
Create External Data Source to reference Azure Data Lake Store Gen 1 or 2
Step 2: Create an external file format and set the First_Row option.
Create External File Format.
Step 3: Use CREATE EXTERNAL TABLE AS SELECT (CETAS) and configure the reject options to specify reject values or percentages
To use PolyBase, you must create external tables to reference your external data.
Use reject options.
Note: REJECT options don't apply at the time this CREATE EXTERNAL TABLE AS SELECT statement is run. Instead, they're specified here so that the database can use them at a later time when it imports data from the external table. Later, when the CREATE TABLE AS SELECT statement selects data from the external table, the database will use the reject options to determine the number or percentage of rows that can fail to import before it stops the import.
Reference:
https://docs.microsoft.com/en-us/sql/relational-databases/polybase/polybase-t-sql-objects https://docs.microsoft.com/en-us/sql/t-sql/statements/create-external-table-as-select-transact-sql

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
sunil_smile
Highly Voted 1 year, 8 months ago
1) create database scoped credentials 2) create external source 3) create file format 4) create external table (it not supports CTAS)
upvoted 32 times
auwia
10 months, 3 weeks ago
It supports, it is a dedicated SQL pool (means not severless), reading the question: You are implementing a pattern that batch loads the files daily into a dedicated SQL pool in Azure Synapse Analytics by using PolyBase. --> provided answers are correct in my opinion.
upvoted 4 times
...
...
OldSchool
Highly Voted 1 year, 5 months ago
Because it's saying "You have data stored in thousands of CSV files in Azure Data Lake Storage Gen2" and "You are implementing a pattern that batch loads the files daily into a dedicated SQL pool in Azure Synapse Analytics by using PolyBase" assumption is that we already have database credentials, so the answer is: 1) create external source 2) create file format 3) create external table
upvoted 17 times
Rob77
11 months, 4 weeks ago
No, CETAS is not used for loading Azure Synapse Analytics. It's used to export data from and not to!
upvoted 3 times
...
...
dgerok
Most Recent 1 month ago
The provided answer is correct. 1) source 2) file format 3) CETAS - this is dedicated SQL Pool. So, it is preferred. While external table is the only option for SERVERLESS SQL Pool... https://learn.microsoft.com/en-us/sql/t-sql/statements/create-external-table-as-select-transact-sql?view=azure-sqldw-latest&tabs=powershell#examples
upvoted 1 times
...
ankeshpatel2112
1 month, 2 weeks ago
The correct answer are : 1) create database scoped credentials 2) create external source 3) create file format Why ? : Focus on this below statement in question. Before building the loading pattern, you need to prepare the required database objects in Azure Synapse Analytics. Explanation: You just need to specify about prerequisite database object ( Not Loading pattern which is about Creating External Table )
upvoted 2 times
...
hydmt07
5 months, 1 week ago
I think the answer (A, B, D) is very clearly explained on this page: https://learn.microsoft.com/en-us/azure/synapse-analytics/sql/create-use-external-tables
upvoted 2 times
...
ellala
7 months, 1 week ago
I believe the only reason why "create database scoped credential" is not a right answer is because it should be a managed identity instead of a service principal. Service principals are used for applications outside of the Azure Environment (such as SQL Server, as some comments here refer to SQL Server documentation). But since we are using Synapse Analytics environment, we use managed identities. Check the link: https://learn.microsoft.com/en-us/sql/t-sql/statements/create-external-table-as-select-transact-sql?view=azure-sqldw-latest&preserve-view=true&tabs=powershell#g-use-create-external-table-as-select-with-a-view-as-the-source And if you want to compare to SQL Server documentation, where indeed they use Service principal keys (which is not the situation we are given in this question): https://learn.microsoft.com/en-us/sql/relational-databases/polybase/polybase-t-sql-objects?view=sql-server-ver16#create-external-tables-for-azure-data-lake-store
upvoted 2 times
...
pperf
7 months, 1 week ago
The provided Answer is correct check yourself, goto F section in the following link https://learn.microsoft.com/en-us/sql/t-sql/statements/create-external-table-as-select-transact-sql?view=azure-sqldw-latest&tabs=powershell#examples
upvoted 2 times
SushilJinder
2 weeks, 2 days ago
F section shows all the options given in this questions :) meaning including CETAS and data credential also
upvoted 1 times
...
...
Ram9198
8 months, 1 week ago
Before building the loading pattern, you need to prepare the required database objects in Azure Synapse Analytics - Loading pattern is CETAS, so answer DSC DS EFF
upvoted 1 times
...
kkk5566
8 months, 2 weeks ago
source ,format ,external
upvoted 3 times
...
eladioyovera
9 months, 3 weeks ago
The answer is correct, - Create database scoped credential: "This step is required only for Kerberos-secured Hadoop clusters." In this case, the previous step does not apply.
upvoted 1 times
...
vctrhugo
10 months, 3 weeks ago
1. Create database scoped credential 2. Create external data source 3. Create external file format https://learn.microsoft.com/en-us/sql/relational-databases/polybase/polybase-t-sql-objects?view=sql-server-ver16#create-external-tables-for-azure-blob-storage
upvoted 5 times
...
mamahani
1 year ago
the given answer is correct imo; "'PolyBase loads can be run using CTAS or INSERT INTO. CTAS will minimize transaction logging and is the fastest way to load your data. "' https://learn.microsoft.com/en-us/azure/synapse-analytics/sql/best-practices-dedicated-sql-pool#use-polybase-to-load-and-export-data-quickly
upvoted 1 times
mamahani
1 year ago
sorry, wrong link: https://learn.microsoft.com/en-us/sql/relational-databases/polybase/polybase-t-sql-objects?view=sql-server-ver16 "'PolyBase can now use CETAS to create an external table and then export, in parallel, the result of a Transact-SQL SELECT statement to Azure Data Lake Storage Gen2, Azure Storage Account V2, and S3-compatible object storage."' "'Creates an external table and then exports, in parallel, the results of a Transact-SQL SELECT statement. For Azure Synapse Analytics and Analytics Platform System, Hadoop or Azure Blob storage are supported."'
upvoted 3 times
...
...
olegjdll
1 year, 3 months ago
When using serverless SQL pool, CETAS is used to create an external table and export query results to Azure Storage Blob or Azure Data Lake Storage Gen2 and we need to implement a pattern that batch loads the files daily into a dedicated SQL pool in Azure Synapse Analytics, so: 1) create database scoped credentials 2) create external source 3) create file format
upvoted 5 times
...
Rakrah
1 year, 3 months ago
In this question clearly stating that, "Before building the loading pattern, you need to prepare the required database objects" So Database objects list 1) Data Source 2) Data File Format 3) Table ( need to set up skip the header row) My Answer is 1) Create external source; 2) Create File Format ; 3) Create External Table
upvoted 1 times
...
aws123
1 year, 4 months ago
CTAS for external table is to write the result of the query (select) in a destination folder. So the good answer for this question is : ) create database scoped credentials 2) create external source 3) create file format
upvoted 3 times
...
rohanb1986
1 year, 4 months ago
Should be - 1) create database scoped credentials 2) create external source 3) create file format As per https://learn.microsoft.com/en-us/sql/relational-databases/polybase/polybase-t-sql-objects?view=sql-server-ver16 - Check under : Create external tables for Azure Blob Storage - CTAS is not an option
upvoted 2 times
mamahani
1 year ago
yes it is an option PolyBase loads can be run using CTAS or INSERT INTO. CTAS will minimize transaction logging and is the fastest way to load your data. https://learn.microsoft.com/en-us/azure/synapse-analytics/sql/best-practices-dedicated-sql-pool#use-polybase-to-load-and-export-data-quickly
upvoted 1 times
mamahani
1 year ago
sorry, wrong link: https://learn.microsoft.com/en-us/sql/relational-databases/polybase/polybase-t-sql-objects?view=sql-server-ver16 "'PolyBase can now use CETAS to create an external table and then export, in parallel, the result of a Transact-SQL SELECT statement to Azure Data Lake Storage Gen2, Azure Storage Account V2, and S3-compatible object storage."' "'Creates an external table and then exports, in parallel, the results of a Transact-SQL SELECT statement. For Azure Synapse Analytics and Analytics Platform System, Hadoop or Azure Blob storage are supported."'
upvoted 1 times
...
...
...
Bro111
1 year, 5 months ago
What is azure active directory application? is it managed identity?
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...