Exam DP-203 topic 1 question 78 discussion

Actual exam question from Microsoft's DP-203
Question #: 78
Topic #: 1

You have an Azure Synapse Analytics dedicated SQL pool.

You plan to create a fact table named Table1 that will contain a clustered columnstore index.

You need to optimize data compression and query performance for Table1.

What is the minimum number of rows that Table1 should contain before you create partitions?

  • A. 100,000
  • B. 600,000
  • C. 1 million
  • D. 60 million
Suggested Answer: D

Comments

Ankit_Az
Highly Voted 1 year, 11 months ago
Selected Answer: D
A clustered columnstore table will by default have 60 partitions. To achieve the best compression we need at least 1 million rows per partition, hence option D: 60 million (1M per partition).
upvoted 23 times
Vanq69
1 year, 7 months ago
You mean the dedicated SQL pool has 60 distributions "by default"?
upvoted 2 times
AkosL
1 year, 1 month ago
Not by default, but always
upvoted 1 times
...
Lscranio
1 year, 5 months ago
60 Million is correct
upvoted 2 times
...
...
...
EmnCours
Most Recent 5 months, 2 weeks ago
Selected Answer: D
Correct Answer: D
upvoted 1 times
...
ahana1074
8 months, 1 week ago
D is correct: Partitioning in a dedicated SQL pool in Azure Synapse Analytics is typically used to manage very large tables, and the recommendation is to start considering partitions when the table contains 60 million rows or more. Partitioning helps optimize both data compression and query performance by allowing the system to process smaller subsets of data more efficiently. However, partitioning comes with overhead, and if you partition tables that are too small, it can actually degrade performance. Clustered columnstore indexes are designed to provide efficient compression and query performance for large datasets. Partitioning further helps with large fact tables, but for smaller tables (e.g., fewer than 60 million rows), partitioning is usually not necessary and might even be counterproductive.
upvoted 1 times
...
renan_ineu
8 months, 3 weeks ago
"When creating partitions on clustered columnstore tables, it is important to consider how many rows belong to each partition. For optimal compression and performance of clustered columnstore tables, a minimum of 1 million rows per distribution and partition is needed. Before partitions are created, dedicated SQL pool already divides each table into 60 distributions." https://learn.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-partition#partition-sizing
upvoted 1 times
...
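The sizing rule quoted above can be checked with a few lines of Python. This is only a minimal sketch of the arithmetic: the function name `min_rows_for_partitions` and the two constants are illustrative names chosen here, not part of any Azure API.

```python
# Illustrative constants taken from the documentation quoted in this thread.
DISTRIBUTIONS = 60          # a dedicated SQL pool divides each table into 60 distributions
TARGET_ROWS = 1_000_000     # recommended minimum rows per distribution and partition


def min_rows_for_partitions(partition_count: int = 1) -> int:
    """Return the table size at which every distribution/partition
    combination can hold at least TARGET_ROWS rows (assuming even spread)."""
    if partition_count < 1:
        raise ValueError("partition_count must be at least 1")
    return DISTRIBUTIONS * partition_count * TARGET_ROWS


if __name__ == "__main__":
    print(min_rows_for_partitions())    # one partition per distribution
    print(min_rows_for_partitions(2))   # table split into two partitions
```

With a single partition per distribution this gives 60 million rows, which matches option D; each additional partition raises the threshold by another 60 million.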
jaco_0930
9 months, 3 weeks ago
Selected Answer: D
1M * 60 = 60M
upvoted 1 times
...
AA9292
10 months, 2 weeks ago
It is 1 million. The question is clearly asking for the minimum number of rows before creating partitions, and the Microsoft documentation states that 1 million rows is the minimum number of rows before partitioning a table.
upvoted 2 times
renan_ineu
8 months, 3 weeks ago
The question is asking for the minimum number of rows in the whole table, not the number per partition. So, in order to have at least 1 million rows per partition, you must have 60 million rows in the fact table.
upvoted 1 times
...
...
Siva_Jsn_23
11 months ago
Selected Answer: C
1 million (option C) is correct. You need the minimum number of rows to create an optimized partition. A single optimal partition requires 1 million rows.
upvoted 4 times
...
I think the correct answer should be 120 million rows. Since splitting the data up into 2 partitions would result in 1 million rows per distribution and partition for 120 million rows
upvoted 1 times
...
MBRSDG
1 year, 1 month ago
Selected Answer: D
Even without knowing how many partitions you're going to create, you know that each partition should contain at least 1 million rows. 60 million rows is the only option that makes partitioning viable.
upvoted 1 times
...
s_unsworth
1 year, 2 months ago
Selected Answer: D
Clustered columnstore tables begin to achieve optimal compression once there is more than 60 million rows. For small lookup tables, less than 60 million rows, consider using HEAP or clustered index for faster query performance. -- Microsoft
upvoted 3 times
...
Charley92
1 year, 3 months ago
Selected Answer: D
To optimize data compression and query performance for Table1 in an Azure Synapse Analytics dedicated SQL pool, you should create partitions when the table contains at least 60 million rows. Partitioning tables can improve query performance by reducing the amount of data that needs to be scanned. It can also improve data compression by allowing each partition to be compressed separately. In general, you should consider partitioning a table when it contains a large amount of data and queries frequently filter on a specific column or set of columns.
upvoted 2 times
...
58d2382
1 year, 4 months ago
Selected Answer: A
The question says "What is the minimum number of rows that Table1 should contain before you create (add/new/extra) partitions?" As per the Microsoft documentation, each partition will contain 1 million records. So, if there are at least 1 million records, we can go for partitioning. Here is the link to the documentation: https://learn.microsoft.com/en-us/sql/relational-databases/indexes/columnstore-indexes-overview?view=sql-server-ver16
upvoted 3 times
...
6d954df
1 year, 4 months ago
60m, see https://learn.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-partition
upvoted 2 times
...
blazy001
1 year, 5 months ago
A 100mil is correct. From the answers to this question, I see that MS has done a bad job, because people don't understand what distributions and partitions are. My explanation: each table with a columnstore index is automatically divided into 60 distributions, and each of these distributions automatically holds 1 partition. For good performance (with columnstore), each partition must have at least 1 million rows. The question was: "What is the minimum number of rows that Table1 should contain before you create (add/new/extra) partitions?" So there is no point in creating partitions with 60M rows, because then you divide this into 0.5 million per partition. At least 120 million would be ideal, but 100 million is already a starting point.
upvoted 1 times
...
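The "0.5 million per partition" figure in the comment above follows from dividing the table evenly over 60 distributions and then over the partitions. A rough sketch of that calculation, assuming a perfectly even spread of rows (the helper name `rows_per_partition_per_distribution` is hypothetical, used only for illustration):

```python
def rows_per_partition_per_distribution(total_rows: int,
                                        partition_count: int,
                                        distributions: int = 60) -> int:
    """Approximate rows in each distribution/partition slice,
    assuming rows are spread evenly (real data is rarely this uniform)."""
    if partition_count < 1 or distributions < 1:
        raise ValueError("partition_count and distributions must be at least 1")
    return total_rows // (distributions * partition_count)


if __name__ == "__main__":
    # 60 million rows, no extra partitions: 1,000,000 rows per slice.
    print(rows_per_partition_per_distribution(60_000_000, 1))
    # 60 million rows split into 2 partitions: 500,000 rows per slice.
    print(rows_per_partition_per_distribution(60_000_000, 2))
    # 120 million rows split into 2 partitions: back to 1,000,000 per slice.
    print(rows_per_partition_per_distribution(120_000_000, 2))
```

Read this way, 60 million rows is the point at which the unpartitioned table already meets the 1 million rows per distribution guideline, and splitting it further only pays off as the table grows beyond that.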
hassexat
1 year, 8 months ago
Selected Answer: D
60 million
upvoted 2 times
...
kkk5566
1 year, 8 months ago
Selected Answer: D
is correct
upvoted 2 times
...
Deeksha1234
1 year, 9 months ago
should be D
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other