exam questions

Exam DP-100 All Questions

View all questions & answers for the DP-100 exam

Exam DP-100 topic 3 question 20 discussion

Actual exam question from Microsoft's DP-100
Question #: 20
Topic #: 3
[All DP-100 Questions]

HOTSPOT -
You plan to preprocess text from CSV files. You load the Azure Machine Learning Studio default stop words list.
You need to configure the Preprocess Text module to meet the following requirements:
✑ Ensure that multiple related words from a single canonical form.
✑ Remove pipe characters from text.
Remove words to optimize information retrieval.

Which three options should you select? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

Show Suggested Answer Hide Answer
Suggested Answer:
Box 1: Remove stop words -
Remove words to optimize information retrieval.
Remove stop words: Select this option if you want to apply a predefined stopword list to the text column. Stop word removal is performed before any other processes.

Box 2: Lemmatization -
Ensure that multiple related words from a single canonical form.
Lemmatization converts multiple related words to a single canonical form
Box 3: Remove special characters
Remove special characters: Use this option to replace any non-alphanumeric special characters with the pipe | character.
Reference:
https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/preprocess-text

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
hyz123
Highly Voted 3 years, 7 months ago
"Remove special characters" should not be correct, because the requirement include remove pipe from the data, but this one will add pipe to the data. it should be "Split tokens on special characters", this one should remove the pipe.
upvoted 14 times
trickerk
2 years, 6 months ago
Split tokens on special characters just break words and won't remove the pipe(|). According the page https://docs.microsoft.com/en-us/azure/machine-learning/algorithm-module-reference/preprocess-text: Remove special characters: Use this option to remove any non-alphanumeric special characters. So given answer is correct.
upvoted 5 times
...
epgd
3 years, 7 months ago
Remove special characters: Use this option to replace any non-alphanumeric special characters with the pipe | character.
upvoted 11 times
trickerk
2 years, 6 months ago
I think this description changed: Remove special characters: Use this option to remove any non-alphanumeric special characters. https://docs.microsoft.com/en-us/azure/machine-learning/algorithm-module-reference/preprocess-text
upvoted 8 times
...
...
...
ljljljlj
Highly Voted 2 years, 6 months ago
On exam 2021/7/10
upvoted 7 times
...
victorafb
Most Recent 1 year, 3 months ago
Answer is corret, check on https://learn.microsoft.com/en-us/azure/machine-learning/component-reference/preprocess-text
upvoted 6 times
...
David_Tadeu
1 year, 10 months ago
I see these definitions 5Jun2019 - *Remove special characters*: Use this option to replace any non-alphanumeric special characters with the pipe | character. (https://docs.microsoft.com/en-us/previous-versions/azure/machine-learning/studio-module-reference/preprocess-text) 11Apr2021 - *Remove special characters*: Use this option to remove any non-alphanumeric special characters. (https://docs.microsoft.com/en-us/azure/machine-learning/component-reference/preprocess-text) So for the second I'm going with *Remove Special Characters*, i.e. I think the answer is right as of March 2022.
upvoted 7 times
michaelmorar
1 year, 1 month ago
Thank you for this explanation!
upvoted 1 times
...
...
Mrinals
2 years, 2 months ago
Remove special characters: Use this option to replace any non-alphanumeric special characters with the pipe | character. in link "https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/preprocess-text"
upvoted 1 times
...
trickerk
2 years, 6 months ago
Split tokens on special characters just break words and won't remove the pipe(|). According the page https://docs.microsoft.com/en-us/azure/machine-learning/algorithm-module-reference/preprocess-text: Remove special characters: Use this option to remove any non-alphanumeric special characters. So given answer is correct.
upvoted 2 times
...
azurecert2021
2 years, 7 months ago
looks like question is for designers or misprint the requirement as Remove special characters can not be the choice for studio as already explained by others.
upvoted 1 times
...
YG59
2 years, 8 months ago
It seems like that 'Remove Special Characters' has different definition in designer and studio. 1) Designer: Use this option to remove any non-alphanumeric special characters. https://docs.microsoft.com/en-us/azure/machine-learning/algorithm-module-reference/preprocess-text 2) Studio: Use this option to replace any non-alphanumeric special characters with the pipe | character. https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/preprocess-text
upvoted 5 times
...
lucazav
3 years, 3 months ago
The suggested answer works if you are using Azure ML Studio Classic (infact the picture is related to the Preprocess Text module of the Classic one). It doesn't work on the new Azure ML Designer. In order to remove the pipe there, you have to: - enter the string "(\|)" into the "Custom regular expression" textbox - leave the "Custom replacement string" textbox empty
upvoted 7 times
...
Kamalraj
3 years, 6 months ago
Remove special characters is correct, https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/preprocess-text
upvoted 5 times
allanm
2 years, 8 months ago
Incorrect, the requirement states to "remove pipe characters from text". Remove special characters will introduce the pipe (|) charcater in your "bag of words" which is not what is asked.
upvoted 2 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...