exam questions

Exam AWS Certified Machine Learning - Specialty All Questions

View all questions & answers for the AWS Certified Machine Learning - Specialty exam

Exam AWS Certified Machine Learning - Specialty topic 1 question 106 discussion

A Data Scientist is building a linear regression model and will use resulting p-values to evaluate the statistical significance of each coefficient. Upon inspection of the dataset, the Data Scientist discovers that most of the features are normally distributed. The plot of one feature in the dataset is shown in the graphic.

What transformation should the Data Scientist apply to satisfy the statistical assumptions of the linear regression model?

  • A. Exponential transformation
  • B. Logarithmic transformation
  • C. Polynomial transformation
  • D. Sinusoidal transformation
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
astonm13
Highly Voted 3 years, 1 month ago
I would say B. Logarithmic transformation converts skewed distributions towards normal
upvoted 20 times
...
AIWave
Most Recent 8 months, 3 weeks ago
I would go with B. For right skewed distributions -> Logrithmic transformation For left skewed distributions -> exponential transformations=
upvoted 1 times
...
Mickey321
1 year, 2 months ago
Selected Answer: B
The linear regression model assumes that the errors are normally distributed. The plot of the feature shows that the errors are not normally distributed. The logarithmic transformation can be used to transform the errors to be normally distributed. The exponential transformation, polynomial transformation, and sinusoidal transformation cannot be used to transform the errors to be normally distributed.
upvoted 1 times
...
ccpmad
1 year, 3 months ago
Selected Answer: B
B when the feature data is not normally distributed, applying a logarithmic transformation can help to normalize the data and satisfy the assumptions of the linear regression model.
upvoted 1 times
...
cpal012
1 year, 7 months ago
'A' would make it considerably worse.
upvoted 1 times
goku58
1 year, 4 months ago
Exponential transformation would make it exponentially worse. :D
upvoted 3 times
...
...
Zhechen0912
1 year, 7 months ago
Selected Answer: B
Log Normal Distribution => Log() => Normal Distribution
upvoted 2 times
...
sqavi
1 year, 9 months ago
Selected Answer: B
B is correct answer
upvoted 1 times
...
vetaal
2 years, 9 months ago
Selected Answer: B
This is B, as this feature seems skewed while others have a regular distribution according to the question. The log transformation will reduce this features skewness.
upvoted 2 times
...
YJ4219
3 years, 1 month ago
I think it's B. reference: https://corporatefinanceinstitute.com/resources/knowledge/other/positively-skewed-distribution/#:~:text=For%20positively%20skewed%20distributions%2C%20the,each%20value%20in%20the%20dataset. "For positively skewed distributions, the most popular transformation is the log transformation. The log transformation implies the calculations of the natural logarithm for each value in the dataset. The method reduces the skew of a distribution. Statistical tests are usually run only when the transformation of the data is complete."
upvoted 2 times
...
konradL
3 years, 1 month ago
I would also go for B, as Log transformation is often mentioned, when we are talking about right (positive) skewness.
upvoted 3 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago