Exam Professional Machine Learning Engineer topic 1 question 326 discussion

Actual exam question from Google's Professional Machine Learning Engineer

Question #: 326
Topic #: 1

[All Professional Machine Learning Engineer Questions]

You work at an organization that manages a popular payment app. You built a fraudulent transaction detection model by using scikit-learn and deployed it to a Vertex AI endpoint. The endpoint is currently using 1 e2-standard-2 machine with 2 vCPUs and 8 GB of memory. You discover that traffic on the gateway fluctuates to four times more than the endpoint's capacity. You need to address this issue by using the most cost-effective approach. What should you do?

A. Re-deploy the model with a TPU accelerator.
B. Change the machine type to e2-highcpu-32 with 32 vCPUs and 32 GB of memory.
C. Set up a monitoring job and an alert for CPU usage. If you receive an alert, scale the vCPUs as needed.
D. Increase the number of maximum replicas to 6 nodes, each with 1 e2-standard-2 machine.

Show Suggested Answer

Suggested Answer: D 🗳️

by Duke_CT at June 20, 2025, 3:52 a.m.

Comments

Submit Cancel

ricardovazz

6 days, 17 hours ago

Selected Answer: D

D, scale horizontally increasing replicas for fluctuating traffic

upvoted 1 times

...

kirukkuman

1 week ago

Selected Answer: D

The correct answer is D. This approach uses horizontal autoscaling, which is the most cost-effective and efficient way to handle fluctuating traffic on a Vertex AI endpoint

upvoted 1 times

...

Duke_CT

2 weeks, 5 days ago

Selected Answer: A

Could be D, thoughts?

upvoted 1 times

...