You have built a custom model that performs several memory-intensive preprocessing tasks before it makes a prediction. You deployed the model to a Vertex AI endpoint, and validated that results were received in a reasonable amount of time. After routing user traffic to the endpoint, you discover that the endpoint does not autoscale as expected when receiving multiple requests. What should you do?