You have built a custom model that performs several memory-intensive preprocessing tasks before it makes a prediction. You deployed the model to a Vertex AI endpoint, and validated that results were received in a reasonable amount of time. After routing user traffic to the endpoint, you discover that the endpoint does not autoscale as expected when receiving multiple requests. What should you do?