An organization creates and deploys a multi-class image classification deep learning model that uses a set of labeled photographs.
The software engineering team reports a heavy inferencing load on the prediction web service during the summer. The production web service for the model fails to meet demand even though the compute cluster where it is deployed is fully utilized.
You need to improve the performance of the image classification web service with minimal downtime and minimal administrative effort.
What should you advise the IT Operations team to do?