An organization creates and deploys a multi-class image classification deep learning model that uses a set of labeled photographs.
The software engineering team reports there is a heavy inferencing load for the prediction web services during the summer. The production web service for the model fails to meet demand despite having a fully-utilized compute cluster where the web service is deployed.
You need to improve performance of the image classification web service with minimal downtime and minimal administrative effort.
What should you advise the IT Operations team to do?
santoshpandit
Highly Voted 3 years, 4 months agoevangelist
Most Recent 4 months, 3 weeks agojames2033
1 year agoahmednani015
5 months, 1 week agomichaelmorar
1 year, 11 months agotchip
2 years, 8 months ago[Removed]
2 years, 8 months agotchip
2 years, 8 months agonick234987
3 years ago