You have trained a DNN regressor with TensorFlow to predict housing prices using a set of predictive features. Your default precision is tf.float64, and you use a standard TensorFlow estimator:
Your model performs well, but just before deploying it to production, you discover that your current serving latency is 10ms @ 90 percentile and you currently serve on CPUs. Your production requirements expect a model latency of 8ms @ 90 percentile. You're willing to accept a small decrease in performance in order to reach the latency requirement.
Therefore your plan is to improve latency while evaluating how much the model's prediction decreases. What should you first try to quickly lower the serving latency?
0e6b9e2
7 months, 1 week agobaimus
10 months, 4 weeks agofitri001
1 year, 3 months agogscharly
1 year, 3 months agoCarlose2108
1 year, 5 months agoTayoso
1 year, 7 months agoMickey321
1 year, 8 months agoMickey321
1 year, 8 months agoandresvelasco
1 year, 10 months agoVoyager2
2 years, 1 month agojulliet
2 years, 1 month agoVoyager2
2 years, 2 months agoaryaavinash
2 years, 2 months agoM25
2 years, 2 months agoM25
2 years, 2 months agoM25
2 years, 2 months agoM25
2 years, 2 months ago[Removed]
2 years, 3 months agofrangm23
2 years, 3 months ago[Removed]
2 years, 2 months agotavva_prudhvi
2 years agoTNT87
2 years, 5 months agoTNT87
2 years, 4 months agoTNT87
2 years, 5 months agoTNT87
2 years, 5 months agoimamapri
2 years, 6 months ago