A user new to Databricks is trying to troubleshoot long execution times for some pipeline logic they are working on. Presently, the user is executing code cell-by-cell, using display() calls to confirm code is producing the logically correct results as new transformations are added to an operation. To get a measure of average time to execute, the user is running each cell multiple times interactively.
Which of the following adjustments will get a more accurate measure of how code is likely to perform in production?
c315d10
2 months agoKadELbied
2 months, 4 weeks agokishanu
3 months, 3 weeks agokishanu
3 months, 3 weeks agosuranga4
4 months, 1 week agomohadjhamad
4 months, 2 weeks agomouthwash
6 months, 4 weeks agoarekm
7 months ago