You manage a process that performs analysis of daily web traffic logs on an HDInsight cluster. Each of the 250 web servers generates approximately 10 megabytes (MB) of log data each day. All log data is stored in a single folder in Microsoft Azure Data Lake Storage Gen 2.
You need to improve the performance of the process.
Which two changes should you make? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.
Cassielovedata
Highly Voted 4 years, 8 months agoakram786
Most Recent 4 years, 3 months agomohowzeh
4 years, 5 months agoAb5381
4 years, 5 months agosyu31svc
4 years, 6 months agoRajatNaik
4 years, 10 months agoJPaul
4 years, 10 months ago