A machine learning (ML) specialist wants to create a data preparation job that uses a PySpark script with complex window aggregation operations to create data for training and testing. The ML specialist needs to evaluate the impact of the number of features and the sample count on model performance.
Which approach should the ML specialist use to determine the ideal data transformations for the model?
dolorez
Highly Voted 3 years, 1 month agoJerry84
2 years, 5 months agoJerry84
2 years, 4 months agobluer1
Highly Voted 3 years, 1 month agoKlaudYu
3 years agosalim1905
Most Recent 1 year agoef12052
3 months ago3eb0542
1 year, 2 months agosanjosh
1 year, 7 months agoMickey321
1 year, 10 months agoADVIT
1 year, 11 months agodkx
2 years, 1 month agoMllb
2 years, 2 months agoZSun
2 years, 2 months agoblanco750
2 years, 3 months agoSANDEEP_AWS
2 years, 3 months agoZSun
2 years, 1 month agojhonivy
2 years, 4 months agoaScientist
2 years, 7 months agoovokpus
2 years, 12 months ago