A machine learning (ML) specialist wants to create a data preparation job that uses a PySpark script with complex window aggregation operations to create data for training and testing. The ML specialist needs to evaluate the impact of the number of features and the sample count on model performance.
Which approach should the ML specialist use to determine the ideal data transformations for the model?
dolorez
Highly Voted 2 years, 11 months agoJerry84
2 years, 3 months agoJerry84
2 years, 2 months agobluer1
Highly Voted 3 years agoKlaudYu
2 years, 10 months agosalim1905
Most Recent 10 months, 3 weeks agoef12052
1 month, 1 week ago3eb0542
1 year agosanjosh
1 year, 5 months agoMickey321
1 year, 9 months agoADVIT
1 year, 10 months agodkx
1 year, 11 months agoMllb
2 years, 1 month agoZSun
2 years agoblanco750
2 years, 1 month agoSANDEEP_AWS
2 years, 1 month agoZSun
1 year, 12 months agojhonivy
2 years, 3 months agoaScientist
2 years, 5 months agoovokpus
2 years, 10 months ago