A credit card company wants to build a credit scoring model to help predict whether a new credit card applicant will default on a credit card payment. The company has collected data from a large number of sources with thousands of raw attributes. Early experiments to train a classification model revealed that many attributes are highly correlated, the large number of features slows down the training speed significantly, and that there are some overfitting issues.
The Data Scientist on this project would like to speed up the model training time without losing a lot of information from the original dataset.
Which feature engineering technique should the Data Scientist use to meet the objectives?
ahquiceno
Highly Voted 3 years, 8 months agoDr_Kiko
3 years, 7 months agoVinceCar
2 years, 7 months ago[Removed]
Highly Voted 3 years, 8 months agoSophieSu
3 years, 7 months agorodrigus
2 years, 3 months agoxicocaio
Most Recent 8 months, 3 weeks agoGiodefa96
10 months, 3 weeks agogeoan13
1 year, 7 months agoMickey321
1 year, 9 months agoMickey321
1 year, 9 months agokaike_reis
1 year, 10 months agovbal
2 years agoJK1977
2 years agoGOSD
2 years, 1 month agooso0348
2 years, 2 months agoPaolo991
2 years, 2 months agoSneep
2 years, 5 months agoAninina
2 years, 5 months agoovokpus
2 years, 11 months agoovokpus
2 years, 11 months ago