Finding features with linear predictive power by maximizing a correlation; for binary classification problems; why isn’t this working?
train-4.csv has 200000 rows and roughly 200 columns. train-4.csv has as its first column, either the ground truth 0 or the ground truth 1. aux-n.csv where n>0 is meant to store what happens when we apply the solution for train-4 in the algorithm.