I have a data sample that looks like the following:
<code>ID Outcome Flags
1 Win 728,491,294,889,551
2 Lose 491,889,541,904
3 Lose 491,904,213,541
</code>
<code>ID Outcome Flags
1 Win 728,491,294,889,551
2 Lose 491,889,541,904
3 Lose 491,904,213,541
</code>
ID Outcome Flags
1 Win 728,491,294,889,551
2 Lose 491,889,541,904
3 Lose 491,904,213,541
There could be hundreds of rows and there could be hundreds of potential flags.
My task is to work out which flags most closely correlate with winners and which correlate with losers; the ultimate aim is to “score” the flags so that we can predict the outcome based on the flags.
Is there a way to generate a correlation score per flag in a simple way?
Thanks