Context: I’m processing Reddit data. There is too much data to handle, therefore I created a random sample of the data. That leads in my network to have a lot of isolated nodes (emphasized, because isolated nodes are usually of degree 0, but here I am referring to degree <=2). An image is better than anything else:
The whole big gray ring is composed of nodes that are of degree 1 or 2.
Hence, I’d like to get rid of those nodes in order to have a more meaningful graph based on the sample I have.
Is this the correct approach? Is it feasible?