This is an open-ended question which I am analysing while doing some fun projects in leisure. Websites like Google, Facebook do store user activities and perhaps sell them or use them for advertisements. But I believe anonymity is maintained. For instance, I need to expose that there are 5 people between age 45-50 who are drug-addicts in a given area… but I don’t want to offer the names of the people. In fact I want to be sure that noone can access their names, otherwise resulting in privacy violation (Sorry for the weird example !)
Is there some tool/standard which everyone(especially smaller organizations) follow to limit legal complications due to identity theft like this ?
I am tagging MongoDB and Java, as I am using these technologies and answers specific to them would be easier to implement.
2
There is a company in my area (no I do not work for them or have any connection with them, although I did interview for a job there some time back) that does what they call “Anonymizing Health Data”. While they have various software solutions, the CEO and team have also produced a number of references that might be helpful in solving your problem. The list of books on their website can be found at:
http://www.privacyanalytics.ca/resources/books/
I suggest starting with those references and then seeing if there are more specific questions as to functionality that we might help you with.