
In healthcare there's usually an ethics panel that will look at the data and look for ways to reduce the risk of re-identification.

The common example is the one-legged child with cancer from a remote town. You can remove all the PII columns and it's still pretty easy to find that person.




One way around that is to drop all cases below a certain occurrence threshold, i.e. if there aren't at least 1000 people in the same town with the same condition, they aren't getting into the dataset.

(The downside is that rare diseases might fall through the cracks.)
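For what it's worth, here's a rough sketch of that kind of threshold filter in Python. The function name, the field names (`town`, `condition`) and the tiny threshold are all made up for illustration; a real pipeline would pick its own grouping columns and cutoff, and would likely do more than plain suppression.

```python
from collections import Counter


def suppress_rare_groups(records, keys, min_count):
    """Keep only records whose combination of `keys` occurs at least `min_count` times.

    `records` is a list of dicts; `keys` names the columns that together could
    identify someone (hypothetical names here, e.g. town + condition).
    """
    def group_of(record):
        return tuple(record[k] for k in keys)

    # Count how many records share each combination of identifying values.
    counts = Counter(group_of(r) for r in records)

    # Drop every record that belongs to a group smaller than the threshold.
    return [r for r in records if counts[group_of(r)] >= min_count]


# Toy data: three people share a group, one person is alone in theirs.
records = [
    {"town": "A", "condition": "flu"},
    {"town": "A", "condition": "flu"},
    {"town": "A", "condition": "flu"},
    {"town": "B", "condition": "rare"},
]

# Threshold of 3 for the toy data; the comment above uses 1000.
kept = suppress_rare_groups(records, keys=("town", "condition"), min_count=3)
```

With the toy data, the lone record from town B is the one that gets dropped, which is exactly the rare-disease trade-off mentioned above.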



