To reduce privacy risk, I should scrub the SSNs from the input. I need to scrub them in a way that preserves uniqueness. E.g., if I have 111-11-1119, 111-11-1119, and 111-11-1118, we could number 111-11-1119 as 1 and 111-11-1118 as 2.
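To make the desired mapping concrete, here is a minimal sketch of what I have in mind, assuming Python and that the SSNs arrive as a list of strings. The function name `scrub_ssns` is hypothetical, and this is only the naive approach, not necessarily the best one:

```python
def scrub_ssns(values):
    """Replace each distinct SSN with a sequential integer ID.

    IDs are assigned in order of first appearance, so equal SSNs
    always map to the same ID and distinct SSNs never collide.
    """
    mapping = {}   # SSN -> surrogate ID (hypothetical in-memory lookup table)
    scrubbed = []
    for value in values:
        if value not in mapping:
            mapping[value] = len(mapping) + 1
        scrubbed.append(mapping[value])
    return scrubbed, mapping


scrubbed, mapping = scrub_ssns(["111-11-1119", "111-11-1119", "111-11-1118"])
print(scrubbed)  # [1, 1, 2]
```

Note that the `mapping` dictionary still holds the raw SSNs, so it would have to be discarded or stored securely after scrubbing.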
What's the best way to do that?