Skip to content

Data Anonymization through Dimensionality Reduction #79

Discussion options

You must be logged in to vote

@MerlinSchaefer - great question! While every case is probably a little different - dimensionality reduction reduces across dimensions of each individual's data to create "latent features". So perhaps the simplest example is - with Machine Learning there's very little structure you need to know in order to de-anonymize. The most straightforward way would be to find the true records for a small percentage of the data and then learn a linear classifier (for linear techniques) or a non-linear classifier for more advanced compression techniques.

Worth mentioning - many dimensionality reduction techniques have fantastic differentially private alternatives.

Replies: 3 comments 3 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@em-blue
Comment options

@iamtrask
Comment options

Comment options

You must be logged in to vote
1 reply
@MerlinSchaefer
Comment options

Answer selected by MerlinSchaefer
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
5 participants