Q: What types of private data does data science often collect?
Medical records, census data, browser history, location data, social media data.
Q: What can be learned from credit card metadata?
Sensitive patterns about a person’s location, identity, and shopping behavior.
Q: What did the study by De Montjoye et al. (2015) find about credit card data?
Four spatiotemporal points can reidentify 90% of individuals.
Q: How does knowing transaction price affect reidentification risk?
Increases reidentification risk by 22% on average.
Q: Are women more or less reidentifiable than men in credit card metadata?
More reidentifiable.
Q: How can medical data combined with voter data breach privacy?
Linking the two datasets on shared attributes such as birth date, ZIP code, and sex can reveal identities even if the medical data is anonymized.
Q: What is one method to anonymize location data?
Coarsening latitude/longitude into larger regions such as ZIP codes.
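A minimal sketch of this kind of coarsening, assuming a simple 0.1-degree grid (the cell size and the `coarsen` helper are illustrative, not from the source):

```python
def coarsen(lat: float, lon: float, cell_deg: float = 0.1) -> tuple:
    """Snap a coordinate to the corner of a coarse grid cell so that
    many nearby users report the same approximate location."""
    def snap(x: float) -> float:
        return round((x // cell_deg) * cell_deg, 6)
    return snap(lat), snap(lon)

print(coarsen(40.7812, -73.9665))  # (40.7, -74.0)
```

Larger cells give stronger anonymity but less useful data, the same trade-off as reporting only a ZIP code.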
Q: What is randomized response used for?
Protecting individual privacy in sensitive surveys.
Q: How does randomized response work?
Each individual answers truthfully with a known probability and randomly otherwise, so any single answer carries plausible deniability.
Q: What inequality is linked to high-probability accuracy in randomized response?
Chebyshev’s inequality.
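The mechanism and its concentration behavior can be simulated; a minimal sketch, where the truth probability `p`, the true rate, and the sample size are all illustrative choices:

```python
import random

def respond(truth: bool, p: float, rng: random.Random) -> bool:
    """With probability p answer truthfully; otherwise answer
    uniformly at random, which hides any individual's true answer."""
    if rng.random() < p:
        return truth
    return rng.random() < 0.5

def estimate(responses, p: float) -> float:
    """E[yes rate] = p*pi + (1 - p)/2, so inverting that affine map
    gives an unbiased estimate of the true rate pi."""
    rate = sum(responses) / len(responses)
    return (rate - (1 - p) / 2) / p

rng = random.Random(0)
p, true_pi, n = 0.7, 0.3, 100_000
truths = [rng.random() < true_pi for _ in range(n)]
answers = [respond(t, p, rng) for t in truths]
# Chebyshev's inequality bounds how far the estimate strays from
# true_pi with high probability; with n this large it lands close.
print(round(estimate(answers, p), 3))
```

No single response reveals anything definite about its respondent, yet the aggregate estimate recovers the population rate.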
Q: What is differential privacy?
A formal guarantee that adding or removing any single individual's data point changes a query's outcome distribution by only a bounded amount.
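One standard way to realize this guarantee for counting queries is the Laplace mechanism; a sketch, where the epsilon value and the data are illustrative:

```python
import math
import random

def private_count(values, predicate, epsilon: float, rng: random.Random) -> float:
    """A count query has sensitivity 1 (adding or removing one person
    changes it by at most 1), so Laplace noise of scale 1/epsilon
    masks any single individual's contribution."""
    true_count = sum(1 for v in values if predicate(v))
    u = rng.random() - 0.5                  # uniform on [-0.5, 0.5)
    scale = 1.0 / epsilon
    # Inverse-transform sample from Laplace(0, scale)
    noise = -scale * math.copysign(1, u) * math.log(1 - 2 * abs(u))
    return true_count + noise

rng = random.Random(0)
ages = [23, 35, 41, 29, 52, 38, 27, 45]
print(private_count(ages, lambda a: a < 40, epsilon=1.0, rng=rng))
```

Smaller epsilon means more noise and stronger privacy; the noisy answer is unbiased, so repeated queries average toward the true count (which is why real deployments also budget the number of queries).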
Q: What does adversarial machine learning study?
How to secure ML models against attacks like input perturbations.
Q: What famous example shows adversarial ML vulnerabilities?
A panda image, slightly perturbed, that a neural network confidently misclassifies (famously as a gibbon).
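The technique behind that example is the fast gradient sign method (FGSM); a toy version, where the random linear "model" and the epsilon stand in for a real network and are purely illustrative:

```python
import numpy as np

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
w = rng.normal(size=50)        # fixed "model" weights
x = rng.normal(size=50)        # a clean input with true label y = 1
y = 1.0

# Gradient of the logistic loss with respect to the *input*, not the weights
grad_x = (sigmoid(w @ x) - y) * w
eps = 0.25
x_adv = x + eps * np.sign(grad_x)   # small, worst-case perturbation

# The perturbed input always lowers the model's confidence in y = 1
print(sigmoid(w @ x), sigmoid(w @ x_adv))
```

Each coordinate moves only by eps, yet the shifts all align with the loss gradient, so their effect on the output compounds, which is why imperceptible pixel changes can flip a prediction.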
Q: What are three types of attacks on ML models?
Inversion, extraction, and data poisoning.
Q: What is an inversion attack?
Reconstructing sensitive input data from model outputs.
Q: What is an extraction attack?
Stealing model parameters or training data.
Q: What is data poisoning?
Maliciously injecting bad data into training to corrupt a model.
Q: What is federated learning?
Training models across decentralized devices without transferring raw data.
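A minimal numerical sketch of the idea, in the spirit of federated averaging, with three simulated clients fitting a shared linear model (the data, learning rate, and round count are all illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0])

# Raw data stays on each of three simulated devices
clients = []
for _ in range(3):
    X = rng.normal(size=(20, 2))
    y = X @ true_w + 0.1 * rng.normal(size=20)
    clients.append((X, y))

w = np.zeros(2)                          # shared global model
for _ in range(200):                     # communication rounds
    updates = []
    for X, y in clients:                 # local training on-device
        grad = 2 * X.T @ (X @ w - y) / len(y)
        updates.append(w - 0.1 * grad)   # one local gradient step
    w = np.mean(updates, axis=0)         # server averages the models

print(w)  # approaches true_w; no client ever shared X or y
```

Only model parameters cross the network, which is the privacy point of the approach, though attacks on shared updates are themselves an active research area.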