Consider the following scenario:

You are an administrator of a large data set at a hospital, a phone provider, a search engine or a social network. The data you hold is very valuable, and you would like to make it publicly available so that you and the rest of the world can make a better use of it. However, the data is also highly sensitive (e.g., consisting of patient medical records)! So, even though you are only planning to release aggregate data, you still must do so in a way that does not compromise the privacy of any individual in the data set.

 

What can you do, and why? Those are the questions we will explore in this course.

The covered material will draw from learning theory, approximation algorithms, information theory, game theory, probability and geometry

Course grade based upon homework, reading and discussion and a project.