|
|
Department of Statistics, The Ohio State University
Statistics and Biostatistics Colloquium Series
Differential privacy and statistical hypothesis testing
Aleksandra Slavkovic
Penn State University
3:30PM - Thursday, November 5, 2009
Room 170, Eighteenth Avenue Bldg. (EA 170)
ABSTRACT
Data privacy is an overarching concern in modern society, as government
and non-government agencies alike collect, archive, and release increasing
amounts of sensitive personal data. In the first part of the talk, we give
a brief overview of data confidentiality problems, the related statistical
disclosure limitation methods and their link to other privacy-preserving
data techniques. The concept of differential privacy as a rigorous
definition of privacy has emerged from the cryptographic community.
However, further careful evaluation is needed before we can apply these
theoretical results to privacy preservation in everyday data mining and
statistical analysis. For example, many funding agencies and ethics boards
frequently request that a power analysis be completed before a study is
conducted, or before a study's results are published. In this talk we
demonstrate how to integrate a differential privacy framework with the
classical statistical hypothesis testing in domains such as clinical
trials where personal information is sensitive. We develop concrete
methodology that researchers can use. We derive rules for the sample size
adjustment whereby both statistical efficiency and differential privacy
can be achieved for the specific tests for binomial and normal random
variables and in contingency tables.
Meet the speaker in Room 212 Cockins Hall at 4:30
p.m. Refreshments will be served.
|