Data Perturbation

Start: 2022-10-27T16:00:00-07:00
End: 2022-10-27T17:00:00-07:00
Location: Donald Bren Hall

Xiaotong Shen

University of Minnesota

Abstract:

Data perturbation is a technique for generating synthetic data by adding “noise” to the original data. It has a wide range of applications, primarily in data security, yet it has received little attention within data science. In this presentation, I will describe a fundamental principle of data perturbation that preserves distributional information, thereby ensuring the validity of downstream analysis and machine learning tasks while protecting data privacy. Applying this principle, we derive a scheme that allows a user to perturb data nonlinearly while meeting the requirements of both differential privacy and statistical analysis. The scheme yields credible statistical analysis and high predictive accuracy in machine learning tasks. Finally, I will highlight multiple facets of data perturbation through examples.

This is joint work with B. Xuan and R. Shen.

Date & Time

Thursday, October 27, 2022

4:00 PM - 5:00 PM

Location

6011, Donald Bren Hall
Irvine, CA 92697, United States

Organizer

Department of Statistics

Cost

Free
