Printable PDF
Department of Mathematics,
University of California San Diego

****************************

Math 278B: Mathematics of Information, Data, and Signals

Yiyun He

UCI

Differentially Private Algorithms for Synthetic Data

Abstract:

We present a highly effective algorithmic approach, PMM, for generating differentially private synthetic data in a bounded metric space with near-optimal utility guarantees under the 1-Wasserstein distance. In particular, for a dataset in the hypercube [0,1]^d, our algorithm generates synthetic dataset such that the expected 1-Wasserstein distance between the empirical measure of true and synthetic dataset is O(n^{-1/d}) for d>1. Our accuracy guarantee is optimal up to a constant factor for d>1, and up to a logarithmic factor for d=1. Also, PMM is time-efficient with a fast running time of O(\epsilon d n). Derived from the PMM algorithm, more variations of synthetic data publishing problems can be studied under different settings.

January 17, 2025

11:00 AM

APM 2402

Research Areas

Mathematics of Information, Data, and Signals

****************************