Differentially Private Sampling from Distributions

Raskhodnikova, Sofya; Sivakumar, Satchit; Smith, Adam; Swanberg, Marika

Computer Science > Machine Learning

arXiv:2211.08193 (cs)

[Submitted on 15 Nov 2022]

Title:Differentially Private Sampling from Distributions

Authors:Sofya Raskhodnikova, Satchit Sivakumar, Adam Smith, Marika Swanberg

View PDF

Abstract:We initiate an investigation of private sampling from distributions. Given a dataset with $n$ independent observations from an unknown distribution $P$, a sampling algorithm must output a single observation from a distribution that is close in total variation distance to $P$ while satisfying differential privacy. Sampling abstracts the goal of generating small amounts of realistic-looking data. We provide tight upper and lower bounds for the dataset size needed for this task for three natural families of distributions: arbitrary distributions on $\{1,\ldots ,k\}$, arbitrary product distributions on $\{0,1\}^d$, and product distributions on $\{0,1\}^d$ with bias in each coordinate bounded away from 0 and 1. We demonstrate that, in some parameter regimes, private sampling requires asymptotically fewer observations than learning a description of $P$ nonprivately; in other regimes, however, private sampling proves to be as difficult as private learning. Notably, for some classes of distributions, the overhead in the number of observations needed for private learning compared to non-private learning is completely captured by the number of observations needed for private sampling.

Comments:	44 pages, preliminary version in NeurIPS 2021
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2211.08193 [cs.LG]
	(or arXiv:2211.08193v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.08193

Submission history

From: Satchit Sivakumar [view email]
[v1] Tue, 15 Nov 2022 14:56:42 UTC (94 KB)

Computer Science > Machine Learning

Title:Differentially Private Sampling from Distributions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Differentially Private Sampling from Distributions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators