research-article

Optimal and Differentially Private Data Acquisition: : Central and Local Mechanisms

Authors:

Alireza Fallah,

Ali Makhdoumi,

Azarakhsh Malekian,

Asuman OzdaglarAuthors Info & Claims

Operations Research, Volume 72, Issue 3

Pages 1105 - 1123

https://doi.org/10.1287/opre.2022.0014

Published: 05 October 2023 Publication History

Abstract

The data for many machine learning tasks are owned by individuals who are typically concerned about privacy. Here, the authors study the optimal design of a data acquisition mechanism aimed at learning the mean of a population. This data acquisition scheme includes the design of a payment rule to compensate users for their privacy loss. It also involves selecting an estimator that minimizes estimation error while simultaneously providing privacy guarantees to users in line with their privacy preferences. The authors formulate this problem as a Bayesian mechanism design problem and propose approximately optimal data acquisition mechanisms.

Abstract

We consider a platform’s problem of collecting data from privacy sensitive users to estimate an underlying parameter of interest. We formulate this question as a Bayesian-optimal mechanism design problem, in which an individual can share their (verifiable) data in exchange for a monetary reward or services, but at the same time has a (private) heterogeneous privacy cost which we quantify using differential privacy. We consider two popular differential privacy settings for providing privacy guarantees for the users: central and local. In both settings, we establish minimax lower bounds for the estimation error and derive (near) optimal estimators for given heterogeneous privacy loss levels for users. Building on this characterization, we pose the mechanism design problem as the optimal selection of an estimator and payments that will elicit truthful reporting of users’ privacy sensitivities. Under a regularity condition on the distribution of privacy sensitivities, we develop efficient algorithmic mechanisms to solve this problem in both privacy settings. Our mechanism in the central setting can be implemented in time O(nlogn) where n is the number of users and our mechanism in the local setting admits a polynomial time approximation scheme (PTAS).

Funding: A. Fallah acknowledges support from the Apple Scholars in AI/ML PhD fellowship.

Supplemental Material: The online appendix is available at https://doi.org/10.1287/opre.2022.0014.

References

[1]

Abernethy JD, Cummings R, Kumar B, Morgenstern J, Taggart S (2019) Learning auctions with robust incentive guarantees. Proceedings of the 33rd International Conference on Neural Information Processing (Curran Associates, Inc., Red Hook, NY), 11587–11597.

Abstract

Abstract

References

Recommendations

Optimal and Differentially Private Data Acquisition: Central and Local Mechanisms

A differentially private algorithm for location data release

Differentially private data publishing via optimal univariate microaggregation and record perturbation

Comments

Information

Published In

Publisher

Publication History

Author Tag

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

View options

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations