Computer Science > Machine Learning

arXiv:2206.02371 (cs)

[Submitted on 6 Jun 2022 (v1), last revised 9 Jun 2022 (this version, v2)]

Title:Markovian Interference in Experiments

Authors:Vivek F. Farias, Andrew A. Li, Tianyi Peng, Andrew Zheng

View PDF

Abstract:We consider experiments in dynamical systems where interventions on some experimental units impact other units through a limiting constraint (such as a limited inventory). Despite outsize practical importance, the best estimators for this `Markovian' interference problem are largely heuristic in nature, and their bias is not well understood. We formalize the problem of inference in such experiments as one of policy evaluation. Off-policy estimators, while unbiased, apparently incur a large penalty in variance relative to state-of-the-art heuristics. We introduce an on-policy estimator: the Differences-In-Q's (DQ) estimator. We show that the DQ estimator can in general have exponentially smaller variance than off-policy evaluation. At the same time, its bias is second order in the impact of the intervention. This yields a striking bias-variance tradeoff so that the DQ estimator effectively dominates state-of-the-art alternatives. From a theoretical perspective, we introduce three separate novel techniques that are of independent interest in the theory of Reinforcement Learning (RL). Our empirical evaluation includes a set of experiments on a city-scale ride-hailing simulator.

Subjects:	Machine Learning (cs.LG); Econometrics (econ.EM); Machine Learning (stat.ML)
Cite as:	arXiv:2206.02371 [cs.LG]
	(or arXiv:2206.02371v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.02371

Submission history

From: Tianyi Peng [view email]
[v1] Mon, 6 Jun 2022 05:53:36 UTC (5,132 KB)
[v2] Thu, 9 Jun 2022 14:13:38 UTC (5,135 KB)

Computer Science > Machine Learning

Title:Markovian Interference in Experiments

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Markovian Interference in Experiments

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators