More Web Proxy on the site http://driver.im/

research-article

Personalized Treatment Selection using Causal Heterogeneity

Authors:

Cyrus DiCiccio,

Padmini Jaikumar,

Shaunak ChatterjeeAuthors Info & Claims

WWW '21: Proceedings of the Web Conference 2021

Pages 1574 - 1585

https://doi.org/10.1145/3442381.3450075

Published: 03 June 2021 Publication History

Abstract

Randomized experimentation (also known as A/B testing or bucket testing) is widely used in the internet industry to measure the metric impact obtained by different treatment variants. A/B tests identify the treatment variant showing the best performance, which then becomes the chosen or selected treatment for the entire population. However, the effect of a given treatment can differ across experimental units and a personalized approach for treatment selection can greatly improve upon the usual global selection strategy. In this work, we develop a framework for personalization through (i) estimation of heterogeneous treatment effect at either a cohort or member-level, followed by (ii) selection of optimal treatment variants for cohorts (or members) obtained through (deterministic or stochastic) constrained optimization.

We perform a two-fold evaluation of our proposed methods. First, a simulation analysis is conducted to study the effect of personalized treatment selection under carefully controlled settings. This simulation illustrates the differences between the proposed methods and the suitability of each with increasing uncertainty. We also demonstrate the effectiveness of the method through a real-life example related to serving notifications at Linkedin. The solution significantly outperformed both heuristic solutions and the global treatment selection baseline leading to a sizable win on top-line metrics like member visits.

References

[1]

Susan Athey and Guido Imbens. 2016. Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences 113, 27(2016), 7353–7360.

[2]

Susan Athey, Guido Imbens, and Yanyang Kong. 2016. causalTree: Recursive Partitioning Causal Trees. R package version 0.0.

[3]

Kinjal Basu and Preetam Nandy. 2019. Optimal convergence for stochastic optimization with multiple expectation constraints. arXiv preprint arXiv:1906.03401(2019).

[4]

Albert Benveniste, Michel Métivier, and Pierre Priouret. 2012. Adaptive algorithms and stochastic approximations. Vol. 22. Springer Science & Business Media.

[5]

Leo Breiman. 2001. Random Forests. Machine Learning 45, 1 (2001), 5–32.

Digital Library

[6]

Shuyang Du, James Lee, and Farzin Ghaffarizadeh. 2019. Improve User Retention with Causal Learning. In Proceedings of Machine Learning Research, Vol. 104. PMLR, Anchorage, Alaska, USA, 34–49.

[7]

John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research 12 (2011), 2121–2159.

Digital Library

[8]

Jared C. Foster, Jeremy M.G. Taylor, and Stephen J. Ruberg. 2011. Subgroup identification from randomized clinical trial data. Statistics in Medicine 30, 24 (2011), 2867–2880.

[9]

Yan Gao, Viral Gupta, Jinyun Yan, Changji Shi, Zhongen Tao, PJ Xiao, Curtis Wang, Shipeng Yu, Romer Rosales, Ajith Muralidharan, and Shaunak Chatterjee. 2018. Near Real-time Optimization of Activity-based Notifications. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining(KDD ’18). ACM, New York, NY, USA, 283–292.

Digital Library

[10]

Donald P. Green and Holger L. Kern. 2012. Modeling Heterogeneous Treatment Effects in Survey Experiments with Bayesian Additive Regression Trees. Public Opinion Quarterly 76, 3 (2012), 491–511.

[11]

Kosuke Imai and Marc Ratkovic. 2013. Estimating treatment effect heterogeneity in randomized program evaluation. Ann. Appl. Stat. 7, 1 (03 2013), 443–470.

[12]

Fredrik D. Johansson, Uri Shalit, and David Sontag. 2016. Learning Representations for Counterfactual Inference. In Proceedings of the 33rd International Conference on International Conference on Machine Learning(ICML’16, Vol. 18). 3020–3029.

[13]

Anton J Kleywegt, Alexander Shapiro, and Tito Homem-de Mello. 2002. The sample average approximation method for stochastic discrete optimization. SIAM Journal on Optimization 12, 2 (2002), 479–502.

Digital Library

[14]

Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, and Nils Pohlmann. 2013. Online Controlled Experiments at Large Scale. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD ’13). ACM, New York, NY, USA, 1168–1176.

Digital Library

[15]

Ron Kohavi, Alex Deng, Roger Longbotham, and Ya Xu. 2014. Seven rules of thumb for web site experimenters. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1857–1866.

Digital Library

[16]

Sören R. Künzel, Jasjeet S. Sekhon, Peter J. Bickel, and Bin Yu. 2019. Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences 116, 10(2019), 4156–4165.

[17]

Guanghui Lan and Zhiqiang Zhou. 2020. Algorithms for stochastic optimization with function or expectation constraints. Computational Optimization and Applications 76, 2 (2020), 461–498.

Digital Library

[18]

Andy Liaw and Matthew Wiener. 2002. Classification and Regression by randomForest. R News 2, 3 (2002), 18–22. https://CRAN.R-project.org/doc/Rnews/

[19]

Herbert Robbins and Sutton Monro. 1951. A Stochastic Approximation Method. The Annals of Mathematical Statistics 22, 3 (1951), 400–407.

[20]

Paul R Rosenbaum and Donald B Rubin. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika 70, 1 (1983), 41–55.

[21]

Donald B Rubin. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies.Journal of educational Psychology 66, 5 (1974), 688.

[22]

Uri Shalit, Fredrik D. Johansson, and David Sontag. 2017. Estimating Individual Treatment Effect: Generalization Bounds and Algorithms. In Proceedings of the 34th International Conference on Machine Learning(ICML’17, Vol. 70). JMLR.org, 3076–3085.

[23]

Oleg Sofrygin, Mark J. van der Laan, and Romain Neugebauer. 2017. simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data. Journal of Statistical Software 81, 2 (2017), 1–47.

[24]

Michał Sołtys, Szymon Jaroszewicz, and Piotr Rzepakowski. 2015. Ensemble methods for uplift modeling. Data Mining and Knowledge Discovery 29, 6 (2015), 1531–1559.

Digital Library

[25]

James C Spall. 2005. Introduction to stochastic search and optimization: estimation, simulation, and control. Vol. 65. John Wiley & Sons.

[26]

Pedro Strecht. 2015. A Survey of Merging Decision Trees Data Mining Approaches. In Proc. 10th Doctoral Symposium in Informatics Engineering. 36–47.

[27]

Matt Taddy, Matt Gardner, Liyun Chen, and David Draper. 2016. A nonparametric bayesian analysis of heterogeneous treatment effects in digital experimentation. Journal of Business & Economic Statistics 34, 4 (2016), 661–672.

[28]

Diane Tang, Ashish Agarwal, Deirdre O’Brien, and Mike Meyer. 2010. Overlapping Experiment Infrastructure: More, Better, Faster Experimentation. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD ’10). ACM, New York, NY, USA, 17–26.

Digital Library

[29]

Julie Tibshirani, Susan Athey, and Stefan Wager. 2020. grf: Generalized Random Forests. https://CRAN.R-project.org/package=grf R package version 1.2.0.

[30]

Stefan Wager and Susan Athey. 2018. Estimation and Inference of Heterogeneous Treatment Effects using Random Forests. J. Amer. Statist. Assoc. 113, 523 (2018), 1228–1242.

[31]

Wei Wang and Shabbir Ahmed. 2008. Sample average approximation of expected value constrained stochastic programs. Operations Research Letters 36, 5 (2008), 515–519.

Digital Library

[32]

Ya Xu, Nanyu Chen, Addrian Fernandez, Omar Sinno, and Anmol Bhasin. 2015. From Infrastructure to Culture: A/B Testing Challenges in Large Scale Social Networks. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD ’15). ACM, New York, NY, USA, 2227–2236.

Digital Library

[33]

Liuyi Yao, Sheng Li, Yaliang Li, Mengdi Huai, Jing Gao, and Aidong Zhang. 2018. Representation Learning for Treatment Effect Estimation from Observational Data. In Advances in Neural Information Processing Systems, Vol. 31. Curran Associates, Inc., 2633–2643.

[34]

Hao Yu, Michael J. Neely, and Xiaohan Wei. 2017. Online Convex Optimization with Stochastic Constraints. In Proceedings of the 31st International Conference on Neural Information Processing Systems(NIPS’17). Curran Associates Inc., USA, 1427–1437.

Digital Library

Cited By

Sun ZYang HLiu DWeng YTang XHe X(2024)End-to-End Cost-Effective Incentive Recommendation under Budget Constraint with Uplift ModelingProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688147(560-569)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3640457.3688147
Quin FWeyns DGalster MSilva C(2024)A/B testingJournal of Systems and Software10.1016/j.jss.2024.112011211:COnline publication date: 2-Jul-2024
https://dl.acm.org/doi/10.1016/j.jss.2024.112011
Cui DGuo JLiu PZhang X(2024)Using Case-Based Causal Reasoning to Provide Explainable Counterfactual Diagnosis in Personalized Sprint TrainingCase-Based Reasoning Research and Development10.1007/978-3-031-63646-2_27(418-429)Online publication date: 24-Jun-2024
https://doi.org/10.1007/978-3-031-63646-2_27
Show More Cited By

Recommendations

LBCF: A Large-Scale Budget-Constrained Causal Forest Algorithm
WWW '22: Proceedings of the ACM Web Conference 2022

Offering incentives (e.g., coupons at Amazon, discounts at Uber and video bonuses at Tiktok) to user is a common strategy used by online platforms to increase user engagement and platform revenue. Despite its proven effectiveness, these marketing ...
Bootstrap corrections of treatment effect estimates following selection

Bias of treatment effect estimators can occur when the maximum effect of several treatments is to be determined or the effect of the selected treatment or subgroup has to be estimated. Since those estimates may contribute to the decision as to whether ...
Collaborative Filtering for Personalised Facet Selection
IAIT '18: Proceedings of the 10th International Conference on Advances in Information Technology

An overwhelming number of facet values causes difficulties in providing an efficient search filter in dynamic facet search. It requires effort and time from the searchers to examine the list in order to select their interested facets. Personalised facet ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

WWW '21: Proceedings of the Web Conference 2021

April 2021

4054 pages

ISBN:9781450383127

DOI:10.1145/3442381

Editors:
Jure Leskovec
Stanford
,
Marko Grobelnik
Jožef Stefan Institute
,
Marc Najork
Google
,
Jie Tang
Tsinghua University
,
Leila Zia
Wikimedia Foundation

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 June 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

WWW '21

Sponsor:

SIGWEB

WWW '21: The Web Conference 2021

April 19 - 23, 2021

Ljubljana, Slovenia

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
215
Total Downloads

Downloads (Last 12 months)36
Downloads (Last 6 weeks)1

Reflects downloads up to 05 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sun ZYang HLiu DWeng YTang XHe X(2024)End-to-End Cost-Effective Incentive Recommendation under Budget Constraint with Uplift ModelingProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688147(560-569)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3640457.3688147
Quin FWeyns DGalster MSilva C(2024)A/B testingJournal of Systems and Software10.1016/j.jss.2024.112011211:COnline publication date: 2-Jul-2024
https://dl.acm.org/doi/10.1016/j.jss.2024.112011
Cui DGuo JLiu PZhang X(2024)Using Case-Based Causal Reasoning to Provide Explainable Counterfactual Diagnosis in Personalized Sprint TrainingCase-Based Reasoning Research and Development10.1007/978-3-031-63646-2_27(418-429)Online publication date: 24-Jun-2024
https://doi.org/10.1007/978-3-031-63646-2_27
Nandy PYu XLiu WTu YBasu KChatterjee S(2023)Generalized Causal Tree for Uplift Modeling2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386842(788-798)Online publication date: 15-Dec-2023
https://doi.org/10.1109/BigData59044.2023.10386842

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten