[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3442381.3450075acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Personalized Treatment Selection using Causal Heterogeneity

Published: 03 June 2021 Publication History

Abstract

Randomized experimentation (also known as A/B testing or bucket testing) is widely used in the internet industry to measure the metric impact obtained by different treatment variants. A/B tests identify the treatment variant showing the best performance, which then becomes the chosen or selected treatment for the entire population. However, the effect of a given treatment can differ across experimental units and a personalized approach for treatment selection can greatly improve upon the usual global selection strategy. In this work, we develop a framework for personalization through (i) estimation of heterogeneous treatment effect at either a cohort or member-level, followed by (ii) selection of optimal treatment variants for cohorts (or members) obtained through (deterministic or stochastic) constrained optimization.
We perform a two-fold evaluation of our proposed methods. First, a simulation analysis is conducted to study the effect of personalized treatment selection under carefully controlled settings. This simulation illustrates the differences between the proposed methods and the suitability of each with increasing uncertainty. We also demonstrate the effectiveness of the method through a real-life example related to serving notifications at Linkedin. The solution significantly outperformed both heuristic solutions and the global treatment selection baseline leading to a sizable win on top-line metrics like member visits.

References

[1]
Susan Athey and Guido Imbens. 2016. Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences 113, 27(2016), 7353–7360.
[2]
Susan Athey, Guido Imbens, and Yanyang Kong. 2016. causalTree: Recursive Partitioning Causal Trees. R package version 0.0.
[3]
Kinjal Basu and Preetam Nandy. 2019. Optimal convergence for stochastic optimization with multiple expectation constraints. arXiv preprint arXiv:1906.03401(2019).
[4]
Albert Benveniste, Michel Métivier, and Pierre Priouret. 2012. Adaptive algorithms and stochastic approximations. Vol. 22. Springer Science & Business Media.
[5]
Leo Breiman. 2001. Random Forests. Machine Learning 45, 1 (2001), 5–32.
[6]
Shuyang Du, James Lee, and Farzin Ghaffarizadeh. 2019. Improve User Retention with Causal Learning. In Proceedings of Machine Learning Research, Vol. 104. PMLR, Anchorage, Alaska, USA, 34–49.
[7]
John Duchi, Elad Hazan, and Yoram Singer. 2011. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research 12 (2011), 2121–2159.
[8]
Jared C. Foster, Jeremy M.G. Taylor, and Stephen J. Ruberg. 2011. Subgroup identification from randomized clinical trial data. Statistics in Medicine 30, 24 (2011), 2867–2880.
[9]
Yan Gao, Viral Gupta, Jinyun Yan, Changji Shi, Zhongen Tao, PJ Xiao, Curtis Wang, Shipeng Yu, Romer Rosales, Ajith Muralidharan, and Shaunak Chatterjee. 2018. Near Real-time Optimization of Activity-based Notifications. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining(KDD ’18). ACM, New York, NY, USA, 283–292.
[10]
Donald P. Green and Holger L. Kern. 2012. Modeling Heterogeneous Treatment Effects in Survey Experiments with Bayesian Additive Regression Trees. Public Opinion Quarterly 76, 3 (2012), 491–511.
[11]
Kosuke Imai and Marc Ratkovic. 2013. Estimating treatment effect heterogeneity in randomized program evaluation. Ann. Appl. Stat. 7, 1 (03 2013), 443–470.
[12]
Fredrik D. Johansson, Uri Shalit, and David Sontag. 2016. Learning Representations for Counterfactual Inference. In Proceedings of the 33rd International Conference on International Conference on Machine Learning(ICML’16, Vol. 18). 3020–3029.
[13]
Anton J Kleywegt, Alexander Shapiro, and Tito Homem-de Mello. 2002. The sample average approximation method for stochastic discrete optimization. SIAM Journal on Optimization 12, 2 (2002), 479–502.
[14]
Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, and Nils Pohlmann. 2013. Online Controlled Experiments at Large Scale. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD ’13). ACM, New York, NY, USA, 1168–1176.
[15]
Ron Kohavi, Alex Deng, Roger Longbotham, and Ya Xu. 2014. Seven rules of thumb for web site experimenters. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1857–1866.
[16]
Sören R. Künzel, Jasjeet S. Sekhon, Peter J. Bickel, and Bin Yu. 2019. Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences 116, 10(2019), 4156–4165.
[17]
Guanghui Lan and Zhiqiang Zhou. 2020. Algorithms for stochastic optimization with function or expectation constraints. Computational Optimization and Applications 76, 2 (2020), 461–498.
[18]
Andy Liaw and Matthew Wiener. 2002. Classification and Regression by randomForest. R News 2, 3 (2002), 18–22. https://CRAN.R-project.org/doc/Rnews/
[19]
Herbert Robbins and Sutton Monro. 1951. A Stochastic Approximation Method. The Annals of Mathematical Statistics 22, 3 (1951), 400–407.
[20]
Paul R Rosenbaum and Donald B Rubin. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika 70, 1 (1983), 41–55.
[21]
Donald B Rubin. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies.Journal of educational Psychology 66, 5 (1974), 688.
[22]
Uri Shalit, Fredrik D. Johansson, and David Sontag. 2017. Estimating Individual Treatment Effect: Generalization Bounds and Algorithms. In Proceedings of the 34th International Conference on Machine Learning(ICML’17, Vol. 70). JMLR.org, 3076–3085.
[23]
Oleg Sofrygin, Mark J. van der Laan, and Romain Neugebauer. 2017. simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data. Journal of Statistical Software 81, 2 (2017), 1–47.
[24]
Michał Sołtys, Szymon Jaroszewicz, and Piotr Rzepakowski. 2015. Ensemble methods for uplift modeling. Data Mining and Knowledge Discovery 29, 6 (2015), 1531–1559.
[25]
James C Spall. 2005. Introduction to stochastic search and optimization: estimation, simulation, and control. Vol. 65. John Wiley & Sons.
[26]
Pedro Strecht. 2015. A Survey of Merging Decision Trees Data Mining Approaches. In Proc. 10th Doctoral Symposium in Informatics Engineering. 36–47.
[27]
Matt Taddy, Matt Gardner, Liyun Chen, and David Draper. 2016. A nonparametric bayesian analysis of heterogeneous treatment effects in digital experimentation. Journal of Business & Economic Statistics 34, 4 (2016), 661–672.
[28]
Diane Tang, Ashish Agarwal, Deirdre O’Brien, and Mike Meyer. 2010. Overlapping Experiment Infrastructure: More, Better, Faster Experimentation. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD ’10). ACM, New York, NY, USA, 17–26.
[29]
Julie Tibshirani, Susan Athey, and Stefan Wager. 2020. grf: Generalized Random Forests. https://CRAN.R-project.org/package=grf R package version 1.2.0.
[30]
Stefan Wager and Susan Athey. 2018. Estimation and Inference of Heterogeneous Treatment Effects using Random Forests. J. Amer. Statist. Assoc. 113, 523 (2018), 1228–1242.
[31]
Wei Wang and Shabbir Ahmed. 2008. Sample average approximation of expected value constrained stochastic programs. Operations Research Letters 36, 5 (2008), 515–519.
[32]
Ya Xu, Nanyu Chen, Addrian Fernandez, Omar Sinno, and Anmol Bhasin. 2015. From Infrastructure to Culture: A/B Testing Challenges in Large Scale Social Networks. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD ’15). ACM, New York, NY, USA, 2227–2236.
[33]
Liuyi Yao, Sheng Li, Yaliang Li, Mengdi Huai, Jing Gao, and Aidong Zhang. 2018. Representation Learning for Treatment Effect Estimation from Observational Data. In Advances in Neural Information Processing Systems, Vol. 31. Curran Associates, Inc., 2633–2643.
[34]
Hao Yu, Michael J. Neely, and Xiaohan Wei. 2017. Online Convex Optimization with Stochastic Constraints. In Proceedings of the 31st International Conference on Neural Information Processing Systems(NIPS’17). Curran Associates Inc., USA, 1427–1437.

Cited By

View all
  • (2024)End-to-End Cost-Effective Incentive Recommendation under Budget Constraint with Uplift ModelingProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688147(560-569)Online publication date: 8-Oct-2024
  • (2024)A/B testingJournal of Systems and Software10.1016/j.jss.2024.112011211:COnline publication date: 2-Jul-2024
  • (2024)Using Case-Based Causal Reasoning to Provide Explainable Counterfactual Diagnosis in Personalized Sprint TrainingCase-Based Reasoning Research and Development10.1007/978-3-031-63646-2_27(418-429)Online publication date: 24-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '21: Proceedings of the Web Conference 2021
April 2021
4054 pages
ISBN:9781450383127
DOI:10.1145/3442381
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Constraint optimization
  2. Heterogeneous causal effects
  3. Personalization
  4. Treatment Selection

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

WWW '21
Sponsor:
WWW '21: The Web Conference 2021
April 19 - 23, 2021
Ljubljana, Slovenia

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)40
  • Downloads (Last 6 weeks)3
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)End-to-End Cost-Effective Incentive Recommendation under Budget Constraint with Uplift ModelingProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688147(560-569)Online publication date: 8-Oct-2024
  • (2024)A/B testingJournal of Systems and Software10.1016/j.jss.2024.112011211:COnline publication date: 2-Jul-2024
  • (2024)Using Case-Based Causal Reasoning to Provide Explainable Counterfactual Diagnosis in Personalized Sprint TrainingCase-Based Reasoning Research and Development10.1007/978-3-031-63646-2_27(418-429)Online publication date: 24-Jun-2024
  • (2023)Generalized Causal Tree for Uplift Modeling2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386842(788-798)Online publication date: 15-Dec-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media