Abstract
One of the core challenges in digital marketing is that the business conditions continuously change, which impacts the reception of campaigns. A winning campaign strategy can become unfavored over time, while an old strategy can gain new traction. In data driven digital marketing and web analytics, A/B testing is the prevalent method of comparing digital campaigns, choosing the winning ad, and deciding targeting strategy. A/B testing is suitable when testing variations on similar solutions and having one or more metrics that are clear indicators of success or failure. However, when faced with a complex problem or working on future topics, A/B testing fails to deliver and achieving long-term impact from experimentation is demanding and resource intensive. This study proposes a reinforcement learning based model and demonstrates its application to digital marketing campaigns. We argue and validate with actual-world data that reinforcement learning can help overcome some of the critical challenges that A/B testing, and popular Machine Learning methods currently used in digital marketing campaigns face. We demonstrate the effectiveness of the proposed technique on real actual data for a digital marketing campaign collected from a firm.
Similar content being viewed by others
Notes
UCB-1 belongs to the family of “follow the perturbed leader” algorithms and has proven to retain the optimal logarithmic rate (but with suboptimal constant). A finite-time analysis of this algorithm has been given in Auer et al. (2002); Auer et al. (2002); Auer et al. (2002/03). Other types of padding functions are considered in Audibert et al. (2007).
The focus of this paper is on an abruptly changing environment, but it is believed that the theoretical tools developed to handle the non-stationarity can be applied in different context.
References
Agrawal, R. (1995). Sample mean based index policies by O (log n ) regret for the multi-armed bandit problem. Advances in Applied Probability, 27(4), 1054–1078 {Sutton, 2018 #654}.
American Marketing Association (2021). Digital marketing virtual conference. https://www.ama.org/events/virtual-conference/2021-digital-marketing-virtual-conference/
Arulkumaran, K., Deisenroth, M. P., Brundage, M., & Bharath, A. A. (2017). Deep reinforcement learning: A brief survey. IEEE Signal Processing Magazine, 34(6), 26–38.
Ascarza, E. (2021). When a/B testing Doesn’t tell you the whole story. Harvard Business Review.
Aswani, R., Kar, A. K., & Ilavarasan, P. V. (2018). Detection of spammers in twitter marketing: A hybrid approach using social media analytics and bio inspired computing. Information Systems Frontiers, 20(3), 515–530.
Audibert, J.-Y., Munos, R., & Szepesvári, C. (2007). Tuning bandit algorithms in stochastic environments. In M. Hutter, R. A. Servedio, & E. Takimoto (Eds.), Algorithmic Learning Theory (Vol. 4754, pp. 150–165). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-540-75225-7_15
Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2), 235–256.
Barone, M. J., et al. (2007). Consumer response to retailer use of cause-related marketing: Is more fit better? Journal of Retailing, 83(4), 437–445.
Basse, G. W., & Airoldi, E. M. (2018). Limitations of design-based causal inference and a/B testing under arbitrary and network interference. Sociological Methodology, 48(1), 136–151.
Bennett, K. P., & Parrado-Hernández, E. (2006). The interplay of optimization and machine learning research. Journal of Machine Learning Research, 7(46), 1265–1281.
Bojinov, I., Rambachan, A., & Shephard, N. (2021). Panel experiments and dynamic causal effects: A finite population perspective. Quantitative Economics, 12(4), 1171–1196.
Boone, D. S., & Roehm, M. (2002). Retail Segmentation Using Artificial Neural Networks. International Journal of Research in Marketing, 19(3), 287–301.
Botvinick, M., et al. (2019). Reinforcement Learning, Fast and Slow. Trends in Cognitive Sciences, 23(5), 408–422.
Chen, S.-S., Chobey, B., & Singh, V. (2021a). A neural network based Price sensitive recommender model to predict customer choices based on Price effect. Journal of Retailing and Consumer Services, 61, 102573.
Chen, S.-Y., He, Q.-F., & Lai, C.-F. (2021b). Deep reinforcement learning-based robot exploration for constructing map of unknown environment. Information Systems Frontiers.
Chintagunta, P., Hanssens, D. M., & Hauser, J. R. (2016). Marketing science and big data. Marketing Science, 35(3), 341–342.
Chiusano, S., Cerquitelli, T., Wrembel, R., & Quercia, D. (2021). Breakthroughs on cross-cutting data management, data analytics, and applied data science. Information Systems Frontiers, 23(1), 1–7.
Choi, H., Mela, C. F., Balseiro, S. R., & Leary, A. (2020). Online display advertising markets: A literature review and future directions. Information Systems Research, 31(2), 556–575.
Davenport, Thomas H., et al. (2011). Know what your customers want before they do. Harvard Business Review, Dec. 2011. https://hbr.org/2011/12/know-what-your-customers-want-before-they-do. Accessed 30 Apr 2021.
Davenport, T., Guha, A., Grewal, D., & Bresgott, T. (2020). How artificial intelligence will change the future of marketing. Journal of the Academy of Marketing Science, 48, 24–42.
Du, R. Y., et al. (2021). Capturing Marketing Information to Fuel Growth. Journal of Marketing, 85(1), 163–183.
Erevelles, S., Fukawa, N., & Swayne, L. (2016). Big data consumer analytics and the transformation of marketing. Journal of Business Research, 69(2), 897–904.
Fabijan, A., Dmitriev, P., McFarland, C., Vermeer, L., Holmström Olsson, H., & Bosch, J. (2018). Experimentation growth: Evolving trustworthy a/B testing capabilities in online software companies. Journal of Software: Evolution and Process, 30(12), e2113.
Feit, E. M., & Berman, R. (2019). Test & roll: Profit-maximizing a/b tests. Marketing Science, 38(6), 1038–1058.
Fujimoto, S., Hoof, H., & Meger, D. (2018, July). Addressing function approximation error in actor-critic methods. In International Conference on Machine Learning (pp. 1587-1596). PMLR.
Gallo, A. (2017). A refresher on a/B testing. Harvard Business Review. https://hbr.org/2017/06/a-refresher-on-ab-testing. Accessed 30 Apr 2021.
Garivier, A., & Moulines, E. (2008). On upper-confidence bound policies for non-stationary bandit problems (arXiv:0805.3415). arXiv. http://arxiv.org/abs/0805.3415
Garivier, A., & Moulines, E. (2011). On upper confidence bound policies for switching bandit problems. In International Conference on Algorithmic Learning Theory (pp. 174–188). Springer.
Gershman, S. J., & Daw, N. D. (2017). Reinforcement learning and episodic memory in humans and animals: An integrative framework. Annual Review of Psychology, 68(1), 101–128.
Goldfarb, A., & Tucker, C. (2011). Online display advertising: Targeting and obtrusiveness. Marketing Science, 30(3), 389–404.
Gordini, N., & Veglio, V. (2017). Customers churn prediction and marketing retention strategies. An application of support vector machines based on the AUC parameter-selection technique in B2B e-commerce industry. Industrial Marketing Management, 62, 100–107.
Gordon, B. R., Jerath, K., Katona, Z., Narayanan, S., Shin, J., & Wilbur, K. C. (2021). Inefficiencies in digital advertising markets. Journal of Marketing, 85(1), 7–25.
Gupta, S., et al. (2020). Optimizing creative allocations in digital marketing. In Advances in Computing and Data Sciences (Vol. 1244, pp. 419–429). Springer Singapore.
Haarnoja, T., Zhou, A., Abbeel, P., & Levine, S. (2018, July). Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International conference on machine learning (pp. 1861-e1870). PMLR.
Hanssens, D. M. (2018). The value of empirical generalizations in marketing. Journal of the Academy of Marketing Science, 46(1), 6–8.
Huang, M. H., & Rust, R. T. (2018). Artificial intelligence in service. Journal of Service Research, 21(2), 155–172.
Huang, M. H., & Rust, R. T. (2021). A strategic framework for artificial intelligence in marketing. Journal of the Academy of Marketing Science, 49(1), 30–50.
Huber, R., Oberländer, A. M., Faisst, U., & Röglinger, M. (2022). Disentangling capabilities for industry 4.0-an information systems capability perspective. Information Systems Frontiers, 1–29. https://doi.org/10.1007/s10796-022-10260-x
Iacobucci, D., et al. (2019). The state of marketing analytics in research and practice. Journal of Marketing Analytics, 7(3), 152–181.
Jain, D., Dash, M. K., Kumar, A., & Luthra, S. (2021). How is blockchain used in marketing: A review and research agenda. International Journal of Information Management Data Insights, 1(2), 100044.
Jang, B., et al. (2019). Q-Learning Algorithms: A Comprehensive Classification and Applications. IEEE Access, 7, 133653–133667.
Javanmard, A., & Montanari, A. (2018). Online rules for control of false discovery rate and false discovery exceedance. The Annals of Statistics, 46(2), 526–554.
Johnson, G. A., Lewis, R. A., & Nubbemeyer, E. I. (2017). Ghost ads: Improving the economics of measuring online ad effectiveness. Journal of Marketing Research, 54(6), 867–884.
Kar, A. K., & Kushwaha, A. K. (2021). Facilitators and Barriers of Artificial Intelligence Adoption in Business–Insights from Opinions Using Big Data Analytics. Information Systems Frontiers, 1-24. https://doi.org/10.1007/s10796-021-10219-4
Karuga, G. G., Khraban, A. M., Nair, S. K., & Rice, D. O. (2001). AdPalette: An algorithm for customizing online advertisements on the fly. Decision Support Systems, 32(2), 85–106.
Kim, J., Kang, S., & Lee, K. H. (2021). Evolution of digital marketing communication: Bibliometric analysis and network visualization from key articles. Journal of Business Research, 130, 552–563.
Kleinberg, R., Slivkins, A., & Upfal, E. (2008). Multi-armed bandits in metric spaces. In Proceedings of the fortieth annual ACM symposium on Theory of computing (pp. 681-690).
Kocsis, L., & Szepesvári, C. (2006). Bandit based Monte-Carlo planning. In European conference on machine learning (pp. 282–293). Springer.
Kohavi, R., & Longbotham, R. (2017). Online controlled experiments and a/B testing. Encyclopedia of machine learning and data mining, 7(8), 922–929.
Kohavi, R., & Thomke, S. (2017). The surprising power of online experiments. Harvard Business Review, 95(5), 74–82.
Kushwaha, A. K., & Kar, A. K. (2021). MarkBot–a language model-driven chatbot for interactive marketing in post-modern world. Information Systems Frontiers,1–18. https://doi.org/10.1007/s10796-021-10184-y
Kushwaha, A. K., Kar, A. K., & Dwivedi, Y. K. (2021). Applications of big data in emerging management disciplines: A literature review using text mining. International Journal of Information Management Data Insights, 1(2), 100017.
Lai, L., El Gamal, H., Jiang, H., & Poor, H. V. (2010). Cognitive medium access: Exploration, exploitation, and competition. IEEE Transactions on Mobile Computing, 10(2), 239–253.
Lamberton, C., & Stephen, A. T. (2016). A thematic exploration of digital, social media, and Mobile marketing: Research evolution from 2000 to 2015 and an agenda for future inquiry. Journal of Marketing, 80(6), 146–172.
Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., ... & Wierstra, D. (2015). Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971.
Liu, X., Derakhshani, M., Lambotharan, S., & Van der Schaar, M. (2020). Risk-aware multi-armed bandits with refined upper confidence bounds. IEEE Signal Processing Letters, 28, 269–273.
Liu, R., Gupta, S., & Patel, P. (2021). The application of the principles of responsible AI on social media marketing for digital health. Information Systems Frontiers, 1–25. https://doi.org/10.1007/s10796-021-10191-z
Luo, L., et al. (2013). Marketing via Social Media: A Case Study. Library Hi Tech, 31(3), 455–466.
Ma, L., & Sun, B. (2020). Machine learning and AI in marketing – Connecting computing power to human insights. International Journal of Research in Marketing, 37(3), 481–504.
Merckling, A., et al. (2022). Exploratory state representation learning. Frontiers in Robotics and AI, 9, 1–16.
Miklosik, A., Kuchta, M., Evans, N., & Zak, S. (2019). Towards the adoption of machine learning-based analytical tools in digital marketing. IEEE Access, 7, 85705–85718.
Misra, K., Schwartz, E. M., & Abernethy, J. (2019). Dynamic online pricing with incomplete information using multiarmed bandit experiments. Marketing Science, 38(2), 226–252.
Mnih, V., Badia, A. P., Mirza, M., Graves, A., Lillicrap, T., Harley, T., ... & Kavukcuoglu, K. (2016). Asynchronous methods for deep reinforcement learning. In International conference on machine learning (pp. 1928-1937). PMLR.
Nadkarni, S., & Prügl, R. (2021). Digital transformation: A review, synthesis and opportunities for future research. Management Review Quarterly, 71(2), 233–341.
Netzer, O., Lattin, J. M., & Srinivasan, V. (2008). A hidden Markov model of customer relationship dynamics. Marketing Science, 27(2), 185–204.
Obal, M. W., & Lv, W. (2017). Improving banner ad strategies through predictive modeling. Journal of Research in Interactive Marketing, 11(2), 198–212.
Oh, Junhyuk, et al. (2017). ‘Value prediction network’. Advances in Neural Information Processing Systems, vol. 30, Curran Associates, Inc., Neural Information Processing Systems.
Pandey, N., Nayal, P., & Rathore, A. S. (2020). Digital marketing for B2B organizations: Structured literature review and future research directions. Journal of Business & Industrial Marketing, 35(7), 1191–1204.
Quach, S., Thaichon, P., Martin, K. D., Weaven, S., & Palmatier, R. W. (2022). Digital technologies: Tensions in privacy and data. Journal of the Academy of Marketing Science, 1–25.
Rai, A. (2020). Explainable AI: From black box to glass box. Journal of the Academy of Marketing Science, 48(1), 137–141.
Rathore, H., Sahay, S. K., Nikam, P., & Sewak, M. (2021). Robust android malware detection system against adversarial attacks using q-learning. Information Systems Frontiers, 23(4), 867–882.
Ruiz-Real, J. L., Uribe-Toril, J., Torres, J. A., & De Pablo, J. (2021). Artificial intelligence in business and economics research: Trends and future. Journal of Business Economics and Management, 22(1), 98–117.
Rust, R. T. (2020). The future of marketing. International Journal of Research in Marketing, 37(1), 15–26.
Rutz, O. J., & Watson, G. F. (2019). Endogeneity and marketing strategy research: An overview. Journal of the Academy of Marketing Science, 47, 479–498.
Saura, J. R. (2021). Using data sciences in digital marketing: Framework, methods, and performance metrics. Journal of Innovation & Knowledge, 6(2), 92–102.
Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural Networks, 61, 85–117.
Schwartz, E. M., Bradlow, E. T., & Fader, P. S. (2017). Customer acquisition via display advertising using multi-armed bandit experiments. Marketing Science, 36(4), 500–522.
Senz, K. (2021). Is A/B testing effective? Evidence from 35,000 startups. Retrieved November 25, 2021, from https://hbswk.hbs.edu/item/is-ab-testing-effective-evidence-from-35000-startups. Accessed 30 Apr 2021.
Singh, V., Chen, S. S., Singhania, M., Nanavati, B., & Gupta, A. (2022). How are reinforcement learning and deep learning algorithms used for big data based decision making in financial industries–A review and research agenda. International Journal of Information Management Data Insights, 2(2), 100094
Sridhar, S., & Fang, E. (2019). New vistas for marketing strategy: Digital, data-rich, and developing market (D3) environments. Journal of the Academy of Marketing Science, 47, 977–985.
Stourm, V., & Bax, E. (2017). Incorporating hidden costs of annoying ads in display auctions. International Journal of Research in Marketing, 34(3), 622–640.
Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction (Second ed.). The MIT Press.
Verma, S., Sharma, R., Deb, S., & Maitra, D. (2021). Artificial intelligence in marketing: Systematic review and future research direction. International Journal of Information Management Data Insights, 1(1), 100002.
Votto, A. M., Valecha, R., Najafirad, P., & Rao, H. R. (2021). Artificial intelligence in tactical human resource management: A systematic literature review. International Journal of Information Management Data Insights, 1(2), 100047.
Wang, H., & Hong, M. (2019). Online ad effectiveness evaluation with a two-stage method using a Gaussian filter and decision tree approach. Electronic Commerce Research and Applications, 35, 100852.
Youngmann, B., Yom-Tov, E., Gilad-Bachrach, R., & Karmon, D. (2021). Algorithmic copywriting: Automated generation of health-related advertisements to improve their performance. Information Retrieval Journal, 24(3), 205–239.
Zhang, Y., et al. (2022). A deep reinforcement learning based hyper-heuristic for combinatorial optimisation with uncertainties. European Journal of Operational Research, 300(2), 418–427.
Acknowledgements
The authors gratefully acknowledge the team at YourFirstad for their insights and sharing of data for our study. This research paper is submitted as part of the efforts undertaken under a collaboration between the research teams of Indian Institute of Technology Delhi and BASF Germany, funded by BASF Germany.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors report no conflict of interest in the study undertaken. All contributors have been listed as authors and all authors have contributed.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Singh, V., Nanavati, B., Kar, A.K. et al. How to Maximize Clicks for Display Advertisement in Digital Marketing? A Reinforcement Learning Approach. Inf Syst Front 25, 1621–1638 (2023). https://doi.org/10.1007/s10796-022-10314-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10796-022-10314-0