[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3488560.3498373acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising

Published: 15 February 2022 Publication History

Abstract

In online advertising, auto-bidding has become an essential tool for advertisers to optimize their preferred ad performance metrics by simply expressing high-level campaign objectives and constraints. Previous works designed auto-bidding tools from the view of single-agent, without modeling the mutual influence between agents. In this paper, we instead consider this problem from a distributed multi-agent perspective, and propose a general \underlineM ulti-\underlineA gent reinforcement learning framework for \underlineA uto-\underlineB idding, namely MAAB, to learn the auto-bidding strategies. First, we investigate the competition and cooperation relation among auto-bidding agents, and propose a temperature-regularized credit assignment to establish a mixed cooperative-competitive paradigm. By carefully making a competition and cooperation trade-off among agents, we can reach an equilibrium state that guarantees not only individual advertiser's utility but also the system performance (i.e., social welfare). Second, to avoid the potential collusion behaviors of bidding low prices underlying the cooperation, we further propose bar agents to set a personalized bidding bar for each agent, and then alleviate the revenue degradation due to the cooperation. Third, to deploy MAAB in the large-scale advertising system with millions of advertisers, we propose a mean-field approach. By grouping advertisers with the same objective as a mean auto-bidding agent, the interactions among the large-scale advertisers are greatly simplified, making it practical to train MAAB efficiently. Extensive experiments on the offline industrial dataset and Alibaba advertising platform demonstrate that our approach outperforms several baseline methods in terms of social welfare and revenue.

Supplementary Material

MP4 File (WSDM22-fp018.mp4)
Presentation video for paper "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising"

References

[1]
Gagan Aggarwal, Ashwinkumar Badanidiyuru, and Aranyak Mehta. 2019. Autobidding with constraints. In WINE. Springer, 17--30.
[2]
Han Cai, Kan Ren, Weinan Zhang, Kleanthis Malialis, Jun Wang, Yong Yu, and Defeng Guo. 2017. Real-time bidding by reinforcement learning in display advertising. In WSDM . 661--670.
[3]
Google Ads Help Center. 2021. About automated bidding. https://support.google.com/google-ads/answer/2979071 . Accessed: January 24, 2021.
[4]
Carl Davidson and Raymond Deneckere. 1986. Long-run competition in capacity, short-run competition in price, and the Cournot model. The Rand Journal of Economics (1986), 404--415.
[5]
Paul Dütting, Zhe Feng, Harikrishna Narasimhan, David Parkes, and Sai Srivatsa Ravindranath. 2019. Optimal auctions through deep learning. In ICML. PMLR, 1706--1715.
[6]
Benjamin Edelman, Michael Ostrovsky, and Michael Schwarz. 2007. Internet advertising and the generalized second-price auction: Selling billions of dollars worth of keywords. American economic review, Vol. 97, 1 (2007), 242--259.
[7]
eMarketer. 2015. Worldwide retail ecommerce sales: eMarketer's updated estimates and forecast through 2019. (2015).
[8]
Facebook. 2021. Facebook. https://www.facebook.com/business/m/one-sheeters/facebook-bid-strategy-guide . Accessed: January 24, 2021.
[9]
Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, and Shimon Whiteson. 2018. Counterfactual multi-agent policy gradients. In AAAI, Vol. 32.
[10]
Google. 2021. Google AdWords API. https://developers.google.com/adwords/api/docs/guides/start . Accessed: January 24, 2021.
[11]
Ziyu Guan, Hongchang Wu, Qingyu Cao, Hao Liu, Wei Zhao, Sheng Li, Cai Xu, Guang Qiu, Jian Xu, and Bo Zheng. 2021. Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search. arXiv preprint arXiv:2106.04075 (2021).
[12]
Garrett Hardin. 2009. The tragedy of the commons. Journal of Natural Resources Policy Research, Vol. 1, 3 (2009), 243--253.
[13]
Pablo Hernandez-Leal, Bilal Kartal, and Matthew E Taylor. 2019. A survey and critique of multiagent deep reinforcement learning. AAMAS, Vol. 33, 6 (2019), 750--797.
[14]
Junqi Jin, Chengru Song, Han Li, Kun Gai, Jun Wang, and Weinan Zhang. 2018. Real-time bidding with multi-agent reinforcement learning in display advertising. In CIKM . 2193--2201.
[15]
Jean-Michel Lasry and Pierre-Louis Lions. 2007. Mean field games. Japanese journal of mathematics, Vol. 2, 1 (2007), 229--260.
[16]
Joel Z Leibo and Marc Lanctot. 2017. Multi-agent Reinforcement Learning in Sequential Social Dilemmas . (2017). arxiv: arXiv:1702.03037v1
[17]
Michael L Littman. 1994. Markov games as a framework for multi-agent reinforcement learning. In Machine learning proceedings 1994 . Elsevier, 157--163.
[18]
Xiangyu Liu, Chuan Yu, Zhilin Zhang, Zhenzhe Zheng, Yu Rong, Hongtao Lv, Da Huo, Yiqing Wang, Dagui Chen, Jian Xu, Fan Wu, Guihai Chen, and Xiaoqiang Zhu. 2021. Neural Auction: End-to-End Learning of Auction Mechanisms for E-Commerce Advertising. In SIGKDD. 3354--3364.
[19]
Ryan Lowe, Yi I Wu, Aviv Tamar, Jean Harb, OpenAI Pieter Abbeel, and Igor Mordatch. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. In NIPS . 6379--6390.
[20]
Robert C Marshall and Leslie M Marx. 2007. Bidder collusion. Journal of Economic Theory, Vol. 133, 1 (2007), 374--402.
[21]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. nature, Vol. 518, 7540 (2015), 529--533.
[22]
Mehryar Mohri and Andres Munoz Medina. 2014. Learning theory and algorithms for revenue optimization in second price auctions with reserve. In ICML. PMLR, 262--270.
[23]
Roger B Myerson. 1981. Optimal auction design. Mathematics of operations research, Vol. 6, 1 (1981), 58--73.
[24]
Michael Ostrovsky and Michael Schwarz. 2011. Reserve prices in internet advertising auctions: A field experiment. In EC . 59--60.
[25]
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder, Gregory Farquhar, Jakob Foerster, and Shimon Whiteson. 2018. QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. In ICML. 4295--4304.
[26]
Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinícius Flores Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z Leibo, Karl Tuyls, et al. 2018. Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward. In AAMAS. 2085--2087.
[27]
Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan Korjus, Juhan Aru, Jaan Aru, and Raul Vicente. 2017. Multiagent cooperation and competition with deep reinforcement learning. PloS one, Vol. 12, 4 (2017), e0172395.
[28]
Ming Tan. 1993. Multi-agent reinforcement learning: Independent vs. cooperative agents. In ICML . 330--337.
[29]
David RM Thompson and Kevin Leyton-Brown. 2013. Revenue optimization in the generalized second-price auction. In EC . 837--852.
[30]
Chao Wen, Xinghu Yao, Yuhui Wang, and Xiaoyang Tan. 2020. SMIX ($łambda$): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. In AAAI. 7301--7308.
[31]
Di Wu, Xiujun Chen, Xun Yang, Hao Wang, Qing Tan, Xiaoxun Zhang, Jian Xu, and Kun Gai. 2018. Budget constrained bidding by model-free reinforcement learning in display advertising. In CIKM. 1443--1451.
[32]
Xiao Yang, Daren Sun, Ruiwei Zhu, Tao Deng, Zhi Guo, Zongyao Ding, Shouke Qin, and Yanfeng Zhu. 2019. Aiads: Automated and intelligent advertising system for sponsored search. In SIGKDD . 1881--1890.
[33]
Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, and Jun Wang. 2018. Mean field multi-agent reinforcement learning. In ICML. PMLR, 5571--5580.
[34]
Shuai Yuan, Jun Wang, Bowei Chen, Peter Mason, and Sam Seljan. 2014. An empirical study of reserve price optimisation in real-time bidding. In SIGKDD . 1897--1906.

Cited By

View all
  • (2024)A Cloud-Edge Collaboration Solution for Distribution Network Reconfiguration Using Multi-Agent Deep Reinforcement LearningIEEE Transactions on Power Systems10.1109/TPWRS.2023.329646339:2(3867-3879)Online publication date: Mar-2024
  • (2024)Incentive Mechanism Design for ROI-Constrained Auto-biddingPRICAI 2024: Trends in Artificial Intelligence10.1007/978-981-96-0125-7_24(291-296)Online publication date: 12-Nov-2024
  • (2024)Bandits for Sponsored Search Auctions Under Unknown Valuation Model: Case Study in E-Commerce AdvertisingMachine Learning and Knowledge Discovery in Databases. Applied Data Science Track10.1007/978-3-031-70381-2_17(263-279)Online publication date: 22-Aug-2024
  • Show More Cited By

Index Terms

  1. A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
    February 2022
    1690 pages
    ISBN:9781450391320
    DOI:10.1145/3488560
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 February 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. auto-bidding
    2. bid optimization
    3. e-commerce advertising
    4. multi-agent reinforcement learning

    Qualifiers

    • Research-article

    Funding Sources

    • National Science Foundation of China
    • Shanghai Science and Technology Fund
    • Alibaba Research Intern Program
    • Science and Technology Innovation 2030 ? New Generation Artificial Intelligence
    • Alibaba Innovation Research Program

    Conference

    WSDM '22

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)106
    • Downloads (Last 6 weeks)9
    Reflects downloads up to 17 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Cloud-Edge Collaboration Solution for Distribution Network Reconfiguration Using Multi-Agent Deep Reinforcement LearningIEEE Transactions on Power Systems10.1109/TPWRS.2023.329646339:2(3867-3879)Online publication date: Mar-2024
    • (2024)Incentive Mechanism Design for ROI-Constrained Auto-biddingPRICAI 2024: Trends in Artificial Intelligence10.1007/978-981-96-0125-7_24(291-296)Online publication date: 12-Nov-2024
    • (2024)Bandits for Sponsored Search Auctions Under Unknown Valuation Model: Case Study in E-Commerce AdvertisingMachine Learning and Knowledge Discovery in Databases. Applied Data Science Track10.1007/978-3-031-70381-2_17(263-279)Online publication date: 22-Aug-2024
    • (2023)Coordinated dynamic bidding in repeated second-price auctions with budgetsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3618607(5052-5086)Online publication date: 23-Jul-2023
    • (2023)Research and applications of game intelligenceSCIENTIA SINICA Informationis10.1360/SSI-2023-001053:10(1892)Online publication date: 16-Oct-2023
    • (2023)A Survey on Bid Optimization in Real-Time Bidding Display AdvertisingACM Transactions on Knowledge Discovery from Data10.1145/362860318:3(1-31)Online publication date: 9-Dec-2023
    • (2023)Learning-Based Ad Auction Design with Externalities: The Framework and A Matching-Based ApproachProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599403(1291-1302)Online publication date: 6-Aug-2023

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media