More Web Proxy on the site http://driver.im/

research-article

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising

Authors:

Bo ZhengAuthors Info & Claims

WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining

Pages 1129 - 1139

https://doi.org/10.1145/3488560.3498373

Published: 15 February 2022 Publication History

Abstract

In online advertising, auto-bidding has become an essential tool for advertisers to optimize their preferred ad performance metrics by simply expressing high-level campaign objectives and constraints. Previous works designed auto-bidding tools from the view of single-agent, without modeling the mutual influence between agents. In this paper, we instead consider this problem from a distributed multi-agent perspective, and propose a general \underlineM ulti-\underlineA gent reinforcement learning framework for \underlineA uto-\underlineB idding, namely MAAB, to learn the auto-bidding strategies. First, we investigate the competition and cooperation relation among auto-bidding agents, and propose a temperature-regularized credit assignment to establish a mixed cooperative-competitive paradigm. By carefully making a competition and cooperation trade-off among agents, we can reach an equilibrium state that guarantees not only individual advertiser's utility but also the system performance (i.e., social welfare). Second, to avoid the potential collusion behaviors of bidding low prices underlying the cooperation, we further propose bar agents to set a personalized bidding bar for each agent, and then alleviate the revenue degradation due to the cooperation. Third, to deploy MAAB in the large-scale advertising system with millions of advertisers, we propose a mean-field approach. By grouping advertisers with the same objective as a mean auto-bidding agent, the interactions among the large-scale advertisers are greatly simplified, making it practical to train MAAB efficiently. Extensive experiments on the offline industrial dataset and Alibaba advertising platform demonstrate that our approach outperforms several baseline methods in terms of social welfare and revenue.

Supplementary Material

MP4 File (WSDM22-fp018.mp4)

Presentation video for paper "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising"

Download
235.12 MB

References

[1]

Gagan Aggarwal, Ashwinkumar Badanidiyuru, and Aranyak Mehta. 2019. Autobidding with constraints. In WINE. Springer, 17--30.

[2]

Han Cai, Kan Ren, Weinan Zhang, Kleanthis Malialis, Jun Wang, Yong Yu, and Defeng Guo. 2017. Real-time bidding by reinforcement learning in display advertising. In WSDM . 661--670.

[3]

Google Ads Help Center. 2021. About automated bidding. https://support.google.com/google-ads/answer/2979071 . Accessed: January 24, 2021.

[4]

Carl Davidson and Raymond Deneckere. 1986. Long-run competition in capacity, short-run competition in price, and the Cournot model. The Rand Journal of Economics (1986), 404--415.

[5]

Paul Dütting, Zhe Feng, Harikrishna Narasimhan, David Parkes, and Sai Srivatsa Ravindranath. 2019. Optimal auctions through deep learning. In ICML. PMLR, 1706--1715.

[6]

Benjamin Edelman, Michael Ostrovsky, and Michael Schwarz. 2007. Internet advertising and the generalized second-price auction: Selling billions of dollars worth of keywords. American economic review, Vol. 97, 1 (2007), 242--259.

[7]

eMarketer. 2015. Worldwide retail ecommerce sales: eMarketer's updated estimates and forecast through 2019. (2015).

[8]

Facebook. 2021. Facebook. https://www.facebook.com/business/m/one-sheeters/facebook-bid-strategy-guide . Accessed: January 24, 2021.

[9]

Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, and Shimon Whiteson. 2018. Counterfactual multi-agent policy gradients. In AAAI, Vol. 32.

[10]

Google. 2021. Google AdWords API. https://developers.google.com/adwords/api/docs/guides/start . Accessed: January 24, 2021.

[11]

Ziyu Guan, Hongchang Wu, Qingyu Cao, Hao Liu, Wei Zhao, Sheng Li, Cai Xu, Guang Qiu, Jian Xu, and Bo Zheng. 2021. Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search. arXiv preprint arXiv:2106.04075 (2021).

[12]

Garrett Hardin. 2009. The tragedy of the commons. Journal of Natural Resources Policy Research, Vol. 1, 3 (2009), 243--253.

[13]

Pablo Hernandez-Leal, Bilal Kartal, and Matthew E Taylor. 2019. A survey and critique of multiagent deep reinforcement learning. AAMAS, Vol. 33, 6 (2019), 750--797.

Digital Library

[14]

Junqi Jin, Chengru Song, Han Li, Kun Gai, Jun Wang, and Weinan Zhang. 2018. Real-time bidding with multi-agent reinforcement learning in display advertising. In CIKM . 2193--2201.

[15]

Jean-Michel Lasry and Pierre-Louis Lions. 2007. Mean field games. Japanese journal of mathematics, Vol. 2, 1 (2007), 229--260.

[16]

Joel Z Leibo and Marc Lanctot. 2017. Multi-agent Reinforcement Learning in Sequential Social Dilemmas . (2017). arxiv: arXiv:1702.03037v1

[17]

Michael L Littman. 1994. Markov games as a framework for multi-agent reinforcement learning. In Machine learning proceedings 1994 . Elsevier, 157--163.

[18]

Xiangyu Liu, Chuan Yu, Zhilin Zhang, Zhenzhe Zheng, Yu Rong, Hongtao Lv, Da Huo, Yiqing Wang, Dagui Chen, Jian Xu, Fan Wu, Guihai Chen, and Xiaoqiang Zhu. 2021. Neural Auction: End-to-End Learning of Auction Mechanisms for E-Commerce Advertising. In SIGKDD. 3354--3364.

[19]

Ryan Lowe, Yi I Wu, Aviv Tamar, Jean Harb, OpenAI Pieter Abbeel, and Igor Mordatch. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. In NIPS . 6379--6390.

Digital Library

[20]

Robert C Marshall and Leslie M Marx. 2007. Bidder collusion. Journal of Economic Theory, Vol. 133, 1 (2007), 374--402.

[21]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. nature, Vol. 518, 7540 (2015), 529--533.

[22]

Mehryar Mohri and Andres Munoz Medina. 2014. Learning theory and algorithms for revenue optimization in second price auctions with reserve. In ICML. PMLR, 262--270.

[23]

Roger B Myerson. 1981. Optimal auction design. Mathematics of operations research, Vol. 6, 1 (1981), 58--73.

[24]

Michael Ostrovsky and Michael Schwarz. 2011. Reserve prices in internet advertising auctions: A field experiment. In EC . 59--60.

[25]

Tabish Rashid, Mikayel Samvelyan, Christian Schroeder, Gregory Farquhar, Jakob Foerster, and Shimon Whiteson. 2018. QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. In ICML. 4295--4304.

[26]

Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinícius Flores Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z Leibo, Karl Tuyls, et al. 2018. Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward. In AAMAS. 2085--2087.

[27]

Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan Korjus, Juhan Aru, Jaan Aru, and Raul Vicente. 2017. Multiagent cooperation and competition with deep reinforcement learning. PloS one, Vol. 12, 4 (2017), e0172395.

[28]

Ming Tan. 1993. Multi-agent reinforcement learning: Independent vs. cooperative agents. In ICML . 330--337.

[29]

David RM Thompson and Kevin Leyton-Brown. 2013. Revenue optimization in the generalized second-price auction. In EC . 837--852.

[30]

Chao Wen, Xinghu Yao, Yuhui Wang, and Xiaoyang Tan. 2020. SMIX ($łambda$): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. In AAAI. 7301--7308.

[31]

Di Wu, Xiujun Chen, Xun Yang, Hao Wang, Qing Tan, Xiaoxun Zhang, Jian Xu, and Kun Gai. 2018. Budget constrained bidding by model-free reinforcement learning in display advertising. In CIKM. 1443--1451.

[32]

Xiao Yang, Daren Sun, Ruiwei Zhu, Tao Deng, Zhi Guo, Zongyao Ding, Shouke Qin, and Yanfeng Zhu. 2019. Aiads: Automated and intelligent advertising system for sponsored search. In SIGKDD . 1881--1890.

Digital Library

[33]

Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, and Jun Wang. 2018. Mean field multi-agent reinforcement learning. In ICML. PMLR, 5571--5580.

[34]

Shuai Yuan, Jun Wang, Bowei Chen, Peter Mason, and Sam Seljan. 2014. An empirical study of reserve price optimisation in real-time bidding. In SIGKDD . 1897--1906.

Cited By

Gao HWang RHe SWang LLiu JChen Z(2024)A Cloud-Edge Collaboration Solution for Distribution Network Reconfiguration Using Multi-Agent Deep Reinforcement LearningIEEE Transactions on Power Systems10.1109/TPWRS.2023.329646339:2(3867-3879)Online publication date: Mar-2024
https://doi.org/10.1109/TPWRS.2023.3296463
Li BHao D(2024)Incentive Mechanism Design for ROI-Constrained Auto-biddingPRICAI 2024: Trends in Artificial Intelligence10.1007/978-981-96-0125-7_24(291-296)Online publication date: 12-Nov-2024
https://doi.org/10.1007/978-981-96-0125-7_24
Provodin DJoudioux JDuryev E(2024)Bandits for Sponsored Search Auctions Under Unknown Valuation Model: Case Study in E-Commerce AdvertisingMachine Learning and Knowledge Discovery in Databases. Applied Data Science Track10.1007/978-3-031-70381-2_17(263-279)Online publication date: 22-Aug-2024
https://doi.org/10.1007/978-3-031-70381-2_17
Show More Cited By

Index Terms

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising
1. Information systems
  1. Information systems applications
    1. Computational advertising

Recommendations

A Unified Solution to Constrained Bidding in Online Display Advertising
KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining

In online display advertising, advertisers usually participate in real-time bidding to acquire ad impression opportunities. In most advertising platforms, a typical impression acquiring demand of advertisers is to maximize the sum value of winning ...
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management

Real-time advertising allows advertisers to bid for each impression for a visiting user. To optimize specific goals such as maximizing revenue and return on investment (ROI) led by ad placements, advertisers not only need to estimate the relevance ...
A Personalized Automated Bidding Framework for Fairness-aware Online Advertising
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Powered by machine learning techniques, online advertising platforms have launched various automated bidding strategy services to facilitate intelligent decision-making for advertisers. However, advertisers experience heterogeneous advertising ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining

February 2022

1690 pages

ISBN:9781450391320

DOI:10.1145/3488560

General Chairs:
K. Selcuk Candan
Arizona State University, USA
,
Huan Liu
Arizona State University, USA
,
Program Chairs:
Leman Akoglu
Carnegie Mellon University, USA
,
Xin Luna Dong
Meta Platforms, Inc. (former Facebook), USA
,
Jiliang Tang
Michigan State University, USA

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 February 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Science Foundation of China
Shanghai Science and Technology Fund
Alibaba Research Intern Program
Science and Technology Innovation 2030 ? New Generation Artificial Intelligence
Alibaba Innovation Research Program

Conference

WSDM '22

Sponsor:

WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining

February 21 - 25, 2022

AZ, Virtual Event, USA

Acceptance Rates

Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

7
Total Citations
View Citations
492
Total Downloads

Downloads (Last 12 months)106
Downloads (Last 6 weeks)9

Reflects downloads up to 17 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Gao HWang RHe SWang LLiu JChen Z(2024)A Cloud-Edge Collaboration Solution for Distribution Network Reconfiguration Using Multi-Agent Deep Reinforcement LearningIEEE Transactions on Power Systems10.1109/TPWRS.2023.329646339:2(3867-3879)Online publication date: Mar-2024
https://doi.org/10.1109/TPWRS.2023.3296463
Li BHao D(2024)Incentive Mechanism Design for ROI-Constrained Auto-biddingPRICAI 2024: Trends in Artificial Intelligence10.1007/978-981-96-0125-7_24(291-296)Online publication date: 12-Nov-2024
https://doi.org/10.1007/978-981-96-0125-7_24
Provodin DJoudioux JDuryev E(2024)Bandits for Sponsored Search Auctions Under Unknown Valuation Model: Case Study in E-Commerce AdvertisingMachine Learning and Knowledge Discovery in Databases. Applied Data Science Track10.1007/978-3-031-70381-2_17(263-279)Online publication date: 22-Aug-2024
https://doi.org/10.1007/978-3-031-70381-2_17
Chen YWang QDuan ZSun HChen ZYan XDeng XKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Coordinated dynamic bidding in repeated second-price auctions with budgetsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3618607(5052-5086)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3618607
HAO JSHAO KLI KLI DMAO HHU SWANG Z(2023)Research and applications of game intelligenceSCIENTIA SINICA Informationis10.1360/SSI-2023-001053:10(1892)Online publication date: 16-Oct-2023
https://doi.org/10.1360/SSI-2023-0010
Ou WChen BDai XZhang WLiu WTang RYu Y(2023)A Survey on Bid Optimization in Real-Time Bidding Display AdvertisingACM Transactions on Knowledge Discovery from Data10.1145/362860318:3(1-31)Online publication date: 9-Dec-2023
https://dl.acm.org/doi/10.1145/3628603
Li NMa YZhao YDuan ZChen YZhang ZXu JZheng BDeng XSingh ASun YAkoglu LGunopulos DYan XKumar ROzcan FYe J(2023)Learning-Based Ad Auction Design with Externalities: The Framework and A Matching-Based ApproachProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599403(1291-1302)Online publication date: 6-Aug-2023
https://dl.acm.org/doi/10.1145/3580305.3599403

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents