Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
Strategizing against No-Regret Learners in First-Price Auctions
EC '24: Proceedings of the 25th ACM Conference on Economics and ComputationPages 894–921https://doi.org/10.1145/3670865.3673613We study repeated first-price auctions and general repeated Bayesian games between two players, where one player, the learner, employs a no-regret learning algorithm, and the other player, the optimizer, knowing the learner's algorithm, strategizes to ...
- extended-abstractDecember 2024
Steering No-Regret Learners to a Desired Equilibrium
- Brian Hu Zhang,
- Gabriele Farina,
- Ioannis Anagnostides,
- Federico Cacciamani,
- Stephen McAleer,
- Andreas Haupt,
- Andrea Celli,
- Nicola Gatti,
- Vincent Conitzer,
- Tuomas Sandholm
EC '24: Proceedings of the 25th ACM Conference on Economics and ComputationPages 73–74https://doi.org/10.1145/3670865.3673536A mediator observes no-regret learners playing an extensive-form game repeatedly across T rounds. The mediator attempts to steer players toward some desirable predetermined equilibrium by giving (nonnegative) payments to players. We call this the ...
- extended-abstractJuly 2021
Robust Repeated First Price Auctions
EC '21: Proceedings of the 22nd ACM Conference on Economics and ComputationPage 4https://doi.org/10.1145/3465456.3467590We study dynamic mechanisms for optimizing revenue in repeated auctions, that are robust to heterogeneous forward-looking and learning behavior of the buyers. Typically it is assumed that the buyers are either all myopic or are all infinite lookahead, ...
- research-articleMay 2020
Combining No-regret and Q-learning
AAMAS '20: Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent SystemsPages 593–601Counterfactual Regret Minimization (CFR) has found success in settings like poker which have both terminal states and perfect recall. We seek to understand how to relax these requirements. As a first step, we introduce a simple algorithm, local no-...
- research-articleJune 2018Best Student Paper
Selling to a No-Regret Buyer
EC '18: Proceedings of the 2018 ACM Conference on Economics and ComputationPages 523–538https://doi.org/10.1145/3219166.3219233We consider the problem of a single seller repeatedly selling a single item to a single buyer (specifically, the buyer has a value drawn fresh from known distribution D in every round). Prior work assumes that the buyer is fully rational and will ...
- invited-talkJune 2017
Fast convergence of learning in games (invited talk)
STOC 2017: Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of ComputingPage 5https://doi.org/10.1145/3055399.3084098A plethora of recent work has analyzed properties of outcomes in games when each player employs a no-regret learning algorithm. Many algorithms achieve regret against the best fixed action in hindisght that decays at a rate of O(1/'T), when the game is ...
- research-articleMay 2016
Playing Repeated Security Games with No Prior Knowledge
AAMAS '16: Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent SystemsPages 104–112This paper investigates repeated security games with unknown (to the defender) game payoffs and attacker behaviors. As existing work assumes prior knowledge about either the game payoffs or the attacker's behaviors, they are not suitable for tackling ...
- research-articleNovember 2015
Algorithmic game theory and econometrics
ACM SIGecom Exchanges (SIGECOM), Volume 14, Issue 1Pages 105–108https://doi.org/10.1145/2845926.2845935The traditional econometrics approach for inferring properties of strategic interactions that are not fully observable in the data, heavily relies on the assumption that the observed strategic behavior has settled at an equilibrium. This assumption is ...
- research-articleJune 2015
Econometrics for Learning Agents
EC '15: Proceedings of the Sixteenth ACM Conference on Economics and ComputationPages 1–18https://doi.org/10.1145/2764468.2764522The main goal of this paper is to develop a theory of inference of player valuations from observed data in the generalized second price auction without relying on the Nash equilibrium assumption. Existing work in Economics on inferring agent values from ...
- research-articleJune 2015
Commitment Without Regrets: Online Learning in Stackelberg Security Games
EC '15: Proceedings of the Sixteenth ACM Conference on Economics and ComputationPages 61–78https://doi.org/10.1145/2764468.2764478In a Stackelberg Security Game, a defender commits to a randomized deployment of security resources, and an attacker best-responds by attacking a target that maximizes his utility. While algorithms for computing an optimal strategy for the defender to ...
- research-articleJune 2013
Differential privacy for the analyst via private equilibrium computation
STOC '13: Proceedings of the forty-fifth annual ACM symposium on Theory of ComputingPages 341–350https://doi.org/10.1145/2488608.2488651We give new mechanisms for answering exponentially many queries from multiple analysts on a private database, while protecting dif- ferential privacy both for the individuals in the database and for the analysts. That is, our mechanism's answer to each ...
- ArticleOctober 2011
No regret learning for sensor relocation in mobile sensor networks
ICICA'11: Proceedings of the Second international conference on Information Computing and ApplicationsPages 216–223https://doi.org/10.1007/978-3-642-25255-6_28Sensor relocation is a critical issue because it affects the coverage quality and capability of a mobile sensor network. In this paper, the problem of sensor relocation is formulated as a repeated multi-player game. At each step of repeated interactions,...
- ArticleDecember 2009
Energy-efficient dynamic spectrum access using no-regret learning
ICICS'09: Proceedings of the 7th international conference on Information, communications and signal processingPages 566–570In this paper, we consider a cross-layer design of dynamic spectrum access in distributive cognitive radio (CR) networks. We model the licensed channel as a finite-state Markov channel (FSMC) and the CR user selects one channel to access and decides ...
- articleDecember 2006
Adaptive channel allocation spectrum etiquette for cognitive radio networks
Mobile Networks and Applications (MNET), Volume 11, Issue 6Pages 779–797https://doi.org/10.1007/s11036-006-0049-yIn this work, we propose a game theoretic framework to analyze the behavior of cognitive radios for distributed adaptive channel allocation. We define two different objective functions for the spectrum sharing games, which capture the utility of selfish ...