Handling Confounding for Realistic Off-Policy Evaluation
Abstract
References
Index Terms
- Handling Confounding for Realistic Off-Policy Evaluation
Recommendations
Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model
WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data MiningIn real-world recommender systems and search engines, optimizing ranking decisions to present a ranked list of relevant items is critical. Off-policy evaluation (OPE) for ranking policies is thus gaining a growing interest because it enables performance ...
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
WWW '24: Proceedings of the ACM Web Conference 2024We study off-policy evaluation (OPE) in the problem of slate contextual bandits where a policy selects multi-dimensional actions known as slates. This problem is widespread in recommender systems, search engines, marketing, to medical applications, ...
Gradient temporal-difference learning for off-policy evaluation using emphatic weightings
AbstractThe problem of off-policy evaluation (OPE) has long been advocated as one of the foremost challenges in reinforcement learning. Gradient-based and emphasis-based temporal-difference (TD) learning algorithms comprise the major part of ...
Comments
Please enable JavaScript to view thecomments powered by Disqus.Information & Contributors
Information
Published In
- General Chairs:
- Pierre-Antoine Champin,
- Fabien Gandon,
- Lionel Médini,
- Program Chairs:
- Mounia Lalmas,
- Panagiotis G. Ipeirotis
Sponsors
- IW3C2: International World Wide Web Conference Committee
In-Cooperation
Publisher
International World Wide Web Conferences Steering Committee
Republic and Canton of Geneva, Switzerland
Publication History
Check for updates
Author Tags
Qualifiers
- Poster
Conference
- IW3C2
Acceptance Rates
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 313Total Downloads
- Downloads (Last 12 months)53
- Downloads (Last 6 weeks)10
Other Metrics
Citations
View Options
View options
View or Download as a PDF file.
PDFeReader
View online with eReader.
eReaderHTML Format
View this article in HTML Format.
HTML FormatLogin options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in