research-article

Exact-K Recommendation via Maximal Clique Optimization

Authors:

Kenny Q. Zhu

KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 617 - 626

https://doi.org/10.1145/3292500.3330832

Published: 25 July 2019

Abstract

This paper targets a novel but practical recommendation problem named exact-K recommendation. It differs from traditional top-K recommendation in that it focuses on (constrained) combinatorial optimization, which recommends a whole set of K items, called a card, rather than on ranking optimization, which assumes that "better" items should be placed in top positions. We take the first step of giving a formal problem definition and reduce it to Maximum Clique Optimization on a graph. To tackle this NP-hard combinatorial optimization problem, we propose Graph Attention Networks (GAttN) with a multi-head self-attention encoder and a decoder with an attention mechanism. The model learns the joint distribution of the K items end to end and generates an optimal card rather than ranking individual items by prediction scores. We then propose Reinforcement Learning from Demonstrations (RLfD), which combines the advantages of behavior cloning and reinforcement learning, making training both sufficient and efficient. Extensive experiments on three datasets demonstrate the effectiveness of the proposed GAttN with RLfD method: it outperforms several strong baselines with relative improvements of 7.7% and 4.7% on average in Precision and Hit Ratio, respectively, and achieves state-of-the-art (SOTA) performance for the exact-K recommendation problem.
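
To make the difference between top-K ranking and exact-K selection concrete, the following is a minimal brute-force sketch, not the GAttN model described above. It assumes a hypothetical per-item relevance score and a hypothetical constraint graph whose edges mark item pairs that may appear on the same card; the card score is taken to be the plain sum of item scores, which is a simplification (the paper learns a joint distribution over the K items instead). The names `is_clique` and `best_exact_k_card` are illustrative only.

```python
from itertools import combinations


def is_clique(items, edges):
    """Return True if every pair of items is connected in the graph."""
    return all(frozenset(pair) in edges for pair in combinations(items, 2))


def best_exact_k_card(scores, edges, k):
    """Brute-force search for the highest-scoring clique of exactly k items.

    scores: dict item -> float (hypothetical per-item relevance)
    edges:  set of frozenset({a, b}) pairs allowed to co-occur
            (hypothetical pairwise constraint)
    """
    best_card, best_score = None, float("-inf")
    for card in combinations(sorted(scores), k):
        if not is_clique(card, edges):
            continue  # the card violates at least one pairwise constraint
        total = sum(scores[item] for item in card)
        if total > best_score:
            best_card, best_score = card, total
    return best_card, best_score


# Toy data: "a", "b", "c" have the three highest scores,
# but "a" and "c" are not allowed on the same card.
scores = {"a": 0.9, "b": 0.8, "c": 0.7, "d": 0.4}
edges = {
    frozenset(pair)
    for pair in [("a", "b"), ("a", "d"), ("b", "d"), ("b", "c"), ("c", "d")]
}
card, total = best_exact_k_card(scores, edges, k=3)
print(card)
```

In this toy example a plain top-3 ranking would pick a, b, and c, which is not a valid card because a and c are not connected; the constrained search returns a, b, and d instead. The enumeration is exponential in the number of items, which is consistent with the NP-hardness noted in the abstract and is why a learned approach is proposed there.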

Index Terms

Exact-K Recommendation via Maximal Clique Optimization
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Learning to rank
    2. Retrieval tasks and goals
      1. Recommender systems

Information & Contributors

Information

Published In

KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

July 2019

3305 pages

ISBN: 9781450362016

DOI: 10.1145/3292500

General Chairs: Ankur Teredesai (KenSci), Vipin Kumar (University of Minnesota)

Program Chairs: Ying Li (EV Analysis Corporation), Rómer Rosales (LinkedIn), Evimaria Terzi (Boston University), George Karypis (University of Minnesota)

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

KDD '19

KDD '19: The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 4 - 8, 2019

Anchorage, AK, USA

Acceptance Rates

KDD '19 Paper Acceptance Rate: 110 of 1,200 submissions, 9%

Overall Acceptance Rate: 1,089 of 8,328 submissions, 13%

Bibliometrics & Citations

Bibliometrics

Article Metrics

Total Citations: 34
Total Downloads: 1,275
Downloads (Last 12 months): 40
Downloads (Last 6 weeks): 3

Reflects downloads up to 12 Dec 2024

Citations

Cited By

Liu Y, Li G, Payne T, Yue Y, Man K (2024). Non-Stationary Transformer Architecture: A Versatile Framework for Recommendation Systems. Electronics 13:11 (2075). Online publication date: 27-May-2024.
https://doi.org/10.3390/electronics13112075
Wang Y, Li Z, Zhang C, Chen S, Zhang X, Xu J, Lin Q (2024). Do Not Wait: Learning Re-Ranking Model Without User Feedback At Serving Time in E-Commerce. Proceedings of the 18th ACM Conference on Recommender Systems (896-901). Online publication date: 8-Oct-2024.
https://dl.acm.org/doi/10.1145/3640457.3688165
Sun Y, Wang K, Hu Z, Wu R, Wu Y, Song W, Shen X, Lv T, Fan C, Baeza-Yates R, Bonchi F (2024). MGMatch: Fast Matchmaking with Nonlinear Objective and Constraints via Multimodal Deep Graph Learning. Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (5741-5751). Online publication date: 25-Aug-2024.
https://dl.acm.org/doi/10.1145/3637528.3671553
Huang H, Shen L, Ye D, Liu W (2024). Master–Slave Deep Architecture for Top-K Multiarmed Bandits With Nonlinear Bandit Feedback and Diversity Constraints. IEEE Transactions on Neural Networks and Learning Systems 35:12 (17608-17619). Online publication date: Dec-2024.
https://doi.org/10.1109/TNNLS.2023.3306801
Lin Y, Liu Y, Lin F, Zou L, Wu P, Zeng W, Chen H, Miao C (2024). A Survey on Reinforcement Learning for Recommender Systems. IEEE Transactions on Neural Networks and Learning Systems 35:10 (13164-13184). Online publication date: Oct-2024.
https://doi.org/10.1109/TNNLS.2023.3280161
Li G, Liu Y (2024). Recommendation Systems with Non-stationary Transformer. 2024 29th International Conference on Automation and Computing (ICAC) (1-6). Online publication date: 28-Aug-2024.
https://doi.org/10.1109/ICAC61394.2024.10718828
Liu J, Su X, He Z, Li J (2024). Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation. 2024 27th International Conference on Computer Supported Cooperative Work in Design (CSCWD) (1262-1267). Online publication date: 8-May-2024.
https://doi.org/10.1109/CSCWD61410.2024.10580175
Yang W, Yang C, Li J, Tan Y, Lu X, Shi C (2024). Non-autoregressive personalized bundle generation. Information Processing & Management 61:5 (103814). Online publication date: Sep-2024.
https://doi.org/10.1016/j.ipm.2024.103814
He L, Luo K, Ding Z, Shao H, Bai B (2024). A Reinforcement Learning Approach for Personalized Diversity in Feeds Recommendation. Artificial Intelligence (463-475). Online publication date: 3-Feb-2024.
https://doi.org/10.1007/978-981-99-9119-8_42
Singha Roy A, D'Amico E, Tragos E, Lawlor A, Hurley N (2023). Scalable Deep Q-Learning for Session-Based Slate Recommendation. Proceedings of the 17th ACM Conference on Recommender Systems (877-882). Online publication date: 14-Sep-2023.
https://dl.acm.org/doi/10.1145/3604915.3608843
