[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3331184.3331370acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Reinforcement Learning for User Intent Prediction in Customer Service Bots

Published: 18 July 2019 Publication History

Abstract

A customer service bot is now a necessary component of an e-commerce platform. As a core module of the customer service bot, user intent prediction can help predict user questions before they ask. A typical solution is to find top candidate questions that a user will be interested in. Such solution ignores the inter-relationship between questions and often aims to maximize the immediate reward such as clicks, which may not be ideal in practice. Hence, we propose to view the problem as a sequential decision making process to better capture the long-term effects of each recommendation in the list. Intuitively, we formulate the problem as a Markov decision process and consider using reinforcement learning for the problem. With this approach, questions presented to users are both relevant and diverse. Experiments on offline real-world dataset and online system demonstrate the effectiveness of our proposed approach.

References

[1]
K. Arulkumaran, M. P. Deisenroth, M. Brundage, and A. A. Bharath. 2017. Deep Reinforcement Learning: A Brief Survey. IEEE Signal Processing Magazine(2017).
[2]
Heng-Tze Cheng and et al. 2016. Wide & deep learning for recommender systems. Proceedings of the 1st Workshop on Deep Learning for Recommender Systems(2016).
[3]
Mukund Deshpande and George Karypis. 2004. Item-based top-n recommendation algorithms. TOIS 22, 1 (2004), 143--177.
[4]
Yue Feng, Jun Xu, Yanyan Lan, Jiafeng Guo, Wei Zeng, and Xueqi Cheng. 2018. From Greedy Selection to Exploratory Decision-Making: Diverse Ranking with Policy-Value Networks. In SIGIR (SIGIR'18). 125--134.
[5]
Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, and Xiuqiang He. 2017. Deep FM: A Factorization-Machine based Neural Network for CTR Prediction. CoRRabs/1703.04247 (2017).
[6]
Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 EMNLP. 1746--1751.
[7]
V. R. Konda and J. N. Tsitsiklis. 2003. On Actor-Critic Algorithms. SIAM J. Control Optim. (2003).
[8]
F. Li, M. Qiu, H. Chen, X. Wang, X. Gao, J. Huang, J. Ren, Z. Zhao, W. Zhao, L. Wang, and G. Jin. 2017. Ali Me Assist: An Intelligent Assistant for Creating an Innovative E-commerce Experience. In CIKM '17.
[9]
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRRabs/1509.02971 (2015).
[10]
Tie-Yan Liu et al. 2009. Learning to rank for information retrieval. Foundations and Trends® in Information Retrieval3, 3 (2009), 225--331.
[11]
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A.Graves, M. A. Riedmiller, A. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis. 2015. Human-level control through deep reinforcement learning. Nature(2015).
[12]
Steffen Rendle. 2010. Factorization Machines. In Proceedings of the 2010 IEEE International Conference on Data Mining (ICDM '10). 995--1000.
[13]
G. A. Rummery and M. Niranjan. 1994.Online Q-Learning Using Connectionist Systems. Technical Report. University of Cambridge.
[14]
Guy Shani, David Heckerman, and Ronen I Brafman. 2005. An MDP-based recommender system. JMLR 6 (2005), 1265--1295.
[15]
R. S. Sutton and A. G. Barto. 1998.Reinforcement Learning - An Introduction. MIT Press.
[16]
Ruoxi Wang, Bin Fu, Gang Fu, and Mingliang Wang. 2017. Deep & Cross Network for Ad Click Predictions. In Proc. of the ADKDD'17. 12:1--12:7.
[17]
Zeng Wei, Jun Xu, Yanyan Lan, Jiafeng Guo, and Xueqi Cheng. 2017. Reinforcement Learning to Rank with Markov Decision Process. In SIGIR 17. 945--948.
[18]
R. J. Williams. 1992. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. Machine Learning(1992).
[19]
Xiangyu Zhao, Liang Zhang, Zhuoye Ding, Dawei Yin, Yihong Zhao, and Jiliang Tang. 2017. Deep reinforcement learning for list-wise recommendations. arXiv preprint arXiv:1801.00209(2017).

Cited By

View all
  • (2024)Utterance Intent Recognition for Online Retail2024 3rd International Conference on Digital Transformation and Applications (ICDXA)10.1109/ICDXA61007.2024.10470915(199-204)Online publication date: 29-Jan-2024
  • (2023)Developing smart city services using intent‐aware recommendation systems: A surveyTransactions on Emerging Telecommunications Technologies10.1002/ett.472834:4Online publication date: 12-Jan-2023
  • (2022)FinBrain 2.0: when finance meets trustworthy AI金融大脑2.0:当金融遇到可信人工智能Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.220003923:12(1747-1764)Online publication date: 30-Sep-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2019
1512 pages
ISBN:9781450361729
DOI:10.1145/3331184
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 July 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. customer service bots
  2. reinforcement learning
  3. user intent prediction

Qualifiers

  • Short-paper

Conference

SIGIR '19
Sponsor:

Acceptance Rates

SIGIR'19 Paper Acceptance Rate 84 of 426 submissions, 20%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)41
  • Downloads (Last 6 weeks)7
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Utterance Intent Recognition for Online Retail2024 3rd International Conference on Digital Transformation and Applications (ICDXA)10.1109/ICDXA61007.2024.10470915(199-204)Online publication date: 29-Jan-2024
  • (2023)Developing smart city services using intent‐aware recommendation systems: A surveyTransactions on Emerging Telecommunications Technologies10.1002/ett.472834:4Online publication date: 12-Jan-2023
  • (2022)FinBrain 2.0: when finance meets trustworthy AI金融大脑2.0:当金融遇到可信人工智能Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.220003923:12(1747-1764)Online publication date: 30-Sep-2022
  • (2022)IRnatorProceedings of the 2022 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3539813.3545152(138-143)Online publication date: 23-Aug-2022
  • (2022)Graph Based Long-Term And Short-Term Interest Model for Click-Through Rate PredictionProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557336(1818-1826)Online publication date: 17-Oct-2022
  • (2022)Exploring Spatial UI Transition Mechanisms with Head-Worn Augmented RealityProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3517723(1-16)Online publication date: 29-Apr-2022
  • (2022)Multimodal Marketing Intent Analysis for Effective Targeted AdvertisingIEEE Transactions on Multimedia10.1109/TMM.2021.307326724(1830-1843)Online publication date: 2022
  • (2022)Monetization of customer futures through machine learning and artificial intelligence based persuasive technologiesJournal of Science and Technology Policy Management10.1108/JSTPM-09-2021-013614:4(734-757)Online publication date: 31-May-2022
  • (2021)A Study of Deep Reinforcement Learning Based Recommender Systems2021 2nd International Conference on Secure Cyber Computing and Communications (ICSCCC)10.1109/ICSCCC51823.2021.9478178(218-220)Online publication date: 21-May-2021
  • (2020)Moving Deep Learning to the EdgeAlgorithms10.3390/a1305012513:5(125)Online publication date: 18-May-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media