[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/3304222.3304379guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

An ensemble of retrieval-based and generation-based human-computer conversation systems

Published: 13 July 2018 Publication History

Abstract

Human-computer conversation systems have attracted much attention in Natural Language Processing. Conversation systems can be roughly divided into two categories: retrieval-based and generation-based systems. Retrieval systems search a user-issued utterance (namely a query) in a large conversational repository and return a reply that best matches the query. Generative approaches synthesize new replies. Both ways have certain advantages but suffer from their own disadvantages. We propose a novel ensemble of retrieval-based and generation-based conversation system. The retrieved candidates, in addition to the original query, are fed to a reply generator via a neural network, so that the model is aware of more information. The generated reply together with the retrieved ones then participates in a re-ranking process to find the final reply to output. Experimental results show that such an ensemble system outperforms each single module by a large margin.

References

[1]
Dzmitry Bahdanau, KyungHyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. ICLR, 2015.
[2]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3(993-1022), 2003.
[3]
Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv:1406.1078, 2014.
[4]
Joseph L Fleiss. Measuring nominal scale agreement among many raters. Psychological Bulletin, 76(5):378, 1971.
[5]
Shizhu He, Cao Liu, Kang Liu, and Jun Zhao. Generating natural answers by incorporating copying and retrieving mechanisms in sequence-to-sequence learning. ACL, pages 199-208, july 2017.
[6]
Sepp Hochreiter and Jurgen Schmidhuber. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.
[7]
Thomas Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42(1):177-196, 2001.
[8]
Charles Lee Isbell, Michael Kearns, Dave Kormann, Satinder Singh, and Peter Stone. Cobot in LambdaMOO: A social statistics agent. In AAAI, pages 36-41, 2000.
[9]
Zongcheng Ji, Zhengdong Lu, and Hang Li. An information retrieval approach to short text conversation. arXiv:1408.6988, 2014.
[10]
Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. A diversity-promoting objective function for neural conversation models. In NAACL-HLT, pages 110-119, 2016.
[11]
Xiang Li, Lili Mou, Rui Yan, and Ming Zhang. StalemateBreaker: A proactive content-introducing approach to automatic human-computer conversation. In IJCAI, pages 2845-2851, 2016.
[12]
Jindrich Libovicky and Jindrich Helcl. Attention strategies for multisource sequence-to-sequence learning. ACL, pages 196-202, july 2017.
[13]
Chia-Wei Liu, Ryan Lowe, Iulian V Serban, Michael Noseworthy, Laurent Charlin, and Joelle Pineau. How NOT to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. In EMNLP, 2016.
[14]
Lili Mou, Yiping Song, Rui Yan, Ge Li, Lu Zhang, and Zhi Jin. Sequence to backward and forward sequences: A content-introducing approach to generative short-text conversation. COLING, pages 3349-3358, 2016.
[15]
Alan Ritter, Colin Cherry, and William B Dolan. Data-driven response generation in social media. In EMNLP, pages 583-593, 2011.
[16]
Iulian V Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courville, and Joelle Pineau. Building end-to-end dialogue systems using generative hierarchical neural network models. In AAAI, pages 3776- 3783, 2016.
[17]
Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron Courville, and Yoshua Bengio. A hierarchical latent variable encoder-decoder model for generating dialogues. arXiv:1605.06069, 2016.
[18]
Lifeng Shang, Zhengdong Lu, and Hang Li. Neural responding machine for short-text conversation. In ACL-IJCNLP, pages 1577-1586, July 2015.
[19]
Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Margaret Mitchell, Jian-Yun Nie, Jianfeng Gao, and Bill Dolan. A neural network approach to context-sensitive generation of conversational responses. In NAACL-HLT, pages 196-205, 2015.
[20]
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. Sequence to sequence learning with neural networks. In NIPS, pages 3104-3112, 2014.
[21]
Oriol Vinyals and Quoc Le. A neural conversational model. arXiv:1506.05869, 2015.
[22]
Chen Xing, Wei Wu, Yu Wu, Jie Liu, Yalou Huang, Ming Zhou, and Wei-Ying Ma. Topic augmented neural response generation with a joint attention mechanism. arXiv:1606.08340, 2016.
[23]
Rui Yan, Yiping Song, and Hua Wu. Learning to respond with deep neural networks for retrieval-based human-computer conversation system. In SIGIR, pages 55-64, 2016.
[24]
Rui Yan, Yiping Song, Xiangyang Zhou, and Hua Wu. "shall i be your chat companion?": Towards an online human-computer conversation system. CIKM, pages 649-658, 2016.
[25]
Jerry Ye, Jyh-Herng Chow, Jiang Chen, and Zhaohui Zheng. Stochastic gradient boosted distributed decision trees. Proceedings of the 18th ACM conference on Information and knowledge management., pages 2061- 2064, 2009.
[26]
Matthew D Zeiler. AdaDelta: An adaptive learning rate method. arXiv:1212.5701, 2012.
[27]
Barret Zoph and Kevin Knight. Multi-source neural translation. In NAACL-ACL, pages 30-34, 2016.

Cited By

View all
  • (2021)Coherent Dialog Generation with Query GraphACM Transactions on Asian and Low-Resource Language Information Processing10.1145/346255120:6(1-23)Online publication date: 12-Aug-2021
  • (2021)Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based ChatbotsACM Transactions on Information Systems10.1145/345318339:4(1-25)Online publication date: 17-Aug-2021
  • (2021)Data-Driven Imitation Learning for a Shopkeeper Robot with Periodically Changing Product InformationACM Transactions on Human-Robot Interaction10.1145/345188310:4(1-20)Online publication date: 14-Jul-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
IJCAI'18: Proceedings of the 27th International Joint Conference on Artificial Intelligence
July 2018
5885 pages
ISBN:9780999241127

Sponsors

  • Adobe
  • IBMR: IBM Research
  • ERICSSON
  • Microsoft: Microsoft
  • AI Journal: AI Journal

Publisher

AAAI Press

Publication History

Published: 13 July 2018

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Coherent Dialog Generation with Query GraphACM Transactions on Asian and Low-Resource Language Information Processing10.1145/346255120:6(1-23)Online publication date: 12-Aug-2021
  • (2021)Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based ChatbotsACM Transactions on Information Systems10.1145/345318339:4(1-25)Online publication date: 17-Aug-2021
  • (2021)Data-Driven Imitation Learning for a Shopkeeper Robot with Periodically Changing Product InformationACM Transactions on Human-Robot Interaction10.1145/345188310:4(1-20)Online publication date: 14-Jul-2021
  • (2021)Meta-evaluation of Conversational Search Evaluation MetricsACM Transactions on Information Systems10.1145/344502939:4(1-42)Online publication date: 1-Sep-2021
  • (2019)EnsembleGANProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331193(435-444)Online publication date: 18-Jul-2019

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media