short-paper

Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots

Authors:

Si Wei,

Xiaodan ZhuAuthors Info & Claims

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

Pages 2041 - 2044

https://doi.org/10.1145/3340531.3412330

Published: 19 October 2020 Publication History

Get Access

Abstract

In this paper, we study the problem of employing pre-trained language models for multi-turn response selection in retrieval-based chatbots. A new model, named Speaker-Aware BERT (SA-BERT), is proposed in order to make the model aware of the speaker change information, which is an important and intrinsic property of multi-turn dialogues. Furthermore, a speaker-aware disentanglement strategy is proposed to tackle the entangled dialogues. This strategy selects a small number of most important utterances as the filtered context according to the speakers' information in them. Finally, domain adaptation is performed to incorporate the in-domain knowledge into pre-trained language models. Experiments on five public datasets show that our proposed model outperforms the present models on all metrics by large margins and achieves new state-of-the-art performances for multi-turn response selection.

Supplementary Material

MP4 File (3340531.3412330.mp4)

Download
7.05 MB

References

[1]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of NAACL-HLT. 4171--4186.

Google Scholar

[2]

Jianxiong Dong and Jim Huang. 2018. Enhance word representation for out-of-vocabulary on Ubuntu dialogue corpus. CoRR, Vol. abs/1802.02614 (2018).

Google Scholar

[3]

Jia-Chen Gu, Zhen-Hua Ling, and Quan Liu. 2019. Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. In Proceedings of the 28th ACM International Conference of CIKM 2019. 2321--2324.

Digital Library

Google Scholar

[4]

Matthew Henderson, Ivan Vulic, Daniela Gerz, I n igo Casanueva, Pawel Budzianowski, Sam Coope, Georgios Spithourakis, Tsung-Hsien Wen, Nikola Mrksic, and Pei-Hao Su. 2019. Training Neural Response Selection for Task-Oriented Dialogue Systems. In Proceedings of the 57th Conference of ACL 2019. 5392--5404.

Crossref

Google Scholar

[5]

Seokhwan Kim, Michel Galley, Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, et almbox. 2019. The Eighth Dialog System Technology Challenge. arXiv preprint arXiv:1911.06394 (2019).

Google Scholar

[6]

Ryan Lowe, Nissan Pow, Iulian Serban, and Joelle Pineau. 2015. The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems. In Proceedings of the SIGDIAL 2015 Conference. 285--294.

Crossref

Google Scholar

[7]

Ryan Thomas Lowe, Nissan Pow, Iulian Vlad Serban, Laurent Charlin, Chia-Wei Liu, and Joelle Pineau. 2017. Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus. (2017), 31--65.

Google Scholar

[8]

Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, and Rui Yan. 2019 a. Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. In Proceedings of the Twelfth ACM International Conference of WSDM 2019. 267--275.

Digital Library

Google Scholar

[9]

Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, and Rui Yan. 2019 b. One Time of Interaction May Not Be Enough: Go Deep with an Interaction-over-Interaction Network for Response Selection in Dialogues. In Proceedings of the 57th Conference of ACL 2019. 1--11.

Crossref

Google Scholar

[10]

Yu Wu, Wei Wu, Chen Xing, Ming Zhou, and Zhoujun Li. 2017. Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots. In Proceedings of the 55th Conference of ACL 2017. 496--505.

Crossref

Google Scholar

[11]

Chunyuan Yuan, Wei Zhou, Mingming Li, Shangwen Lv, Fuqing Zhu, Jizhong Han, and Songlin Hu. 2019. Multi-hop Selector Network for Multi-turn Response Selection in Retrieval-based Chatbots. In Proceedings of the 2019 Conference of EMNLP-IJCNLP. 111--120.

Crossref

Google Scholar

[12]

Zhuosheng Zhang, Jiangtong Li, Pengfei Zhu, Hai Zhao, and Gongshen Liu. 2018. Modeling Multi-turn Conversation with Deep Utterance Aggregation. In Proceedings of the 27th International Conference of COLING 2018. 3740--3752.

Google Scholar

[13]

Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dianhai Yu, and Hua Wu. 2018. Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network. In Proceedings of the 56th Conference of ACL 2018. 1118--1127.

Crossref

Google Scholar

Cited By

View all

Chen MGuo BWang HLi HZhao QLiu JDing YPan YYu Z(2025)The future of cognitive strategy-enhanced persuasive dialogue agents: new perspectives and trendsFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-024-40057-x19:5Online publication date: 1-May-2025
https://dl.acm.org/doi/10.1007/s11704-024-40057-x
Hu HXiang ZLi JGao HWang S(2024)Research on Effective Information Extraction Techniques for Multi-Round Dialogues of Large-Scale Models in Deep Learning EnvironmentApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-35699:1Online publication date: 27-Nov-2024
https://doi.org/10.2478/amns-2024-3569
Hu JGuo JTang NMa XYao YYang CXu Y(2024)Designing the Conversational Agent: Asking Follow-up Questions for Information ElicitationProceedings of the ACM on Human-Computer Interaction10.1145/36373208:CSCW1(1-30)Online publication date: 26-Apr-2024
https://dl.acm.org/doi/10.1145/3637320
Show More Cited By

Index Terms

Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Recommendations

Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots
WSDM '19: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining

We consider context-response matching with multiple types of representations for multi-turn response selection in retrieval-based chatbots. The representations encode semantics of contexts and responses on words, n-grams, and sub-sequences of utterances, ...
Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots
CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management

In this paper, we propose an interactive matching network (IMN) for the multi-turn response selection task. First, IMN constructs word representations from three aspects to address the challenge of out-of-vocabulary (OOV) words. Second, an attentive ...
Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based Chatbots
Existing multi-turn context-response matching methods mainly concentrate on obtaining multi-level and multi-dimension representations and better interactions between context utterances and response. However, in real-place conversation scenarios, whether a ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management

October 2020

3619 pages

ISBN:9781450368599

DOI:10.1145/3340531

General Chairs:
Mathieu d'Aquin
DSI, Insight, NUI Galway, Ireland
,
Stefan Dietze
GESIS, Cologne, Germany, Heinrich-Heine-University Düsseldorf, Germany, L3S Research Center, Germany
,
Program Chairs:
Claudia Hauff
TU Delft, The Netherlands
,
Edward Curry
DSI, Insight, NUI Galway, Ireland
,
Philippe Cudre Mauroux
eXascale, University of Fribourg, Switzerland

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

CIKM '20

Sponsor:

CIKM '20: The 29th ACM International Conference on Information and Knowledge Management

October 19 - 23, 2020

Virtual Event, Ireland

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

56
Total Citations
View Citations
771
Total Downloads

Downloads (Last 12 months)70
Downloads (Last 6 weeks)4

Reflects downloads up to 11 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Chen MGuo BWang HLi HZhao QLiu JDing YPan YYu Z(2025)The future of cognitive strategy-enhanced persuasive dialogue agents: new perspectives and trendsFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-024-40057-x19:5Online publication date: 1-May-2025
https://dl.acm.org/doi/10.1007/s11704-024-40057-x
Hu HXiang ZLi JGao HWang S(2024)Research on Effective Information Extraction Techniques for Multi-Round Dialogues of Large-Scale Models in Deep Learning EnvironmentApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-35699:1Online publication date: 27-Nov-2024
https://doi.org/10.2478/amns-2024-3569
Hu JGuo JTang NMa XYao YYang CXu Y(2024)Designing the Conversational Agent: Asking Follow-up Questions for Information ElicitationProceedings of the ACM on Human-Computer Interaction10.1145/36373208:CSCW1(1-30)Online publication date: 26-Apr-2024
https://dl.acm.org/doi/10.1145/3637320
Xie YSun CLiu YJi ZLiu BSerra ESpezzano F(2024)UniMPC: Towards a Unified Framework for Multi-Party ConversationsProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679864(2639-2649)Online publication date: 21-Oct-2024
https://dl.acm.org/doi/10.1145/3627673.3679864
Zhang ZZhao HLiu L(2024)Channel-Aware Decoupling Network for Multiturn Dialog ComprehensionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.322004735:6(7685-7696)Online publication date: Jun-2024
https://doi.org/10.1109/TNNLS.2022.3220047
Chen TShen YChen XZhang LZhao S(2024)MPEG: A Multi-Perspective Enhanced Graph Attention Network for Causal Emotion Entailment in ConversationsIEEE Transactions on Affective Computing10.1109/TAFFC.2023.331575215:3(1004-1017)Online publication date: Jul-2024
https://doi.org/10.1109/TAFFC.2023.3315752
Gao XZhou XCao RZhang M(2024)TGAT-DGL: Triple Graph Attention Networks on Dual-Granularity Level for Multi-party Dialogue Reading Comprehension2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10651541(1-8)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10651541
Priyadarshana YLiang ZPiumarta I(2024)ProDepDet: Out-of-domain Knowledge Transfer of Pre-trained Large Language Models for Depression Detection in Text-Based Multi-Party Conversations2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650774(1-8)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10650774
Huang SHuang PXu YLiang JNiu J(2024)Exploring Label Hierarchy in Dialogue Intent ClassificationICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10448380(11511-11515)Online publication date: 14-Apr-2024
https://doi.org/10.1109/ICASSP48485.2024.10448380
Huang XRuan WHuang WJin GDong YWu CBensalem SMu RQi YZhao XCai KZhang YWu SXu PWu DFreitas AMustafa M(2024)A survey of safety and trustworthiness of large language models through the lens of verification and validationArtificial Intelligence Review10.1007/s10462-024-10824-057:7Online publication date: 17-Jun-2024
https://doi.org/10.1007/s10462-024-10824-0
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots

Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots

Dialogue History Matters! Personalized Response Selection in Multi-Turn Retrieval-Based Chatbots