[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3340531.3412330acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots

Published: 19 October 2020 Publication History

Abstract

In this paper, we study the problem of employing pre-trained language models for multi-turn response selection in retrieval-based chatbots. A new model, named Speaker-Aware BERT (SA-BERT), is proposed in order to make the model aware of the speaker change information, which is an important and intrinsic property of multi-turn dialogues. Furthermore, a speaker-aware disentanglement strategy is proposed to tackle the entangled dialogues. This strategy selects a small number of most important utterances as the filtered context according to the speakers' information in them. Finally, domain adaptation is performed to incorporate the in-domain knowledge into pre-trained language models. Experiments on five public datasets show that our proposed model outperforms the present models on all metrics by large margins and achieves new state-of-the-art performances for multi-turn response selection.

Supplementary Material

MP4 File (3340531.3412330.mp4)
In this paper, we study the problem of employing pre-trained language models for multi-turn response selection in retrieval-based chatbots. A new model, named Speaker-Aware BERT (SA-BERT), is proposed. We come to the conclusion that speaker change is an important and intrinsic property of multi-turn dialogues, which should be modeled in PLMs. In addition to general knowledge, specific in-domain knowledge is also important for response selection in retrieval-based chatbots.\r\n

References

[1]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of NAACL-HLT. 4171--4186.
[2]
Jianxiong Dong and Jim Huang. 2018. Enhance word representation for out-of-vocabulary on Ubuntu dialogue corpus. CoRR, Vol. abs/1802.02614 (2018).
[3]
Jia-Chen Gu, Zhen-Hua Ling, and Quan Liu. 2019. Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. In Proceedings of the 28th ACM International Conference of CIKM 2019. 2321--2324.
[4]
Matthew Henderson, Ivan Vulic, Daniela Gerz, I n igo Casanueva, Pawel Budzianowski, Sam Coope, Georgios Spithourakis, Tsung-Hsien Wen, Nikola Mrksic, and Pei-Hao Su. 2019. Training Neural Response Selection for Task-Oriented Dialogue Systems. In Proceedings of the 57th Conference of ACL 2019. 5392--5404.
[5]
Seokhwan Kim, Michel Galley, Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, et almbox. 2019. The Eighth Dialog System Technology Challenge. arXiv preprint arXiv:1911.06394 (2019).
[6]
Ryan Lowe, Nissan Pow, Iulian Serban, and Joelle Pineau. 2015. The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems. In Proceedings of the SIGDIAL 2015 Conference. 285--294.
[7]
Ryan Thomas Lowe, Nissan Pow, Iulian Vlad Serban, Laurent Charlin, Chia-Wei Liu, and Joelle Pineau. 2017. Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus. (2017), 31--65.
[8]
Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, and Rui Yan. 2019 a. Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots. In Proceedings of the Twelfth ACM International Conference of WSDM 2019. 267--275.
[9]
Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, and Rui Yan. 2019 b. One Time of Interaction May Not Be Enough: Go Deep with an Interaction-over-Interaction Network for Response Selection in Dialogues. In Proceedings of the 57th Conference of ACL 2019. 1--11.
[10]
Yu Wu, Wei Wu, Chen Xing, Ming Zhou, and Zhoujun Li. 2017. Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots. In Proceedings of the 55th Conference of ACL 2017. 496--505.
[11]
Chunyuan Yuan, Wei Zhou, Mingming Li, Shangwen Lv, Fuqing Zhu, Jizhong Han, and Songlin Hu. 2019. Multi-hop Selector Network for Multi-turn Response Selection in Retrieval-based Chatbots. In Proceedings of the 2019 Conference of EMNLP-IJCNLP. 111--120.
[12]
Zhuosheng Zhang, Jiangtong Li, Pengfei Zhu, Hai Zhao, and Gongshen Liu. 2018. Modeling Multi-turn Conversation with Deep Utterance Aggregation. In Proceedings of the 27th International Conference of COLING 2018. 3740--3752.
[13]
Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dianhai Yu, and Hua Wu. 2018. Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network. In Proceedings of the 56th Conference of ACL 2018. 1118--1127.

Cited By

View all
  • (2025)The future of cognitive strategy-enhanced persuasive dialogue agents: new perspectives and trendsFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-024-40057-x19:5Online publication date: 1-May-2025
  • (2024)Research on Effective Information Extraction Techniques for Multi-Round Dialogues of Large-Scale Models in Deep Learning EnvironmentApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-35699:1Online publication date: 27-Nov-2024
  • (2024)Designing the Conversational Agent: Asking Follow-up Questions for Information ElicitationProceedings of the ACM on Human-Computer Interaction10.1145/36373208:CSCW1(1-30)Online publication date: 26-Apr-2024
  • Show More Cited By

Index Terms

  1. Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
    October 2020
    3619 pages
    ISBN:9781450368599
    DOI:10.1145/3340531
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 October 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. multi-turn response selection
    2. retrieval-based chatbot
    3. speaker-aware bert

    Qualifiers

    • Short-paper

    Conference

    CIKM '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)70
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 11 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)The future of cognitive strategy-enhanced persuasive dialogue agents: new perspectives and trendsFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-024-40057-x19:5Online publication date: 1-May-2025
    • (2024)Research on Effective Information Extraction Techniques for Multi-Round Dialogues of Large-Scale Models in Deep Learning EnvironmentApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-35699:1Online publication date: 27-Nov-2024
    • (2024)Designing the Conversational Agent: Asking Follow-up Questions for Information ElicitationProceedings of the ACM on Human-Computer Interaction10.1145/36373208:CSCW1(1-30)Online publication date: 26-Apr-2024
    • (2024)UniMPC: Towards a Unified Framework for Multi-Party ConversationsProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679864(2639-2649)Online publication date: 21-Oct-2024
    • (2024)Channel-Aware Decoupling Network for Multiturn Dialog ComprehensionIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.322004735:6(7685-7696)Online publication date: Jun-2024
    • (2024)MPEG: A Multi-Perspective Enhanced Graph Attention Network for Causal Emotion Entailment in ConversationsIEEE Transactions on Affective Computing10.1109/TAFFC.2023.331575215:3(1004-1017)Online publication date: Jul-2024
    • (2024)TGAT-DGL: Triple Graph Attention Networks on Dual-Granularity Level for Multi-party Dialogue Reading Comprehension2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10651541(1-8)Online publication date: 30-Jun-2024
    • (2024)ProDepDet: Out-of-domain Knowledge Transfer of Pre-trained Large Language Models for Depression Detection in Text-Based Multi-Party Conversations2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650774(1-8)Online publication date: 30-Jun-2024
    • (2024)Exploring Label Hierarchy in Dialogue Intent ClassificationICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10448380(11511-11515)Online publication date: 14-Apr-2024
    • (2024)A survey of safety and trustworthiness of large language models through the lens of verification and validationArtificial Intelligence Review10.1007/s10462-024-10824-057:7Online publication date: 17-Jun-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media