Computer Science > Computation and Language

arXiv:2307.07047 (cs)

[Submitted on 13 Jul 2023 (v1), last revised 20 Feb 2024 (this version, v2)]

Title:Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?

Authors:Bo-Ru Lu, Nikita Haduong, Chia-Hsuan Lee, Zeqiu Wu, Hao Cheng, Paul Koester, Jean Utke, Tao Yu, Noah A. Smith, Mari Ostendorf

View PDF HTML (experimental)

Abstract:The capabilities of pretrained language models have opened opportunities to explore new application areas, but applications involving human-human interaction are limited by the fact that most data is protected from public release for privacy reasons. Problem-solving human dialogues in real applications can be much more complex than existing Wizard-of-Oz collections, preventing successful domain transfer. To support information extraction (IE) for a private call center dataset, we introduce a human-in-the-loop dialogue generation framework capable of synthesizing realistic dialogues. In IE experiments with auto insurance call center dialogues, we observe 25\% relative improvement in $F_1$ after augmenting a small set of real human conversations with synthetic data. We release code and our synthetic dataset to illustrate the complexity of real-world call center conversations and encourage development of complex dialogue datasets that are more representative of natural data.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2307.07047 [cs.CL]
	(or arXiv:2307.07047v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2307.07047

Submission history

From: Bo-Ru Lu [view email]
[v1] Thu, 13 Jul 2023 20:02:50 UTC (7,592 KB)
[v2] Tue, 20 Feb 2024 06:12:39 UTC (7,588 KB)

Computer Science > Computation and Language

Title:Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators