Computer Science > Computation and Language

arXiv:2311.02271 (cs)

[Submitted on 3 Nov 2023 (v1), last revised 8 Nov 2023 (this version, v2)]

Title: FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization

Authors: Nan Zhang, Yusen Zhang, Wu Guo, Prasenjit Mitra, Rui Zhang

Abstract: Summaries of medical text should be faithful, meaning consistent with and factually supported by their source inputs. Faithfulness is an important but understudied topic for safety and efficiency in healthcare. In this paper, we investigate and improve faithfulness in summarization on a broad range of medical summarization tasks. Our investigation reveals that current summarization models often produce unfaithful outputs for medical input text. We then introduce FaMeSumm, a framework to improve faithfulness by fine-tuning pre-trained language models based on medical knowledge. FaMeSumm performs contrastive learning on designed sets of faithful and unfaithful summaries, and it incorporates medical terms and their contexts to encourage faithful generation of medical terms. We conduct comprehensive experiments on three datasets in two languages: health question and radiology report summarization datasets in English, and a patient-doctor dialogue dataset in Chinese. Results demonstrate that FaMeSumm is flexible and effective, delivering consistent improvements over mainstream language models such as BART, T5, mT5, and PEGASUS and yielding state-of-the-art performance on metrics for faithfulness and general quality. Human evaluation by doctors also shows that FaMeSumm generates more faithful outputs. Our code is available at this https URL.
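
The contrastive learning step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the loss that FaMeSumm uses, so the InfoNCE-style objective below is an assumption chosen for illustration, and the names `cosine`, `contrastive_loss`, and `temperature` are hypothetical. The sketch treats summaries as plain embedding vectors and rewards an anchor for being closer to faithful summaries than to unfaithful ones.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(anchor, positives, negatives, temperature=1.0):
    """InfoNCE-style loss (an illustrative assumption, not the paper's loss).

    anchor:    embedding of a reference summary
    positives: embeddings of faithful summaries
    negatives: embeddings of unfaithful summaries
    For each positive p, the term is
        -log( exp(s_p / t) / (exp(s_p / t) + sum_n exp(s_n / t)) )
    and the result is the mean over all positives.
    """
    pos_scores = [cosine(anchor, p) / temperature for p in positives]
    neg_scores = [cosine(anchor, n) / temperature for n in negatives]
    neg_sum = sum(math.exp(s) for s in neg_scores)
    terms = [
        -math.log(math.exp(s) / (math.exp(s) + neg_sum)) for s in pos_scores
    ]
    return sum(terms) / len(terms)


# Toy 2-d embeddings: the faithful summary aligns with the anchor,
# the unfaithful one is orthogonal to it.
anchor = [1.0, 0.0]
faithful = [[1.0, 0.0]]
unfaithful = [[0.0, 1.0]]
loss_good = contrastive_loss(anchor, faithful, unfaithful)
loss_bad = contrastive_loss(anchor, unfaithful, faithful)
```

In this toy setup `loss_good` is smaller than `loss_bad`, which is the behavior a contrastive objective is meant to produce: the loss drops as faithful summaries move closer to the anchor than unfaithful ones. In a real system the vectors would come from the summarization model's hidden states rather than hand-written lists.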

Comments:	Main Conference of EMNLP 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.02271 [cs.CL]
	(or arXiv:2311.02271v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.02271

Submission history

From: Nan Zhang
[v1] Fri, 3 Nov 2023 23:25:53 UTC (8,046 KB)
[v2] Wed, 8 Nov 2023 22:54:33 UTC (8,046 KB)
