Computer Science > Computation and Language

arXiv:2210.07469 (cs)

[Submitted on 14 Oct 2022 (v1), last revised 14 Apr 2023 (this version, v2)]

Title:StyLEx: Explaining Style Using Human Lexical Annotations

Authors:Shirley Anugrah Hayati, Kyumin Park, Dheeraj Rajagopal, Lyle Ungar, Dongyeop Kang

View PDF

Abstract:Large pre-trained language models have achieved impressive results on various style classification tasks, but they often learn spurious domain-specific words to make predictions (Hayati et al., 2021). While human explanation highlights stylistic tokens as important features for this task, we observe that model explanations often do not align with them. To tackle this issue, we introduce StyLEx, a model that learns from human-annotated explanations of stylistic features and jointly learns to perform the task and predict these features as model explanations. Our experiments show that StyLEx can provide human-like stylistic lexical explanations without sacrificing the performance of sentence-level style prediction on both in-domain and out-of-domain datasets. Explanations from StyLEx show significant improvements in explanation metrics (sufficiency, plausibility) and when evaluated with human annotations. They are also more understandable by human judges compared to the widely-used saliency-based explanation baseline.

Comments:	EACL 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.07469 [cs.CL]
	(or arXiv:2210.07469v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.07469

Submission history

From: Shirley Anugrah Hayati [view email]
[v1] Fri, 14 Oct 2022 02:35:47 UTC (694 KB)
[v2] Fri, 14 Apr 2023 17:06:50 UTC (1,135 KB)

Computer Science > Computation and Language

Title:StyLEx: Explaining Style Using Human Lexical Annotations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:StyLEx: Explaining Style Using Human Lexical Annotations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators