Computer Science > Computation and Language

arXiv:2210.06408 (cs)

[Submitted on 12 Oct 2022]

Title: PriMeSRL-Eval: A Practical Quality Metric for Semantic Role Labeling Systems Evaluation

Authors: Ishan Jindal, Alexandre Rademaker, Khoi-Nguyen Tran, Huaiyu Zhu, Hiroshi Kanayama, Marina Danilevsky, Yunyao Li

Abstract: Semantic role labeling (SRL) identifies the predicate-argument structure in a sentence. This task is usually accomplished in four steps: predicate identification, predicate sense disambiguation, argument identification, and argument classification. Errors introduced at one step propagate to later steps. Unfortunately, the existing SRL evaluation scripts do not consider the full effect of this error propagation. They either evaluate arguments independently of predicate sense (CoNLL09) or do not evaluate predicate sense at all (CoNLL05), yielding an inaccurate picture of SRL model performance on the argument classification task. In this paper, we address key practical issues with existing evaluation scripts and propose a stricter SRL evaluation metric, PriMeSRL. We observe that under PriMeSRL, the measured quality of all state-of-the-art (SoTA) SRL models drops significantly, and their relative rankings also change. We also show that PriMeSRL successfully penalizes actual failures in SoTA SRL models.
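
The effect of scoring arguments with or without regard to predicate sense can be illustrated with a minimal sketch. This is not the PriMeSRL scoring script, nor the CoNLL05 or CoNLL09 scorers; the function name `argument_f1`, the frame layout, and the toy sense and role labels are all assumptions made up for illustration. It only shows how one wrong predicate sense changes an argument F1 score once arguments are credited only under a correctly disambiguated predicate.

```python
from typing import Dict, Set, Tuple

# Hypothetical layout: predicate token index -> (sense label, {(argument index, role)})
Frame = Tuple[str, Set[Tuple[int, str]]]


def argument_f1(gold: Dict[int, Frame], pred: Dict[int, Frame],
                require_sense: bool) -> float:
    """Argument F1; if require_sense is True, arguments of a predicate
    whose predicted sense is wrong earn no credit (illustrative rule)."""
    n_gold = sum(len(args) for _, args in gold.values())
    n_pred = sum(len(args) for _, args in pred.values())
    correct = 0
    for idx, (p_sense, p_args) in pred.items():
        if idx not in gold:
            continue  # predicate identification error: nothing to match
        g_sense, g_args = gold[idx]
        if require_sense and p_sense != g_sense:
            continue  # sense error propagates to every argument of this predicate
        correct += len(p_args & g_args)
    if correct == 0:
        return 0.0
    precision = correct / n_pred
    recall = correct / n_gold
    return 2 * precision * recall / (precision + recall)


# Toy data (invented): the first predicate has the wrong sense but correct
# arguments; the second has the correct sense and one wrong role label.
gold = {1: ("run.01", {(0, "A0"), (3, "A1")}),
        5: ("take.01", {(4, "A0"), (6, "A1")})}
pred = {1: ("run.02", {(0, "A0"), (3, "A1")}),
        5: ("take.01", {(4, "A0"), (6, "A2")})}

lenient = argument_f1(gold, pred, require_sense=False)  # sense ignored
strict = argument_f1(gold, pred, require_sense=True)    # sense required
print(f"sense ignored: {lenient:.2f}, sense required: {strict:.2f}")
```

On this toy input, the sense-agnostic score credits three of the four predicted arguments, while the sense-aware score credits only one, because both arguments of the mis-disambiguated predicate are discarded.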

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.06408 [cs.CL]
	(or arXiv:2210.06408v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.06408

Submission history

From: Ishan Jindal
[v1] Wed, 12 Oct 2022 17:04:28 UTC (13,103 KB)
