Computer Science > Machine Learning

arXiv:2205.15171 (cs)

[Submitted on 30 May 2022 (v1), last revised 4 Jun 2023 (this version, v5)]

Title:Modular and On-demand Bias Mitigation with Attribute-Removal Subnetworks

Authors:Lukas Hauzenberger, Shahed Masoudian, Deepak Kumar, Markus Schedl, Navid Rekabsaz

View PDF

Abstract:Societal biases are reflected in large pre-trained language models and their fine-tuned versions on downstream tasks. Common in-processing bias mitigation approaches, such as adversarial training and mutual information removal, introduce additional optimization criteria, and update the model to reach a new debiased state. However, in practice, end-users and practitioners might prefer to switch back to the original model, or apply debiasing only on a specific subset of protected attributes. To enable this, we propose a novel modular bias mitigation approach, consisting of stand-alone highly sparse debiasing subnetworks, where each debiasing module can be integrated into the core model on-demand at inference time. Our approach draws from the concept of \emph{diff} pruning, and proposes a novel training regime adaptable to various representation disentanglement optimizations. We conduct experiments on three classification tasks with gender, race, and age as protected attributes. The results show that our modular approach, while maintaining task performance, improves (or at least remains on-par with) the effectiveness of bias mitigation in comparison with baseline finetuning. Particularly on a two-attribute dataset, our approach with separately learned debiasing subnetworks shows effective utilization of either or both the subnetworks for selective bias mitigation.

Comments:	Accepted in Findings of ACL 2023
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computers and Society (cs.CY)
Cite as:	arXiv:2205.15171 [cs.LG]
	(or arXiv:2205.15171v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.15171

Submission history

From: Lukas Hauzenberger [view email]
[v1] Mon, 30 May 2022 15:21:25 UTC (463 KB)
[v2] Fri, 29 Jul 2022 11:31:44 UTC (6,413 KB)
[v3] Wed, 3 May 2023 04:54:51 UTC (12,604 KB)
[v4] Thu, 4 May 2023 13:29:47 UTC (12,604 KB)
[v5] Sun, 4 Jun 2023 14:40:45 UTC (12,604 KB)

Computer Science > Machine Learning

Title:Modular and On-demand Bias Mitigation with Attribute-Removal Subnetworks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Modular and On-demand Bias Mitigation with Attribute-Removal Subnetworks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators