Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Levy, Shahar; Lazar, Koren; Stanovsky, Gabriel

Computer Science > Computation and Language

arXiv:2109.03858 (cs)

[Submitted on 8 Sep 2021 (v1), last revised 10 Sep 2021 (this version, v2)]

Title:Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Authors:Shahar Levy, Koren Lazar, Gabriel Stanovsky

View PDF

Abstract:Recent works have found evidence of gender bias in models of machine translation and coreference resolution using mostly synthetic diagnostic datasets. While these quantify bias in a controlled experiment, they often do so on a small scale and consist mostly of artificial, out-of-distribution sentences. In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gender-role assignments (e.g., female nurses versus male dancers) in corpora from three domains, resulting in a first large-scale gender bias dataset of 108K diverse real-world English sentences. We manually verify the quality of our corpus and use it to evaluate gender bias in various coreference resolution and machine translation models. We find that all tested models tend to over-rely on gender stereotypes when presented with natural inputs, which may be especially harmful when deployed in commercial systems. Finally, we show that our dataset lends itself to finetuning a coreference resolution model, finding it mitigates bias on a held out set. Our dataset and models are publicly available at this http URL. We hope they will spur future research into gender bias evaluation mitigation techniques in realistic settings.

Comments:	Accepted to Findings of EMNLP 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.03858 [cs.CL]
	(or arXiv:2109.03858v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.03858

Submission history

From: Gabriel Stanovsky [view email]
[v1] Wed, 8 Sep 2021 18:14:11 UTC (437 KB)
[v2] Fri, 10 Sep 2021 06:20:17 UTC (437 KB)

Computer Science > Computation and Language

Title:Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators