Learning a Cost-Effective Annotation Policy for Question Answering

Kratzwald, Bernhard; Feuerriegel, Stefan; Sun, Huan

arXiv:2010.03476 (cs)

[Submitted on 7 Oct 2020 (v1), last revised 8 Nov 2020 (this version, v2)]

Abstract: State-of-the-art question answering (QA) relies upon large amounts of training data, for which labeling is time-consuming and thus expensive. For this reason, customizing QA systems is challenging. As a remedy, we propose a novel framework for annotating QA datasets that entails learning a cost-effective annotation policy and a semi-supervised annotation scheme. The latter reduces human effort: it leverages the underlying QA system to suggest potential candidate annotations. Human annotators then simply provide binary feedback on these candidates. Our system is designed such that past annotations continuously improve future performance and thus reduce the overall annotation cost. To the best of our knowledge, this is the first paper to address the problem of annotating questions with minimal annotation cost. We compare our framework against traditional manual annotations in an extensive set of experiments. We find that our approach can reduce the annotation cost by up to 21.1%.

Comments:	Accepted at EMNLP 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2010.03476 [cs.CL]
	(or arXiv:2010.03476v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.03476

Submission history

From: Bernhard Kratzwald
[v1] Wed, 7 Oct 2020 15:25:41 UTC (1,036 KB)
[v2] Sun, 8 Nov 2020 20:20:49 UTC (1,039 KB)
